Question about "bin files are updated to safetensors, Add chat_template":
so you did that for JANUS?
The following models can't be converted to GGUF by llama.cpp:
(tiny original)
https://huggingface.co/deepseek-ai/deepseek-vl2-tiny
and
(small original)
https://huggingface.co/deepseek-ai/deepseek-vl2-small/tree/main
Also V3:
https://huggingface.co/mlx-community/DeepSeek-V3-4bit/tree/main
Could you take a look?
I'm sorry, I'm not familiar with the GGUF format. My conversion method is very simple: I just use this script:
https://github.com/Silver267/pytorch-to-safetensor-converter/blob/main/convert_to_safetensor.py
After that, I use the safetensors library to partially extract the language model.
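The partial extraction described above amounts to filtering the checkpoint's tensors by key prefix. As a minimal sketch (the `"language."` prefix and the helper name are assumptions, not taken from the author's code; inspect the real checkpoint keys before relying on them):

```python
# Hedged sketch: keep only the language-model tensors from a merged
# vision-language state dict. The "language." prefix is an assumption;
# check the actual tensor names in the checkpoint first.
def extract_language_model(state_dict, prefix="language."):
    """Return only entries whose key starts with `prefix`, with the
    prefix stripped so the result looks like a standalone LM."""
    return {k[len(prefix):]: v
            for k, v in state_dict.items()
            if k.startswith(prefix)}

# Toy demonstration: plain strings stand in for tensors so the
# sketch stays dependency-free.
ckpt = {
    "language.model.embed_tokens.weight": "emb",
    "language.lm_head.weight": "head",
    "vision.encoder.layer0.weight": "vis",
}
lm = extract_language_model(ckpt)
```

With real files you would load tensors via `safetensors.torch.load_file`, filter them like this, and write the result back with `safetensors.torch.save_file`.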
Compared to the other two models, DeepSeek-V3-4bit is different: it has been quantized with mlx. It seems it can't easily be converted to other formats, and it's best run only under the mlx framework.
I see...
But it seems your Janus conversion can at least produce a safetensors file.
Can you do the same for the models above?
Yes, but I found that deepseek-vl2-small and deepseek-vl2-tiny are themselves already in safetensors format.
Do you want to extract the language-model part of them?
To answer: yes, mlx is already special, a Mac (Apple) format.
Extract the language model, yes, but only if it's not too hard to get... ;)
I don't know much about it; maybe after extraction it's useful as an LLM.
GGUF is the most widely used format among home users running LLMs; llama.cpp can convert models from PyTorch, but only a handful of architectures are supported.
With GGUF you can easily use Jan or GPT4All to talk to these models.
So far, I'm not really sure what your ultimate needs are.
Do you want a Janus GGUF?
Or GGUF versions of deepseek-vl2-tiny and deepseek-vl2-small?
GGUF is something I'm not good at, and for the Janus architecture additional code is needed to support inference. I won't be working on GGUF for a while.
I see... all fine.
Just extract the language-model part of DeepSeek tiny and small. It could be that both are the same and only the vision part differs...
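Whether the two checkpoints really share the same language model can be checked cheaply by comparing tensor names and shapes rather than loading full weights (safetensors' `safe_open` exposes this metadata). A dependency-free sketch of just the comparison, again assuming a `"language."` key prefix:

```python
# Hedged sketch: compare the language-model parts of two checkpoints
# by tensor name and shape only. The "language." prefix is an assumption.
def language_signature(shapes, prefix="language."):
    """Map of tensor name -> shape, restricted to the language model."""
    return {k: tuple(v) for k, v in shapes.items() if k.startswith(prefix)}

def same_language_part(shapes_a, shapes_b, prefix="language."):
    return language_signature(shapes_a, prefix) == language_signature(shapes_b, prefix)

# Toy shape tables standing in for what safetensors metadata would give;
# the sizes here are made up for illustration.
tiny  = {"language.layers.0.weight": (1024, 1024), "vision.patch_embed": (384, 3)}
small = {"language.layers.0.weight": (2048, 2048), "vision.patch_embed": (384, 3)}
```

If the signatures match, the language parts are structurally identical and one extracted LM would cover both; if not (as in the toy data above), the two models differ in more than just the vision part.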