SmolVLM: Redefining small and efficient multimodal models
Paper: arXiv:2504.05299
Local LLMs
@image: Generates an image using Stable Diffusion XL.
@3d: Generates a 3D mesh.
@web: Web search agents.
@rAgent: Initiates a reasoning chain using Llama mode for coding explanations.
@tts1-♀, @tts2-♂: Voice generation (female and male voices).
@yolo: Object detection.
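
The commands above are prefixes placed at the start of a chat message. How the app routes them internally is not shown here, so the following is only a minimal sketch of one way such prefix routing could work. The `COMMANDS` table and the `route` function are hypothetical names introduced for illustration; only the command strings themselves come from the list above.

```python
from typing import Optional, Tuple

# Hypothetical lookup table: keys are the command prefixes listed above,
# values are short descriptions. This is NOT the app's actual code.
COMMANDS = {
    "@image": "image generation (Stable Diffusion XL)",
    "@3d": "3D mesh generation",
    "@web": "web search agent",
    "@rAgent": "reasoning chain for coding explanations",
    "@tts1-♀": "voice generation (female voice)",
    "@tts2-♂": "voice generation (male voice)",
    "@yolo": "object detection",
}


def route(message: str) -> Tuple[Optional[str], str]:
    """Split a chat message into (command, prompt).

    Returns (None, message) when the message does not start with a
    known command prefix, so it can fall through to plain chat.
    """
    text = message.strip()
    # The first whitespace-delimited token is the candidate command.
    head, _, rest = text.partition(" ")
    if head in COMMANDS:
        return head, rest.strip()
    return None, text
```

For example, `route("@image a red fox")` yields the command `@image` with the prompt `a red fox`, while a message without a known prefix comes back unchanged with `None` as the command. Matching is exact and case-sensitive in this sketch, so `@ragent` would not match `@rAgent`.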