microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 23 hours ago • 441k • 1.12k
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP models more SOTA than ever. • 11 items • Updated 1 day ago • 55