zhanglu's picture

22

zhanglu

zhanglu

·

AI & ML interests

None yet

Recent Activity

authored a paper 21 days ago

OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

liked a model 7 months ago

lmms-lab/llama3-llava-next-8b

liked a model 7 months ago

tennant/llava-llama-3-8b-hqedit

View all activity

Organizations

None yet

zhanglu's activity

authored a paper 21 days ago

OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

Paper • 2406.16620 • Published Jun 24, 2024 • 2

liked 3 models 7 months ago

lmms-lab/llama3-llava-next-8b

Text Generation • Updated Aug 17, 2024 • 25.7k • 92

tennant/llava-llama-3-8b-hqedit

Text Generation • Updated Apr 29, 2024 • 15 • 17

iampanda/zpoint_large_embedding_zh

Updated Jul 22, 2024 • 2.59k • 48

reacted to visheratin's post with ❤️ 8 months ago

Post

Keep stacking cool stuff and getting better results! After I changed the standard vision encoder to SigLIP, NLLB-CLIP got a 10% average performance improvement. And now, I added matryoshka layers (https://arxiv.org/abs/2205.13147) to enable smaller embeddings and got another 6% performance boost! Plus, thanks to MRL, 4.5x smaller embeddings retain 90%+ quality.

The large model is finally SoTA for both image and text multilingual retrieval!

The models are available on the hub:
- visheratin/nllb-siglip-mrl-base
- visheratin/nllb-siglip-mrl-large

2 replies

·

liked 2 Spaces 9 months ago

Multilingual Zero Shot Image Clf

Comparing powerful multilingual zero-shot image clf models

Draw To Search Art

Draw/upload image and search among WikiART using SigLIP

liked a model 11 months ago

timm/ViT-SO400M-14-SigLIP-384

Zero-Shot Image Classification • Updated Oct 27, 2023 • 168k • 79

liked a Space about 1 year ago

MTEB Leaderboard

Select and filter benchmarks for text embedding tasks

updated a collection about 1 year ago

multi-modal

1 item • Updated Feb 2, 2024

liked 7 models about 1 year ago

llmrails/ember-v1

Feature Extraction • Updated Aug 21, 2024 • 80.3k • 61

thenlper/gte-large

Sentence Similarity • Updated Nov 15, 2024 • 524k • • 266

BAAI/bge-base-en-v1.5

Feature Extraction • Updated Feb 21, 2024 • 1.61M • • 275

BAAI/bge-large-en-v1.5

Feature Extraction • Updated Feb 21, 2024 • 2.23M • • 486

WhereIsAI/UAE-Large-V1

Feature Extraction • Updated Dec 31, 2024 • 1.14M • • 217

intfloat/e5-mistral-7b-instruct

Feature Extraction • Updated Apr 23, 2024 • 198k • 487

meta-llama/Llama-2-7b-chat-hf

Text Generation • Updated Apr 17, 2024 • 1.31M • • 4.21k

liked 2 models over 1 year ago

DAMO-NLP-MT/polylm-13b

Text Generation • Updated Aug 10, 2023 • 1.71k • 53

sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

Sentence Similarity • Updated Nov 5, 2024 • 9.44M • • 780