AI & ML interests
VLM, Geo Reasoning, Visual Retriving
Recent Activity
Organization Card
TLV R&D VLMs for Image Retrieving and Visual Reasoning
Vision-Language Retrieving Models
| Model Name | Model Type | Base Model | Training Set | Owner | Link | Freezed Parameters |
|---|---|---|---|---|---|---|
| ImiClip | CLIP | openai/clip-vit-base-patch32 | DM | Etzion | TLVLM/ImiClip | Vision Encoder |
| ImiClip_v2 | CLIP | openai/clip-vit-base-patch32 | DM + RSICD | Etzion | TLVLM/ImiClip_v2 | Vision Encoder |
| ImiClip_v3 | CLIP | openai/clip-vit-base-patch32 | DM + RSICD | Etzion | TLVLM/ImiClip_v3 | ❌ |
| ImiGlip | SigLIP | google/siglip-so400m-patch14-384 | DM | Etzion | TLVLM/ImiGlip | Vision Encoder |
| ImiGlip_V2 | SigLIP | google/siglip-so400m-patch14-384 | DM + RSICD | Etzion | TLVLM/ImiGlip_V2 | Vision Encoder |
| ImiGlip_V3 | SigLIP | google/siglip-so400m-patch14-384 | DM + RSICD | Etzion | TLVLM/ImiGlip_V3 | ❌ |
| ImiGlip2 | SigLIP2 | google/siglip2-so400m-patch14-384 | DM + RSICD | Etzion | TLVLM/ImiGlip2 | Both Encoders + Logits |
| ImiGlip2n | SigLIP2 | google/siglip2-so400m-patch16-naflex | DM + RSICD | Etzion | TLVLM/ImiGlip2n | Both Encoders + Logits |
Collections
Here you can find the model Collections
- CLIP based finetuned models: TLVLM/clips
- SigLIP based finetuned models: TLVLM/siglips
- SigLIP 2 based finetuned models: TLVLM/siglips2
models 0
None public yet
datasets 0
None public yet