Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
55
Mex Ivanov
MexIvanov
Follow
21world's profile picture
evilfreelancer's profile picture
2 followers
Ā·
8 following
MexIvanov
AI & ML interests
NLP, Coding, Quantum Computing and more.
Recent Activity
reacted
to
singhsidhukuldeep
's
post
with š„
1 day ago
Exciting News in AI: JinaAI Releases JINA-CLIP-v2! The team at Jina AI has just released a groundbreaking multilingual multimodal embedding model that's pushing the boundaries of text-image understanding. Here's why this is a big deal: š Technical Highlights: - Dual encoder architecture combining a 561M parameter Jina XLM-RoBERTa text encoder and a 304M parameter EVA02-L14 vision encoder - Supports 89 languages with 8,192 token context length - Processes images up to 512Ć512 pixels with 14Ć14 patch size - Implements FlashAttention2 for text and xFormers for vision processing - Uses Matryoshka Representation Learning for efficient vector storage ā”ļø Under The Hood: - Multi-stage training process with progressive resolution scaling (224ā384ā512) - Contrastive learning using InfoNCE loss in both directions - Trained on massive multilingual dataset including 400M English and 400M multilingual image-caption pairs - Incorporates specialized datasets for document understanding, scientific graphs, and infographics - Uses hard negative mining with 7 negatives per positive sample š Performance: - Outperforms previous models on visual document retrieval (52.65% nDCG@5) - Achieves 89.73% image-to-text and 79.09% text-to-image retrieval on CLIP benchmark - Strong multilingual performance across 30 languages - Maintains performance even with 75% dimension reduction (256D vs 1024D) šÆ Key Innovation: The model solves the long-standing challenge of unifying text-only and multi-modal retrieval systems while adding robust multilingual support. Perfect for building cross-lingual visual search systems! Kudos to the research team at Jina AI for this impressive advancement in multimodal AI!
reacted
to
singhsidhukuldeep
's
post
with š
3 days ago
Exciting breakthrough in AI: @Meta's new Byte Latent Transformer (BLT) revolutionizes language models by eliminating tokenization! The BLT architecture introduces a groundbreaking approach that processes raw bytes instead of tokens, achieving state-of-the-art performance while being more efficient and robust. Here's what makes it special: >> Key Innovations Dynamic Patching: BLT groups bytes into variable-sized patches based on entropy, allocating more compute power where the data is more complex. This results in up to 50% fewer FLOPs during inference compared to traditional token-based models. Three-Component Architecture: ā¢ Lightweight Local Encoder that converts bytes to patch representations ā¢ Powerful Global Latent Transformer that processes patches ā¢ Local Decoder that converts patches back to bytes >> Technical Advantages ā¢ Matches performance of Llama 3 at 8B parameters while being more efficient ā¢ Superior handling of non-English languages and rare character sequences ā¢ Remarkable 99.9% accuracy on spelling tasks ā¢ Better scaling properties than token-based models >> Under the Hood The system uses an entropy model to determine patch boundaries, cross-attention mechanisms for information flow, and hash n-gram embeddings for improved representation. The architecture allows simultaneous scaling of both patch and model size while maintaining fixed inference costs. This is a game-changer for multilingual AI and could reshape how we build future language models. Excited to see how this technology evolves!
liked
a model
8 days ago
CohereForAI/c4ai-command-r7b-12-2024
View all activity
Organizations
None yet
models
6
Sort:Ā Recently updated
MexIvanov/MistRAG-7B-ruen-v1-merged
Text Generation
ā¢
Updated
about 1 month ago
ā¢
22
MexIvanov/MistRAG-7B-ruen-v1
Text Generation
ā¢
Updated
about 1 month ago
MexIvanov/MistRAG-7B-ruen-v1-gguf
Text Generation
ā¢
Updated
about 1 month ago
ā¢
63
MexIvanov/zephyr-python-ru
Text Generation
ā¢
Updated
Nov 11
ā¢
2
MexIvanov/zephyr-python-ru-merged
Text Generation
ā¢
Updated
Nov 11
ā¢
430
MexIvanov/zephyr-python-ru-gguf
Text Generation
ā¢
Updated
Nov 11
ā¢
36
ā¢
4
datasets
4
Sort:Ā Recently updated
MexIvanov/RAG-v1-ruen
Viewer
ā¢
Updated
Nov 11
ā¢
51.4k
ā¢
41
MexIvanov/image-gen-vector-consistency
Viewer
ā¢
Updated
Aug 30
ā¢
184
ā¢
43
MexIvanov/CodeExercise-Python-27k-ru
Viewer
ā¢
Updated
Dec 19, 2023
ā¢
27.2k
ā¢
61
ā¢
1
MexIvanov/Vezora-Tested-22k-Python-Alpaca-ru
Viewer
ā¢
Updated
Dec 19, 2023
ā¢
22.6k
ā¢
59
ā¢
2