5 39

Mohamed Salama PRO

Salama1429

salama1429

AI & ML interests

NLP

Recent Activity

liked a model about 1 month ago

NAMAA-Space/AraModernBert-Base-V1.0

liked a model 2 months ago

nomic-ai/nomic-embed-text-v2-moe

updated a model 3 months ago

Salama1429/xlm-roberta_punctuation_fullstop_truecase

View all activity

Organizations

Salama1429's activity

liked a model about 1 month ago

NAMAA-Space/AraModernBert-Base-V1.0

Fill-Mask • Updated Mar 3 • 531 • 7

liked a model 2 months ago

nomic-ai/nomic-embed-text-v2-moe

updated a model 3 months ago

Salama1429/xlm-roberta_punctuation_fullstop_truecase

Text2Text Generation • Updated Feb 2 • 8

published a model 3 months ago

Salama1429/xlm-roberta_punctuation_fullstop_truecase

Text2Text Generation • Updated Feb 2 • 8

liked a Space 4 months ago

536

Open Source Ai Year In Review 2024

😻

What happened in open-source AI this year, and what’s next?

liked a dataset 5 months ago

UBC-NLP/Casablanca

Viewer • Updated Nov 14, 2024 • 13.6k • 1.13k • 15

liked a model 5 months ago

minishlab/M2V_multilingual_output

Updated Jan 21 • 1.32k • 18

New activity in Salama1429/tarteel-ai-everyayah-Quran 6 months ago

Test missing

#2 opened about 1 year ago by

HadiSDev

updated a dataset 6 months ago

Salama1429/tarteel-ai-everyayah-Quran

Viewer • Updated Nov 2, 2024 • 90k • 1.27k • 11

liked a model 7 months ago

laion/clap-htsat-unfused

Feature Extraction • Updated Apr 24, 2023 • 90k • 51

liked a model 8 months ago

Alibaba-NLP/gte-multilingual-base

liked 4 models 9 months ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27, 2024 • 373k • • 1.06k

HuggingFaceFW/fineweb-edu-classifier

Text Classification • Updated Nov 17, 2024 • 262k • 175

Snowflake/snowflake-arctic-embed-m-v1.5

google/madlad400-3b-mt

Translation • Updated Nov 27, 2023 • 8.26k • 119

liked a Space 10 months ago

5.46k

MTEB Leaderboard

🥇

Embedding Leaderboard

liked a dataset 10 months ago

Flux9665/BibleMMS

Viewer • Updated Jun 16, 2024 • 736k • 433 • 66

liked a Space 10 months ago

188

MassivelyMultilingualTTS

🌍

Convert text to speech in multiple languages

reacted to their post with 👀🤝 11 months ago

Post

2562

📺 Introducing the YouTube-Commons Dataset 📺

🌐 Overview: The YouTube Commons Dataset is a comprehensive collection of 30 billion words from 15,112,121 original and automatically translated transcripts, drawn from 2,063,066 videos on YouTube.

🔗 License: All videos are shared under the CC-BY license, with the majority (71%) in English.

🤖 Applications: This dataset is ideal for training powerful AI models for converting speech to text (ASR) and translation models.

📊 Utilization: The text can be used for model training and is republishable for reproducibility purposes.

🤝 Collaboration: This dataset is the result of a collaboration between state start-up LANGU:IA, the French Ministry of Culture, and DINUM. It will be expanded in the coming months.

🔗 Explore the dataset here: https://lnkd.in/d_paWKFE

#YouTubeCommons #AIResearch #MachineLearning #OpenData #ArtificialIntelligence #NLP #Dataset #TechCollaboration #Innovation #DigitalTransformation