Ali El Filali's picture

Ali El Filali

alielfilali01

·

AI & ML interests

AI Psychometrician ? | NLP (mainly for Arabic) | Other interests include Reinforcement Learning and Cognitive sciences among others

Recent Activity

updated a dataset 1 day ago

OALL/requests

liked a dataset 2 days ago

atlasia/TerjamaBench

updated a dataset 2 days ago

inceptionai/requests-dataset

View all activity

Articles

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

Introducing the Open Arabic LLM Leaderboard

Organizations

alielfilali01's activity

upvoted a paper 4 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 4 days ago • 72

upvoted a paper 9 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 10 days ago • 45

upvoted a paper 11 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 74

upvoted a collection 14 days ago

Deepseek Papers

Deepseek papers collection • 14 items • Updated 14 days ago • 9

upvoted 2 papers 14 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 17 days ago • 20

Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation

Paper • 2412.15255 • Published 28 days ago • 3

upvoted a paper 24 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 24 days ago • 339

upvoted a collection 24 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 5 days ago • 78

upvoted a collection 27 days ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 6 items • Updated about 1 month ago • 9

upvoted a paper 28 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 101

upvoted 4 collections about 1 month ago

🧪 FineWeb v1 data experiments

Ablation models trained for our data experiments. • 22 items • Updated Jun 12, 2024 • 4

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 35

AraDICE

AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs • 12 items • Updated about 1 month ago • 4

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated about 1 month ago • 126

upvoted 2 articles about 1 month ago

Article

Rethinking Backpropagation: Thoughts on What's Wrong with Backpropagation

By

•

Dec 2, 2024

• 5

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

By

•

Dec 8, 2024

• 21

upvoted a collection about 1 month ago

🥂 FineWeb2

3 items • Updated Dec 8, 2024 • 11

upvoted an article about 1 month ago

Article

Comparing Open-source and Proprietary LLMs in Medical AI

By

•

Oct 3, 2024

• 16

upvoted a paper about 2 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 40

upvoted an article about 2 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

By

•

Nov 21, 2024

• 35