I read the 456-page AI Index report so you don't have to (kidding). The wild part? While AI gets ridiculously more accessible, the power gap is actually widening:
1️⃣ The democratization of AI capabilities is accelerating rapidly:
- The gap between open and closed models has nearly closed: the difference on benchmarks like MMLU and HumanEval shrank to just 1.7% in 2024
- The cost to run GPT-3.5-level performance dropped 280x in two years
- Model size is shrinking while performance holds: Phi-3-mini hits 60%+ on MMLU with a fraction of the parameters of early models like PaLM
2️⃣ But we're seeing concerning divides deepen:
- Geographic: US private investment ($109B) dwarfs everyone else's - roughly 12x China's $9.3B
- Research concentration: the US and China dominate highly cited papers (50 and 34 respectively in 2023), while the next closest country has only 7
- Gender: major gaps in AI skill penetration rates - the US shows a 2.39 vs 1.71 male/female ratio
The tech is getting more accessible, but the benefits aren't being distributed evenly. Worth thinking about as these tools become more central to the economy.
See that purple banner on the Llama 4 models? It's Xet storage, and this is actually huge for anyone building with AI models. Let's geek out a little bit 🤓
The problem today: AI models are massive files, historically stored with Git LFS. But with models getting bigger and downloads exploding, we needed something better. Xet lets you version large files the way you version code, with chunk-level compression and deduplication, all Git-compatible. That means less bandwidth, faster sharing, and smoother collaboration.
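To build intuition for why dedupe helps, here's a toy sketch of chunk-based storage (my own simplification - Xet's real system uses content-defined chunking, not the fixed-size chunks below):

```python
import hashlib
import os

CHUNK_SIZE = 64 * 1024  # toy fixed-size chunks; real systems use a rolling
                        # hash so boundaries survive insertions/deletions

def chunk_hashes(data: bytes):
    """Split data into chunks and hash each one."""
    for i in range(0, len(data), CHUNK_SIZE):
        chunk = data[i:i + CHUNK_SIZE]
        yield hashlib.sha256(chunk).hexdigest(), chunk

def upload(data: bytes, store: dict) -> int:
    """'Upload' a file, skipping chunks the store already holds.
    Returns the number of bytes actually transferred."""
    transferred = 0
    for digest, chunk in chunk_hashes(data):
        if digest not in store:   # dedupe: only new chunks move
            store[digest] = chunk
            transferred += len(chunk)
    return transferred

store = {}
base = os.urandom(10_000_000)                        # stand-in for base weights
finetune = base[:7_500_000] + os.urandom(2_500_000)  # ~75% shared with base
print(upload(base, store))      # full 10 MB transferred
print(upload(finetune, store))  # only the changed ~25% moves
```

Upload a finetune that shares most of its bytes with the base model and only the changed chunks cross the wire - that's where the ~40% finetune numbers below come from.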
Real numbers: ~25% deduplication on Llama 4 models, hitting ~40% for finetunes.
Scale matters here - the Hub served 2B model downloads in 30 days, with Llama models alone accounting for 60M. The upcoming Llama 4 Behemoth has 2T parameters! Xet's chunk-based system was built exactly for this.
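Some back-of-the-envelope math (my own assumptions, not official figures) on what that scale means:

```python
# Rough, illustrative math - bf16 precision and 25% dedupe are assumptions
params = 2e12                 # Llama 4 Behemoth: 2T parameters
bytes_per_param = 2           # assuming bf16 weights
weights_tb = params * bytes_per_param / 1e12
print(f"~{weights_tb:.0f} TB of raw weights")                # ~4 TB

dedupe = 0.25                 # ~25% of chunks already stored
print(f"~{weights_tb * dedupe:.0f} TB saved at 25% dedupe")  # ~1 TB
```

At that size, shaving a quarter off every transfer is the difference between usable and painful.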
This is the kind of engineering that makes the next wave of large models actually usable. Kudos to the team! 🧨
Huge week for xet-team: Llama 4 is the first major model on Hugging Face uploaded with Xet as the backing storage! Every byte downloaded comes through our infrastructure.
Using Xet on Hugging Face is the fastest way to download and iterate on open source models, and we've proved it with Llama 4: a boost of ~25% across all the models in the family.
We expect builders on the Hub to see even more improvements, helping power innovation across the community.
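Trying it is a one-liner - here's a minimal sketch (assuming you've accepted the Llama 4 license and are logged in; any Xet-backed repo works the same way):

```python
# pip install -U huggingface_hub hf_xet
from huggingface_hub import snapshot_download

# With hf_xet installed, downloads of Xet-backed repos happen chunk by
# chunk, so bytes you already have locally aren't fetched again.
path = snapshot_download("meta-llama/Llama-4-Scout-17B-16E-Instruct")
print(path)  # local snapshot directory
```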
With the models on our infrastructure, we can peer in and see how well our dedupe performs across the Llama 4 family. On average, we're seeing ~25% dedupe, providing huge savings to everyone in the community who iterates on these state-of-the-art models. The attached image shows a few selected models and how they perform on Xet.
Thanks to the meta-llama team for launching on Xet!
"Am I going to be replaced by AI?" - Crucial question, but maybe we're asking the wrong one.
📈 There's a statistic from this week's reading that has stayed with me: Tomer Cohen, LinkedIn's CPO, told Jeremy Kahn that 70% of the skills used in most jobs will change by 2030. Not jobs disappearing, but transforming. And he calls out bad leadership: "If in one year's time, you are disappointed that your workforce is not 'AI native,' it is your fault."
🔄 Apparently, the Great Recalibration has begun. According to a piece in Fast Company, we're heading into an era where AI fundamentally redefines the nature of work itself, forcing a complete reassessment of human value in the workplace. But the shift might be driven more by "the need for humans to change the way they work" than by AI itself.
⚡ The Washington Post draws a crucial parallel: we're facing an "AI shock" similar to manufacturing's "China shock" - but this one hits knowledge workers, with entry-level white-collar work especially likely to be automated. The key difference? "Winning the AI tech competition with other countries won't be enough. It's equally vital to win the battle to re-skill workers."
Did we just drop personalized AI evaluation?! This tool auto-generates custom benchmarks from your documents to test which models handle them best.
Most benchmarks test general capabilities, but what matters is how models handle your data and your tasks. YourBench helps answer critical questions like:
- Do you really need a hundreds-of-billions-parameter sledgehammer to crack a nut?
- Could a smaller, fine-tuned model work better?
- How well do different models understand your domain?
Some cool features:
📚 Generates custom benchmarks from your own documents (PDFs, Word, HTML)
🎯 Tests models on real tasks, not just general capabilities
🔄 Supports multiple models for different pipeline stages
🧠 Generates both single-hop and multi-hop questions
🔍 Evaluates top models and deploys leaderboards instantly
💰 Full cost analysis to optimize for your budget
🛠️ Fully configurable via a single YAML file (sketch below)
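To give a feel for the single-file setup, here's a hypothetical config written from Python - the field names are illustrative, not YourBench's actual schema, so check the repo for the real format:

```python
import yaml  # pip install pyyaml

# Illustrative schema only - YourBench's real config keys differ
config = {
    "documents": ["docs/handbook.pdf", "docs/faq.html"],  # your own files
    "question_types": ["single_hop", "multi_hop"],
    "models": {
        "summarization": "Qwen/Qwen2.5-32B-Instruct",     # per-stage models
        "question_generation": "Qwen/Qwen2.5-7B-Instruct",
    },
    "budget_usd": 5.0,                                    # cost ceiling
}

with open("yourbench.yaml", "w") as f:
    yaml.safe_dump(config, f, sort_keys=False)
```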
We tested 26 SOTA models for question generation. Interesting finding: Qwen2.5 32B leads in question diversity, while smaller Qwen models and Gemini 2.0 Flash offer great value for the cost.
You can also run it locally with any models you want.