TheFireHacker PRO

TheFireHacker

https://aiedx.com

AI & ML interests

LLMs, sub-quadratic attention, AI agents, synthetic data

Recent Activity

liked a dataset 3 days ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 923k • 2.79k

upvoted a collection 4 days ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.78k

liked a dataset 7 days ago

alibaba-pai/AgenticQwen-Data

Viewer • Updated Mar 16 • 37.4k • 216 • 4

liked a model 7 days ago

alibaba-pai/AgenticQwen-8B

8B • Updated Mar 17 • 87 • 14

liked a Space 8 days ago

Info Lens

🔭

Explore the informational nature of LLMs and language.

liked a model about 2 months ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4

Text Generation • 67B • Updated 14 days ago • 920k • 303

liked a dataset 2 months ago

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 521k • 730

liked a model 2 months ago

Qwen/Qwen3.5-0.8B

Image-Text-to-Text • 0.9B • Updated Mar 2 • 2.84M • 533

liked 2 models 3 months ago

SakanaAI/doc-to-lora

Updated Feb 12 • 15

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated Sep 10, 2025 • 196k • 952

liked a Space 3 months ago

Evaluation Guidebook

📝

318

Explore LLM benchmark trends over time

liked a dataset 3 months ago

kjj0/fineweb10B-gpt2

Updated Sep 28, 2024 • 7.03k • 11

liked a model 4 months ago

arcee-ai/Trinity-Large-Base

Text Generation • 399B • Updated Apr 1 • 270 • 57

liked a dataset 4 months ago

MathLLMs/MathVision

Viewer • Updated 21 days ago • 3.34k • 18.4k • 142

liked 2 models 4 months ago

moonshotai/Kimi-VL-A3B-Instruct

Image-Text-to-Text • 16B • Updated Jan 30 • 262k • 259

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 15 days ago • 1.86M • 2.78k

liked a model 5 months ago

bubblspace/Timecapsule2.7B-g3n-mix-match

Image-Text-to-Text • 7B • Updated Aug 6, 2025 • 7 • 1

upvoted an article 6 months ago

Article

Activation Steering With Mean Response Probes: A Case Study In Suppressing Sycophancy In Language Models During TTC

TensorSlay • Nov 27, 2025 • 3

liked a model 6 months ago

PleIAs/Monad

Text Generation • 56.7M • Updated Dec 14, 2025 • 4.62k • 69

liked a dataset 6 months ago

PleIAs/SYNTH

Viewer • Updated 9 days ago • 68M • 12.1k • 262