TheFireHacker's picture

TheFireHacker

TheFireHacker

·

https://aiedx.com

AI & ML interests

LLM, sub quadraric attention, AI Agents , Synthetic Data

Recent Activity

published a model about 7 hours ago

bubblspace/Bubblspace-steer-gemma-2-2b-it

liked a model about 9 hours ago

Menlo/ReZero-v0.1-llama-3.2-3b-it-grpo-250404

liked a model about 9 hours ago

google/gemma-2-2b-it

View all activity

Organizations

TheFireHacker's activity

upvoted a paper 3 months ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 51

upvoted 2 papers about 1 year ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Paper • 2309.16058 • Published Sep 27, 2023 • 55