7 183 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

updated a collection 27 days ago

Nifty

upvoted a paper 27 days ago

LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation

View all activity

Organizations

None yet

bfuzzy1's activity

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 15 days ago • 73

upvoted a paper 27 days ago

LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation

Paper • 2503.19950 • Published 29 days ago • 11

upvoted a paper 28 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published about 1 month ago • 117

upvoted 2 papers about 1 month ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 110

MeshPad: Interactive Sketch Conditioned Artistic-designed Mesh Generation and Editing

Paper • 2503.01425 • Published Mar 3 • 14

upvoted 2 papers about 2 months ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 26

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 69

upvoted 6 papers 2 months ago

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity

Paper • 2502.11901 • Published Feb 17 • 6

Dyve: Thinking Fast and Slow for Dynamic Process Verification

Paper • 2502.11157 • Published Feb 16 • 7

CRANE: Reasoning with constrained LLM generation

Paper • 2502.09061 • Published Feb 13 • 19

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published Feb 16 • 22

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17 • 45

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published Feb 13 • 16

upvoted a collection 2 months ago

Tools for learning AI

Collection

This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated Feb 26 • 67

upvoted a paper 2 months ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70

upvoted an article 2 months ago

Article

1 Billion Classifications

Feb 13

• 43

upvoted 4 papers 2 months ago