Robin Williams's picture

Robin Williams PRO

bfuzzy1

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

updated a collection 27 days ago

upvoted a paper 27 days ago

LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation

View all activity

Organizations

None yet

bfuzzy1's activity

commented a paper 5 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 13 •

commented 3 papers 6 months ago

Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published Nov 7, 2024 • 24 •

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 20 •

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 20 •

commented 3 papers 7 months ago

Erasing Conceptual Knowledge from Language Models

Paper • 2410.02760 • Published Oct 3, 2024 • 14 •

Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise

Paper • 2410.03017 • Published Oct 3, 2024 • 29 •

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1, 2024 • 33 •

commented a paper about 1 year ago

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Paper • 2404.07839 • Published Apr 11, 2024 • 48 •