Bowen Peng

bloc97

AI & ML interests

Machine Learning, Computer Graphics, Language Models

Recent Activity

upvoted a collection 9 days ago

Nemotron-UltraLong

updated a model about 2 months ago

bloc97/150m-auto-88000

published a model about 2 months ago

bloc97/150m-auto-88000

bloc97's activity

upvoted a collection 9 days ago

Nemotron-UltraLong

3 items • Updated 5 days ago • 12

upvoted a paper 3 months ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 57

upvoted a paper 8 months ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 51

upvoted a paper 10 months ago

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28, 2024 • 32

upvoted a collection about 1 year ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 748

upvoted 6 papers about 1 year ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 128

V3D: Video Diffusion Models are Effective 3D Generators

Paper • 2403.06738 • Published Mar 11, 2024 • 31

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

Resonance RoPE: Improving Context Length Generalization of Large Language Models

Paper • 2403.00071 • Published Feb 29, 2024 • 25

Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29, 2024 • 54

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 116

upvoted 4 papers over 1 year ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 39

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 89