HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 88
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System Paper • 2412.20005 • Published Dec 28, 2024 • 17
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published Dec 16, 2024 • 50
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Paper • 2412.17483 • Published Dec 23, 2024 • 30
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published Dec 16, 2024 • 33
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 71
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published Oct 3, 2024 • 48
Memory-Efficient LLM Training with Online Subspace Descent Paper • 2408.12857 • Published Aug 23, 2024 • 13
Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time Paper • 2408.13233 • Published Aug 23, 2024 • 22
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community Paper • 2408.08291 • Published Aug 15, 2024 • 11
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 64
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 Paper • 2408.05147 • Published Aug 9, 2024 • 38
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 54
ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation Paper • 2408.02226 • Published Aug 5, 2024 • 10
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Paper • 2407.12077 • Published Jul 16, 2024 • 54
TroL: Traversal of Layers for Large Language and Vision Models Paper • 2406.12246 • Published Jun 18, 2024 • 34