John Schaefer's picture

16 1

John Schaefer

johnschaefer

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

upvoted a paper 16 days ago

Z1: Efficient Test-time Scaling with Code

upvoted a paper 19 days ago

Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging

View all activity

Organizations

None yet

johnschaefer's activity

upvoted 2 papers 16 days ago

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published 18 days ago • 74

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published 17 days ago • 25

upvoted 2 papers 19 days ago

Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging

Paper • 2503.22236 • Published 21 days ago • 11

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving

Paper • 2503.21821 • Published 23 days ago • 17

upvoted 2 papers 22 days ago

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Paper • 2503.19462 • Published 24 days ago • 10

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search

Paper • 2503.20757 • Published 23 days ago • 9

upvoted 3 papers 28 days ago

Scale-wise Distillation of Diffusion Models

Paper • 2503.16397 • Published 29 days ago • 38

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17 • 95

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published 29 days ago • 87

upvoted 3 papers about 1 month ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 69

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 108

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 20

upvoted 3 papers 3 months ago

Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Paper • 2501.12273 • Published Jan 21 • 14

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21 • 43

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 46

liked a dataset 3 months ago

yale-nlp/MMVU

Viewer • Updated Feb 28 • 1k • 1.67k • 55

upvoted a paper 3 months ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 86