-
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Paper • 2312.08344 • Published • 13 -
Diffusion Priors for Dynamic View Synthesis from Monocular Videos
Paper • 2401.05583 • Published • 11 -
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Paper • 2401.14405 • Published • 13 -
The Lessons of Developing Process Reward Models in Mathematical Reasoning
Paper • 2501.07301 • Published • 99
James Burgess
jmhb
AI & ML interests
Vision-language models, evaluation, biology applications
Recent Activity
upvoted a paper 3 days ago
Self-Steering Language Models upvoted a paper 5 days ago
SmolVLM: Redefining small and efficient multimodal modelsOrganizations
Collections 1
models
None public yet