SimonHinton

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

liked a Space about 1 month ago

black-forest-labs/FLUX.1-Fill-dev

liked a Space about 1 month ago

yslan/GaussianAnything-AIGC3D

Organizations

None yet

SimonHinton's activity

upvoted a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 122

liked 4 Spaces about 1 month ago

Running on Zero

183

🖌️

QwQ-32B-Preview

upvoted 6 papers about 1 month ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 105

Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding

Paper • 2412.00493 • Published Nov 30, 2024 • 16

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Paper • 2412.03085 • Published Dec 4, 2024 • 12

NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

Paper • 2412.02030 • Published Dec 2, 2024 • 18

CleanDIFT: Diffusion Features without Noise

Paper • 2412.03439 • Published Dec 4, 2024 • 12

Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

Paper • 2412.03515 • Published Dec 4, 2024 • 25

liked a model about 1 month ago

black-forest-labs/FLUX.1-Fill-dev

Updated Nov 25, 2024 • 35.4k • 456

upvoted 6 papers about 1 month ago

Trajectory Attention for Fine-grained Video Motion Control

Paper • 2411.19324 • Published Nov 28, 2024 • 12

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Paper • 2411.19108 • Published Nov 28, 2024 • 17

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 25

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Paper • 2411.18478 • Published Nov 27, 2024 • 33

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27, 2024 • 30

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27, 2024 • 52

FLUX.1 Fill Dev

GaussianAnything-AIGC3D

OminiControl

QwQ-32B-Preview