Hanan Gani's picture

5 1

Hanan Gani

hanangani

·

https://hananshafi.github.io/

AI & ML interests

Deep Learning

Recent Activity

liked a dataset about 1 month ago

cmu-lti/sotopia

updated a model 3 months ago

hanangani/rgb-wm-ckpts

published a model 3 months ago

hanangani/rgb-wm-ckpts

View all activity

Organizations

upvoted a paper 8 months ago

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

Paper • 2506.05336 • Published Jun 5, 2025 • 9

upvoted a paper 12 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6, 2025 • 72

upvoted 3 papers over 1 year ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 23

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs

Paper • 2406.10326 • Published Jun 14, 2024 • 1

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Paper • 2310.10640 • Published Oct 16, 2023 • 2