zhanghang's picture

1 11 9

zhanghang

hangzhang-nlp

·

hangzhang-nlp

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

upvoted a paper 15 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

upvoted a paper 29 days ago

Qwen2.5 Technical Report

View all activity

Organizations

hangzhang-nlp's activity

upvoted 2 papers 15 days ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 18 days ago • 41

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 16 days ago • 95

upvoted a paper 29 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 30 days ago • 340

upvoted 2 papers 3 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 31

upvoted a paper 6 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 56

upvoted an article 9 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 171

upvoted a collection 9 months ago

WizardLM

0 items • Updated 10 days ago • 104

upvoted a paper 9 months ago

Audio Dialogues: Dialogues dataset for audio and music understanding

Paper • 2404.07616 • Published Apr 11, 2024 • 16

upvoted 2 papers over 1 year ago

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 37

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

Paper • 2309.09958 • Published Sep 18, 2023 • 18