htian

htian01

AI & ML interests

None yet

Organizations

None yet

htian01's activity

upvoted a paper about 1 month ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 48

upvoted 2 papers 4 months ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 38

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 155

liked 9 models 4 months ago

OpenGVLab/InternViT-6B-448px-V2_5

Image Feature Extraction • Updated Dec 9, 2024 • 768 • 33

OpenGVLab/InternViT-300M-448px-V2_5

Image Feature Extraction • Updated Dec 9, 2024 • 22.4k • 30

OpenGVLab/InternVL2_5-1B

Image-Text-to-Text • Updated 27 days ago • 28.1k • 57

OpenGVLab/InternVL2_5-2B

Image-Text-to-Text • Updated 27 days ago • 10.2k • 29

OpenGVLab/InternVL2_5-4B

Image-Text-to-Text • Updated 27 days ago • 29.5k • 49

OpenGVLab/InternVL2_5-8B

Image-Text-to-Text • Updated 27 days ago • 35.2k • 88

OpenGVLab/InternVL2_5-26B

Image-Text-to-Text • Updated 27 days ago • 6.77k • 33

OpenGVLab/InternVL2_5-38B

Image-Text-to-Text • Updated 27 days ago • 35.2k • 50

OpenGVLab/InternVL2_5-78B

Image-Text-to-Text • Updated 27 days ago • 9.68k • 191

upvoted a paper 5 months ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 81

liked 7 models 9 months ago

OpenGVLab/InternVL2-Llama3-76B

Image-Text-to-Text • Updated 27 days ago • 443 • 213

OpenGVLab/InternVL2-1B

Image-Text-to-Text • Updated 27 days ago • 43.7k • 71

OpenGVLab/InternVL2-2B-AWQ

Image-Text-to-Text • Updated 27 days ago • 10.6k • 16

OpenGVLab/InternVL2-2B

Image-Text-to-Text • Updated 27 days ago • 281k • 67

OpenGVLab/InternVL2-4B

Image-Text-to-Text • Updated 27 days ago • 9.08k • 53

OpenGVLab/InternVL2-8B

Image-Text-to-Text • Updated 27 days ago • 41.7k • 173

OpenGVLab/InternVL2-26B

Image-Text-to-Text • Updated 27 days ago • 1.53k • 120