ShareGPT4Omni (ShareGPT4Omni)

Lin-Chen

authored a paper 3 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 94

Lin-Chen

authored a paper 4 months ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published Nov 28, 2024 • 33

Lin-Chen

authored a paper 8 months ago

VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models

Paper • 2407.11691 • Published Jul 16, 2024 • 14

Lin-Chen

authored 2 papers 9 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 95

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20, 2024 • 35

Lin-Chen

authored a paper 10 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 75

Lin-Chen

updated a dataset 10 months ago

ShareGPT4Omni/ShareGPT4Omni

Updated Jun 7, 2024 • 4 • 6

Lin-Chen

authored a paper 12 months ago

Are We on the Right Way for Evaluating Large Vision-Language Models?

Paper • 2403.20330 • Published Mar 29, 2024 • 6

Lin-Chen

authored 2 papers over 1 year ago

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Paper • 2311.12793 • Published Nov 21, 2023 • 18

FreeDrag: Point Tracking is Not You Need for Interactive Point-based Image Editing

Paper • 2307.04684 • Published Jul 10, 2023 • 1

AI & ML interests

ShareGPT4Omni's activity

Team members 1
