Guowei Xu PRO

Xkev
https://xugw-kevin.github.io
  • Kevin_GuoweiXu
  • XuGW-Kevin

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
upvoted a paper about 2 months ago
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
upvoted a paper about 2 months ago
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Organizations

None yet

Collections 1

LLaVA-CoT
  • Running on Zero
    38

    Llama 3.2V 11B Cot

    💬

    Generate descriptions and answers by combining text and images

  • Xkev/Llama-3.2V-11B-cot

    Image-Text-to-Text • Updated Dec 16, 2024 • 4.93k • 153
  • Xkev/LLaVA-CoT-100k

    Viewer • Updated Nov 27, 2024 • 98.6k • 1.91k • 92
  • LLaVA-o1: Let Vision Language Models Reason Step-by-Step

    Paper • 2411.10440 • Published Nov 15, 2024 • 125

Papers 5

arxiv:2411.10440
arxiv:2410.14972
arxiv:2402.14528
arxiv:2310.19668

Spaces 1

Running on Zero
38

Llama 3.2V 11B Cot

💬

Generate descriptions and answers by combining text and images

Mar 18

Models 1

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 4.93k • 153

Datasets 1

Xkev/LLaVA-CoT-100k

Viewer • Updated Nov 27, 2024 • 98.6k • 1.91k • 92