-
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Paper • 2501.00958 • Published • 107 -
Are Vision-Language Models Truly Understanding Multi-vision Sensor?
Paper • 2412.20750 • Published • 20 -
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper • 2412.21187 • Published • 42 -
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Paper • 2412.18925 • Published • 101
Xiao Li
lixiao20
AI & ML interests
ML security
Recent Activity
upvoted
a
paper
about 2 months ago
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based
Perspective
liked
a model
about 2 months ago
suixin1424/oddefense
new activity
2 months ago
lixiao20/PartImageNetPP:[bot] Conversion to Parquet
Organizations
Collections
1
Papers
1
models
None public yet