1 7 24

haikuoxin

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

commented a paper 16 days ago

LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting

upvoted a collection 16 days ago

Relighting

View all activity

Organizations

None yet

haikuoxin's activity

upvoted a paper 9 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 11 days ago • 92

commented a paper 16 days ago

LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting

Paper • 2412.00177 • Published Nov 29, 2024 • 7 •

upvoted a collection 16 days ago

Relighting

Collection

6 items • Updated 27 days ago • 1

upvoted 2 papers 19 days ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published 27 days ago • 41

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published 23 days ago • 38

liked a Space about 1 month ago

Running

157

📊

VBench Leaderboard

liked a model about 1 month ago

AuraDiffusion/16ch-vae

Updated Jul 3, 2024 • 43 • 67

upvoted a paper about 2 months ago

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Paper • 2411.10640 • Published Nov 16, 2024 • 44

liked a model 4 months ago

allenai/Molmo-7B-D-0924

Image-Text-to-Text • Updated Oct 10, 2024 • 537k • 492

liked a Space 5 months ago

Running on Zero

932

📈

IC Light

liked 3 models 6 months ago

liked a model 7 months ago

deepseek-ai/DeepSeek-Coder-V2-Instruct

Text Generation • Updated Aug 21, 2024 • 164k • 528

liked a model 9 months ago

OpenGVLab/InternVL-14B-224px

Image Feature Extraction • Updated Dec 9, 2024 • 781 • 36

liked 5 models 10 months ago

internlm/internlm-xcomposer2-vl-7b

Visual Question Answering • Updated Apr 12, 2024 • 2.22k • 80

01-ai/Yi-VL-34B

Image-Text-to-Text • Updated Jun 26, 2024 • 125 • 261

sentence-transformers/clip-ViT-B-32-multilingual-v1

SUSTech-NLP/mclip_base

Feature Extraction • Updated Jan 23, 2024 • 1

Lin-Chen/ShareCaptioner

Feature Extraction • Updated Jun 6, 2024 • 432 • 55