Yuxian Gu

t1101675

AI & ML interests

Efficient methods for language models

Organizations

Conversational AI (CoAI) group from Tsinghua University
Efficient-Large-Model
MiniLLM
Data Selection
VILA / Molmo

t1101675's activity

New activity in MiniLLM/MiniLLM-gpt2-340M 25 days ago
New activity in MiniLLM/SFT-gpt2-120M 25 days ago
New activity in MiniLLM/SFT-gpt2-760M 25 days ago
New activity in Data-Selection/PDS-470M 25 days ago
New activity in Data-Selection/PDS-160M 25 days ago
New activity in Data-Selection/PDS-1B 25 days ago

Add link to code repository

#2 opened 25 days ago by nielsr

New activity in Data-Selection/PDS-1.7B 25 days ago
New activity in Data-Selection/BSL-1.7B 25 days ago

Add link to code

#2 opened 25 days ago by nielsr

New activity in MiniLLM/MiniPLM-Mamba-130M 25 days ago
New activity in MiniLLM/MiniPLM-Qwen-1.2B 25 days ago

Add link to code

#1 opened 25 days ago by nielsr

New activity in MiniLLM/Ref-Pretrain-Qwen-104M 25 days ago

Add link to code

#1 opened 25 days ago by nielsr

New activity in MiniLLM/Pretrain-Qwen-1.2B 25 days ago

Add link to code

#1 opened 25 days ago by nielsr

New activity in MiniLLM/Pretrain-Qwen-500M 25 days ago

No changes needed

#1 opened 25 days ago by nielsr

New activity in MiniLLM/Pretrain-Qwen-200M 25 days ago

Add link to code

#1 opened 25 days ago by nielsr

New activity in MiniLLM/VanillaKD-Pretrain-Qwen-500M 25 days ago

Add link to code

#1 opened 25 days ago by nielsr