4 20 13

Tung-Lin Wu

tunglinwu

tunglinwood

AI & ML interests

None yet

Recent Activity

upvoted a collection 7 days ago

GLM-4-0414

upvoted a paper 12 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

upvoted a paper 17 days ago

Training Sparse Mixture Of Experts Text Embedding Models

View all activity

Organizations

None yet

tunglinwu's activity

upvoted a collection 7 days ago

GLM-4-0414

Collection

GLM-4-0414 series model • 8 items • Updated 7 days ago • 104

upvoted a paper 12 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 213

upvoted a paper 17 days ago

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11 • 6

upvoted a collection 18 days ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 26 days ago • 89

upvoted a paper about 1 month ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 116

upvoted an article about 1 month ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 211

upvoted 2 papers about 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 226

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 29

upvoted a paper 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 384

upvoted an article 2 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 567

upvoted an article 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

upvoted a collection 4 months ago

Llama 3.3

Collection

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 157

upvoted a paper 6 months ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 24

upvoted a collection 6 months ago

Emu3

Collection

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 71

upvoted a paper 7 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

upvoted a collection 7 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 598

upvoted a paper 8 months ago

Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches

Paper • 2408.04567 • Published Aug 8, 2024 • 27