Bai Yang's picture

Bai Yang

ShacklesLay

·

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

OpenMOSS-Team/moss-video-preview-base

liked a model 17 days ago

OpenMOSS-Team/moss-video-preview-sft

liked a model 17 days ago

OpenMOSS-Team/moss-video-preview-realtime-sft

View all activity

Organizations

upvoted a collection 28 days ago

MOSS-VL

2 items • Updated 19 days ago • 54

upvoted a paper 3 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

upvoted a paper 9 months ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79

upvoted a paper about 1 year ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3, 2025 • 86

upvoted 7 papers over 1 year ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 130

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 449

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9, 2025 • 44

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published Dec 17, 2024 • 76

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Paper • 2410.23918 • Published Oct 31, 2024 • 21

HumanEval-V: Benchmarking High-Level Visual Reasoning with Complex Diagrams in Coding Tasks

Paper • 2410.12381 • Published Oct 16, 2024 • 43

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 86

upvoted 5 papers about 2 years ago

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 65

Repetition Improves Language Model Embeddings

Paper • 2402.15449 • Published Feb 23, 2024 • 2

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 91

InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance

Paper • 2401.11206 • Published Jan 20, 2024 • 2

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109

upvoted 2 papers over 2 years ago

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning

Paper • 2312.01552 • Published Dec 4, 2023 • 31

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 40