Yash Marathe's picture

Yash Marathe

yashmarathe

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

all-hands/openhands-critic-32b-exp-20250417

published a model about 11 hours ago

yashmarathe/MathMind

upvoted a collection 2 days ago

View all activity

Organizations

yashmarathe's activity

upvoted a collection 2 days ago

Skywork-OR1

Skywork Open Reasoner 1 • 8 items • Updated 4 days ago • 20

upvoted a collection 3 days ago

Kimina Prover Preview

State-of-the-Art Models for Formal Mathematical Reasoning • 4 items • Updated 3 days ago • 26

upvoted a collection 8 days ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 5 days ago • 59

upvoted a collection 30 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 3 days ago • 282

upvoted a collection about 1 month ago

L1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 2 items • Updated Mar 7 • 5

upvoted a collection 2 months ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Feb 20 • 51

upvoted an article 2 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 143

upvoted 2 papers 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

upvoted a collection 4 months ago

Toy Models to Study

9 items • Updated Mar 17, 2024 • 2

upvoted 2 collections 5 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Mar 13 • 78

Models for dataset curation

9 items • Updated Dec 5, 2024 • 17

upvoted 2 papers 5 months ago

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1, 2024 • 32

Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch

Paper • 2410.18693 • Published Oct 24, 2024 • 43

upvoted a paper 6 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 55

upvoted a collection 7 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted an article 9 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30, 2024

• 66

upvoted a paper about 1 year ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109