InHo Won

kotmul

AI & ML interests

None yet

Recent Activity

liked a model 11 days ago

robbyant/lingbot-world-base-cam

liked a dataset about 1 month ago

nvidia/NitroGen

liked a dataset about 1 month ago

nvidia/PhysicalAI-Autonomous-Vehicles

upvoted an article about 2 months ago

Article

Deriving the PPO Loss from First Principles

Dec 25, 2025

upvoted a paper 3 months ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11, 2025 • 42

upvoted a collection 4 months ago

KORMo-10B

Collection

KORMo-10B models • 4 items • Updated Oct 13, 2025 • 19

upvoted a paper 4 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 86

upvoted 3 papers about 1 year ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 124

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 440

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

upvoted an article about 1 year ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Sep 27, 2024

upvoted 6 papers almost 2 years ago

Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean

Paper • 2403.10882 • Published Mar 16, 2024 • 6

X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment

Paper • 2403.11399 • Published Mar 18, 2024 • 6

BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

Paper • 2401.06443 • Published Jan 12, 2024 • 2

upvoted a collection almost 2 years ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 895

upvoted 5 papers almost 2 years ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 24

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 72

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Paper • 2402.15506 • Published Feb 23, 2024 • 17

In deep reinforcement learning, a pruned network is a good network

Paper • 2402.12479 • Published Feb 19, 2024 • 19

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109