wongyukim · kimwongyuda

AI & ML interests

None yet

Organizations

None yet

wongyukim's activity

upvoted a paper about 8 hours ago

GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Paper • 2412.16855 • Published Dec 22, 2024 • 1

upvoted 3 papers 2 days ago

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published 4 days ago • 18

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 4 days ago • 36

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 4 days ago • 71

upvoted 2 papers 3 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 5 days ago • 42

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 7 days ago • 29

upvoted 2 papers 4 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 6 days ago • 28

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 6 days ago • 85

upvoted 3 papers 5 days ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 7 days ago • 23

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 8 days ago • 46

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 8 days ago • 48

upvoted 4 papers 6 days ago

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 10 days ago • 18

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published 14 days ago • 24

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 10 days ago • 41

Humanity's Last Exam

Paper • 2501.14249 • Published 10 days ago • 50

upvoted a paper 8 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published 11 days ago • 33

upvoted a paper 9 days ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published 11 days ago • 22

upvoted 2 papers 10 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 12 days ago • 23

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 12 days ago • 78

upvoted a paper 11 days ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published 13 days ago • 39