5 339 59

Minbyul Jeong

Minbyul

https://minstar.github.io/

AI & ML interests

Biomedical Natural Language Processing, Graph Network

Recent Activity

liked a dataset 2 days ago

dmis-lab/RF-Collection

liked a dataset 16 days ago

agentica-org/DeepScaleR-Preview-Dataset

upvoted a paper 22 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

View all activity

Organizations

Minbyul's activity

upvoted 2 papers 22 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published 27 days ago • 27

CoRe^2: Collect, Reflect and Refine to Generate Better and Faster

Paper • 2503.09662 • Published 28 days ago • 33

upvoted a paper 28 days ago

CompAct: Compressing Retrieved Documents Actively for Question Answering

Paper • 2407.09014 • Published Jul 12, 2024 • 1

upvoted a paper about 1 month ago

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 20

upvoted 3 papers about 2 months ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20 • 26

Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models

Paper • 2401.15269 • Published Jan 27, 2024 • 1

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published Feb 17 • 15

upvoted 2 papers 4 months ago

Monet: Mixture of Monosemantic Experts for Transformers

Paper • 2412.04139 • Published Dec 5, 2024 • 13

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 52

upvoted a collection 4 months ago

Inference-Time Intervention (ITI) Models

Collection

A collection of Llama models with Inference-Time Intervention (Li et al.) applied to them. Codebase: https://github.com/likenneth/honest_llama • 6 items • Updated Jan 27 • 3

upvoted 5 papers 5 months ago

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 36

upvoted a collection 5 months ago

Solar Pro

Collection

The most intelligent LLM on a single GPU • 4 items • Updated Nov 15, 2024 • 14

upvoted 4 papers 5 months ago

Sample-Efficient Alignment for LLMs

Paper • 2411.01493 • Published Nov 3, 2024 • 12

Survey of Cultural Awareness in Language Models: Text and Beyond

Paper • 2411.00860 • Published Oct 30, 2024 • 24

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published Oct 24, 2024 • 44

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21, 2024 • 56