Boris Shaposhnikov's picture

4

Boris Shaposhnikov

borisshapa

·

borisshapa

AI & ML interests

NLP

Recent Activity

authored a paper 1 day ago

The Differences Between Direct Alignment Algorithms are a Blur

upvoted a paper 1 day ago

The Differences Between Direct Alignment Algorithms are a Blur

updated a model 2 months ago

borisshapa/rm-opt-350m-hs2

View all activity

Organizations

None yet

borisshapa's activity

upvoted a paper 1 day ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 2 days ago • 100

upvoted 2 papers 8 months ago

BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM

Paper • 2406.12168 • Published Jun 18, 2024 • 7

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 87

upvoted a paper 10 months ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 83