placebomancer
placebomancer
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
6 days ago
Offline Regularised Reinforcement Learning for Large Language Models
Alignment
upvoted
a
paper
6 days ago
Concise Reasoning via Reinforcement Learning
new activity
9 months ago
TheDrummer/Tiger-Gemma-9B-v1:Differences between Tiger Gemma, Smegmma and Broken Gemma
Organizations
None yet
models
None public yet
datasets
None public yet