placebomancer
placebomancer
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
12 days ago
Offline Regularised Reinforcement Learning for Large Language Models
Alignment
upvoted
a
paper
12 days ago
Concise Reasoning via Reinforcement Learning
new activity
10 months ago
TheDrummer/Tiger-Gemma-9B-v1:Differences between Tiger Gemma, Smegmma and Broken Gemma
Organizations
None yet
placebomancer's activity
Differences between Tiger Gemma, Smegmma and Broken Gemma
22
#1 opened 10 months ago
by
isr431
WizardLM-8x22B Evaluation failed
28
25
#823 opened 10 months ago
by
llama-anon
