placebomancer's picture

2 2 1

placebomancer

placebomancer

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

upvoted a paper 9 days ago

Concise Reasoning via Reinforcement Learning

new activity 9 months ago

TheDrummer/Tiger-Gemma-9B-v1:Differences between Tiger Gemma, Smegmma and Broken Gemma

View all activity

Organizations

None yet

placebomancer's activity

upvoted a paper 8 days ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Paper • 2405.19107 • Published May 29, 2024 • 15

upvoted a paper 9 days ago

Concise Reasoning via Reinforcement Learning

Paper • 2504.05185 • Published 15 days ago • 2

New activity in TheDrummer/Tiger-Gemma-9B-v1 9 months ago

Differences between Tiger Gemma, Smegmma and Broken Gemma

#1 opened 9 months ago by

liked a Space 10 months ago

Gemma 2 llama.cpp 2B/9B/27B

Chat with Gemma 2 for text-based conversations

New activity in open-llm-leaderboard/open_llm_leaderboard 10 months ago

WizardLM-8x22B Evaluation failed

#823 opened 10 months ago by