amirabdullah19852020/gpt-neo-125m_utility_reward • Reinforcement Learning • Updated Feb 10, 2024 • 13
amirabdullah19852020/pythia-70m_sentiment_reward • Reinforcement Learning • Updated Feb 10, 2024 • 13
amirabdullah19852020/pythia-160m_sentiment_reward • Reinforcement Learning • Updated Feb 10, 2024 • 13
amirabdullah19852020/gpt-neo-125m_sentiment_reward • Reinforcement Learning • Updated Feb 10, 2024 • 14
amirabdullah19852020/pythia-160m_utility_reward • Reinforcement Learning • Updated Feb 10, 2024 • 13
amirabdullah19852020/pythia-70m_utility_reward • Reinforcement Learning • Updated Feb 10, 2024 • 18