Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward1 Reinforcement Learning • Updated Jul 15, 2023 • 1
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward2 Reinforcement Learning • Updated Jul 15, 2023 • 2
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_v2 Reinforcement Learning • Updated Jul 15, 2023 • 34
Evan-Lin/Bart-RL-many-keywordmax-entailment-attractive-reward5 Reinforcement Learning • Updated Jul 16, 2023 • 1
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_v3 Reinforcement Learning • Updated Jul 16, 2023 • 6
amirabdullah19852020/pythia_70m_ppo_imdb_sentiment_with_checkpoints Reinforcement Learning • Updated Jul 16, 2023 • 4
Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer Reinforcement Learning • Updated Jul 22, 2023 • 2
Evan-Lin/Bart-RL-many-keywordmax1-entailment1-attractive1-reward1-epoch2 Reinforcement Learning • Updated Jul 20, 2023 • 1
Evan-Lin/Bart-RL-many-keywordmax1-attractive1-reward1-epoch2 Reinforcement Learning • Updated Jul 21, 2023 • 1
Evan-Lin/Bart-Amazon-many-keywordmax1-attractive1-reward1-epoch0 Reinforcement Learning • Updated Jul 23, 2023 • 1
Evan-Lin/Bart-Amazon-many-keywordmax1-attractive1-reward1-epoch1 Reinforcement Learning • Updated Jul 23, 2023 • 1
Evan-Lin/Bart-RL-rouge-attractive1-lr5e-06-factor0.1 Reinforcement Learning • Updated Jul 25, 2023 • 1
Evan-Lin/Bart-RL-rougebatch-attractive1-lr5e-06-factor0.1 Reinforcement Learning • Updated Jul 26, 2023 • 2
Evan-Lin/Bart-Amazon-rougelastbatch-attractive2-keywordmax1 Reinforcement Learning • Updated Jul 27, 2023 • 1
Evan-Lin/Bart-Yelp-rougelastbatch-attractive1-keywordmax1-decoding Reinforcement Learning • Updated Jul 28, 2023 • 1
Evan-Lin/Bart-Yelp-rougelastbatch2-attractive1-keywordmax1-len0 Reinforcement Learning • Updated Jul 28, 2023 • 1
Evan-Lin/Bart-Yelp-rougelastbatch-attractive1-keywordmax1-decoding-test Reinforcement Learning • Updated Jul 28, 2023 • 1
Evan-Lin/Bart-Yelp-rougelastbatch-enc0.5-rep0.5-len0 Reinforcement Learning • Updated Jul 29, 2023 • 2
Evan-Lin/Bart-Amazon-rougelastbatch1-attractive2-keywordmax1 Reinforcement Learning • Updated Aug 1, 2023 • 2
Evan-Lin/Bart-cnn-Yelp-abs-attractive1-keywordmax1epoch0 Reinforcement Learning • Updated Aug 2, 2023 • 1