Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Setpember 's Collections
DPO Alpaca
DPO HH
PPO Jon
DPO Jon
MIMIC

DPO HH

updated Dec 17, 2024
Upvote
-

  • Setpember/pythia-1b-helpful-sft

    Updated Nov 26, 2024

  • Setpember/HH_pythia_DPO_props_epi_point1

    Text Generation • 1B • Updated Nov 26, 2024 • 2

  • Setpember/HH_pythia_DPO_props_epi_point5

    Text Generation • 1B • Updated Nov 26, 2024 • 2

  • Setpember/HH_pythia_DPO_props_epi_1

    Text Generation • 1B • Updated Nov 26, 2024 • 2

  • Setpember/HH_pythia_DPO_props_epi_2

    Text Generation • 1B • Updated Nov 26, 2024 • 2

  • Setpember/gpt2-hh

    Updated Nov 26, 2024

  • Setpember/HH_GPT2_DPO_props_epi_point1

    Text Generation • 0.1B • Updated Nov 27, 2024 • 1

  • Setpember/HH_GPT2_DPO_props_epi_point5

    Text Generation • 0.1B • Updated Nov 26, 2024 • 2

  • Setpember/HH_GPT2_DPO_props_epi_1

    Text Generation • 0.1B • Updated Nov 26, 2024 • 2

  • Setpember/HH_GPT2_DPO_props_epi_2

    Text Generation • 0.1B • Updated Nov 26, 2024 • 2

  • Setpember/HH_GPT2M_DPSGD_epi_point1

    Updated Nov 26, 2024

  • Setpember/HH_GPT2M_DPSGD_epi_point5

    Updated Nov 26, 2024

  • Setpember/HH_GPT2M_DPSGD_epi_1

    Updated Nov 26, 2024

  • Setpember/HH_GPT2M_DPSGD_epi_2

    Updated Nov 26, 2024
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs