Datasets used to train the LION pipeline. Paper: https://arxiv.org/abs/2407.06542; Code: https://github.com/Columbia-NLP-Lab/LionAlignment

Columbia NLP
university
AI & ML interests
Natural language processing group at Columbia University
Organization Card
Columbia University - NLP
models 20

Columbia-NLP/LION-Gemma-2b-sft-v1.0 • Text Generation • 70
Columbia-NLP/LION-Gemma-2b-dpo-v1.0 • Text Generation • 15
Columbia-NLP/LION-Gemma-2b-odpo-v1.0 • Text Generation • 21 • 4
Columbia-NLP/LION-LLaMA-3-8b-sft-v1.0 • Text Generation • 19
Columbia-NLP/LION-LLaMA-3-8b-dpo-v1.0 • Text Generation • 28 • 2
Columbia-NLP/LION-LLaMA-3-8b-odpo-v1.0 • Text Generation • 52 • 2
Columbia-NLP/llama3-8b-instruct-rewriting-r-Decor • Text Generation • 16
Columbia-NLP/llama3-8b-instruct-rewriting-nr-Decor • Text Generation • 11
Columbia-NLP/llama2-7b-rewriting-r-Decor • Text Generation • 14
Columbia-NLP/llama2-7b-rewriting-nr-Decor • Text Generation • 12

datasets 18

Columbia-NLP/PUPA • 901 • 83 • 1
Columbia-NLP/DPO-hh-rlhf • 169k • 82
Columbia-NLP/DPO-PKU-SafeRLHF • 136k • 91
Columbia-NLP/DPO-HelpSteer • 9.17k • 78
Columbia-NLP/DPO-tldr-summarisation-preferences • 177k • 131
Columbia-NLP/DPO-py-dpo-v0.1 • 9.47k • 61
Columbia-NLP/DPO-UltraFeedback_binarized • 62.7k • 55
Columbia-NLP/DPO-distilabel-intel-orca-dpo-pairs_cleaned • 12.8k • 82
Columbia-NLP/DPO-distilabel-capybara-dpo-7k-binarized • 7.56k • 84
Columbia-NLP/DPO-Nectar • 183k • 103