oceanpty's Collections
oceanpty/TOA-Ultrafeedback-SFT-Rand-lla3.1-8b-inst • 59.9k • 40
oceanpty/TOA-Ultrafeedback-SFT-Rand-qwen2-7b-inst • 59.9k • 50
oceanpty/TOA-Ultrafeedback-SFT-PRS-lla3.1-8b-inst • 59.9k • 47
oceanpty/TOA-Ultrafeedback-SFT-PRS-qwen2-7b-inst • 59.9k • 47
oceanpty/TOA-Ultrafeedback-SFT-Ensemble-model-num-4 • 59.9k • 34
oceanpty/TOA-Ultrafeedback-SFT-SeqRefine-model-num-4 • 59.9k • 43
oceanpty/TOA-Ultrafeedback-SFT-MoA-model-num-4 • 59.4k • 38
oceanpty/TOA-Ultrafeedback-SFT-TOA-model-num-4 • 59.8k • 44
oceanpty/TOA-Ultrafeedback-DPO-TOA-model-num-4 • 57.1k • 37
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-Rand-lla31-8b-inst
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-PRS-lla31-8b-inst
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-ensemble
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-SeqRefine
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-MoA • 12
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-TOA
oceanpty/TOA-ultrafeedback-lla3-8b-inst-dpo-data-small-scale-mcts-n-40-pi-0-ni-30