PaTaRM PaTaRM is a Generative Reward Model (GRM) for RLHF alignment. AIJian/PaTaRM-8B Text Generation • 0.5B • Updated 16 days ago • 864 AIJian/PaTaRM-data Preview • Updated 17 days ago • 35 AIJian/PaTaRM-14B Text Generation • 0.5B • Updated 17 days ago • 1.35k AIJian/PaTaRM Updated 17 days ago
PaTaRM PaTaRM is a Generative Reward Model (GRM) for RLHF alignment. AIJian/PaTaRM-8B Text Generation • 0.5B • Updated 16 days ago • 864 AIJian/PaTaRM-data Preview • Updated 17 days ago • 35 AIJian/PaTaRM-14B Text Generation • 0.5B • Updated 17 days ago • 1.35k AIJian/PaTaRM Updated 17 days ago