Di Zhang

qq8933

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

Organizations

AI4Chem's profile picture SimpleBerry Research Lab's profile picture

qq8933's activity

New activity in AI4Chem/ChemBench4K 13 days ago
posted an update 25 days ago
replied to their post about 1 month ago
posted an update about 1 month ago
view post
Post
2594
LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.
We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation.
·
posted an update about 1 month ago
New activity in qq8933/AIME_1983_2024 about 1 month ago

how about 2024 I

3
#2 opened 4 months ago by
hl0737