Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
11
8
Jie Liu
jieliu
Follow
sefira32's profile picture
dododododo's profile picture
MingleiShi's profile picture
16 followers
·
17 following
yifan123
AI & ML interests
Reinforcement Learning, Large Language Model
Recent Activity
upvoted
a
paper
16 days ago
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers
upvoted
a
paper
24 days ago
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
upvoted
a
paper
about 2 months ago
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
View all activity
Organizations
Papers
6
arxiv:
2501.13918
arxiv:
2407.16154
arxiv:
2406.11817
arxiv:
2402.12343
Expand 6 papers
models
7
Sort: Recently updated
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-noval-beta0.5-bs24
Updated
Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-math-noval-beta0.5-bs24
Updated
Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24-seq2048
Updated
Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24
Updated
Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5
Updated
Sep 3, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-beta0.5
Updated
Jul 30, 2024
jieliu/Storm-7B
Text Generation
•
Updated
Jun 18, 2024
•
20
•
41
datasets
1
jieliu/homepage
Viewer
•
Updated
Feb 10
•
4
•
397