Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
10605.0
TFLOPS
1
13
8
Adam Yanxiao Zhao
sdpkjc
Follow
fredericmenezes's profile picture
qgallouedec's profile picture
2 followers
·
9 following
https://sdpkjc.com
sdpkjc_adam
sdpkjc
yanxiao-zhao
AI & ML interests
Reinforcement Learning
Recent Activity
new
activity
21 days ago
Qwen/Qwen3-1.7B:
Fix chat template in case of multiple assistant messages and no thinking
updated
a model
about 1 month ago
sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-800
published
a model
about 1 month ago
sdpkjc/Qwen2.5-0.5B-SFT-24quiz-checkpoint-800
View all activity
Organizations
sdpkjc
's models
100
Sort: Recently updated
sdpkjc/Humanoid-v4-td3_continuous_action-seed4
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Humanoid-v4-td3_continuous_action-seed2
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Humanoid-v4-td3_continuous_action-seed3
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Swimmer-v4-td3_continuous_action-seed4
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Swimmer-v4-td3_continuous_action-seed5
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Swimmer-v4-td3_continuous_action-seed3
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Ant-v4-td3_continuous_action-seed5
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Swimmer-v4-td3_continuous_action-seed2
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Ant-v4-td3_continuous_action-seed3
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Ant-v4-td3_continuous_action-seed4
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Ant-v4-td3_continuous_action-seed2
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/HalfCheetah-v4-td3_continuous_action-seed4
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/HalfCheetah-v4-td3_continuous_action-seed5
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/HalfCheetah-v4-td3_continuous_action-seed3
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Walker2d-v4-td3_continuous_action-seed5
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/HalfCheetah-v4-td3_continuous_action-seed2
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Walker2d-v4-td3_continuous_action-seed4
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Walker2d-v4-td3_continuous_action-seed3
Reinforcement Learning
•
Updated
Dec 19, 2023
sdpkjc/Hopper-v4-td3_continuous_action-seed5
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/Hopper-v4-td3_continuous_action-seed4
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/Walker2d-v4-td3_continuous_action-seed2
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/Hopper-v4-td3_continuous_action-seed2
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/Hopper-v4-td3_continuous_action-seed3
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/Swimmer-v4-sac_continuous_action-seed1
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/Hopper-v4-sac_continuous_action-seed1
Reinforcement Learning
•
Updated
Dec 18, 2023
sdpkjc/DoubleDunk-v5-dqn_atari-seed1
Reinforcement Learning
•
Updated
Dec 16, 2023
sdpkjc/BattleZone-v5-dqn_atari-seed1
Reinforcement Learning
•
Updated
Dec 16, 2023
sdpkjc/NameThisGame-v5-dqn_atari-seed1
Reinforcement Learning
•
Updated
Dec 16, 2023
sdpkjc/Qbert-v5-dqn_atari-seed1
Reinforcement Learning
•
Updated
Dec 16, 2023
sdpkjc/Phoenix-v5-dqn_atari-seed1
Reinforcement Learning
•
Updated
Dec 16, 2023
Previous
1
2
3
4
Next