The collection for the Paper "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
HKUST NLP Group
university
The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 10 • 1
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 5 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 6
models 62
hkust-nlp/WebExplorer-8B
Image-Text-to-Text • 8B • Updated • 147 • 9
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning • 8B • Updated • 7
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 6
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 5 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 10 • 1
hkust-nlp/Laser-DE-L4096-1.5B
2B • Updated • 1.03k
hkust-nlp/Laser-DE-L2048-1.5B
2B • Updated • 6
hkust-nlp/Laser-DE-L1024-1.5B
2B • Updated • 5
hkust-nlp/Laser-D-L4096-1.5B
2B • Updated • 15
datasets 28
hkust-nlp/WebExplorer-QA
Viewer • Updated • 100 • 221 • 4
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated • 46 • 2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview • Updated • 114 • 53
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer • Updated • 6.12k • 13 • 1
hkust-nlp/deepscaler_simplelr
Viewer • Updated • 40.3k • 36
hkust-nlp/Laser-Deepscaler-Dataset
Viewer • Updated • 40.8k • 96
hkust-nlp/LeetCode-O
Preview • Updated • 54
hkust-nlp/GUIMid
Viewer • Updated • 1.85M • 229 • 5
hkust-nlp/SimpleRL-Zoo-Data
Viewer • Updated • 53.1k • 408 • 6
hkust-nlp/PreSelect-100B
Viewer • Updated • 54.5M • 377 • 11