Built on a completely new Arlow architecture and pre-trained on more than 1 trillion tokens.
Yuchen Xie
yuchenxie
AI & ML interests
NLP, Transformers
Recent Activity
new activity, 10 days ago: google/gemma-3-27b-it: How much gpu memory does gemma-3-27b-it require? can not run with vllm
published a model, 14 days ago: yuchenxie/ArlowGPT-Base
updated a model, 14 days ago: yuchenxie/ArlowGPT-Base
Organizations
Collections 2
models 12

yuchenxie/ArlowGPT-Base • Text Generation • Updated • 3
yuchenxie/arlowgpt-tokenizer-v2 • Updated
yuchenxie/ArlowGPT-Tokenizer • Updated
yuchenxie/ArlowGPT-VL-OCR • Image-Text-to-Text • Updated • 1
yuchenxie/ArlowGPT-VLM-Untrained • Updated • 5 • 2
yuchenxie/ArlowGPT-8B • Text Generation • Updated • 2.41k • 3
yuchenxie/ArlowGPT-3B • Text Generation • Updated • 7 • 1
yuchenxie/GPT-2V • Image-Text-to-Text • Updated • 19 • 1
yuchenxie/ArlowGPT-VL-CLiP • Image-Text-to-Text • Updated
yuchenxie/CLiP • Zero-Shot Image Classification • Updated • 7 • 1
datasets
None public yet