Jay Shin
jshin49
AI & ML interests
None yet
Recent Activity
updated a model 2 days ago: trillionlabs/Trillion-7B-preview
published a model 2 days ago: trillionlabs/Trillion-7B-preview
liked a model 6 months ago: meta-llama/Meta-Llama-3-8B
Organizations
Collections
7
- Pre-training Small Base LMs with Fewer Tokens
  Paper • 2404.08634 • Published • 35
- Ziya2: Data-centric Learning is All LLMs Need
  Paper • 2311.03301 • Published • 20
- How to Train Data-Efficient LLMs
  Paper • 2402.09668 • Published • 42
- MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
  Paper • 2404.06395 • Published • 22
Models
None public yet
Datasets
None public yet