Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Patel
Kimish
Follow
shuyuej's profile picture
1 follower
·
4 following
AI & ML interests
None yet
Recent Activity
reacted
to
wassemgtk
's
post
with 😎
3 days ago
I’ve been diving into the iRoPE architecture from Llama 4—a game-changer for long-context models! It interleaves local attention (with RoPE) for short contexts and global attention (with inference-time temp scaling) for long-range reasoning, aiming for infinite context. I’m going to try writing iRoPE—who wants to help? Code: https://github.com/wassemgtk/iRoPE-try/blob/main/iRoPE.ipynb
View all activity
Organizations
models
None public yet
datasets
None public yet