Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
760.8
TFLOPS
29
7
7
Stephen Fernandes
StephennFernandes
Follow
thak123's profile picture
21world's profile picture
chad-brouze's profile picture
4 followers
·
8 following
StephennFernandes
AI & ML interests
Natural Language Processing , Reinforcement Learning
Recent Activity
new
activity
10 days ago
microsoft/Phi-4-multimodal-instruct:
Experience with Phi-4-Multimodal vs. Whisper-1 for Speech-to-Text
reacted
to
wassemgtk
's
post
with 👀
11 days ago
I’ve been diving into the iRoPE architecture from Llama 4—a game-changer for long-context models! It interleaves local attention (with RoPE) for short contexts and global attention (with inference-time temp scaling) for long-range reasoning, aiming for infinite context. I’m going to try writing iRoPE—who wants to help? Code: https://github.com/wassemgtk/iRoPE-try/blob/main/iRoPE.ipynb
upvoted
a
collection
12 days ago
Llama 4
View all activity
Organizations
StephennFernandes
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
14 days ago
glaiveai/reasoning-v1-20m
Viewer
•
Updated
29 days ago
•
22.2M
•
14k
•
192
liked
a model
3 months ago
MiniMaxAI/MiniMax-Text-01
Text Generation
•
Updated
about 8 hours ago
•
5.91k
•
570
liked
a model
4 months ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
21 days ago
•
672k
•
•
3.81k
liked
a dataset
8 months ago
argilla/magpie-ultra-v0.1
Viewer
•
Updated
Nov 26, 2024
•
50k
•
343
•
222
liked
a Space
about 1 year ago
Running
on
Zero
137
137
Beam Search Visualizer
✍
View how beam search decoding works, in detail!
liked
a dataset
about 1 year ago
Anthropic/hh-rlhf
Viewer
•
Updated
May 26, 2023
•
169k
•
14.8k
•
1.32k
liked
a Space
over 1 year ago
Runtime error
6
6
Bloom Chat
⚡