1 30 65

罗杰斯

rojasdiego

https://rojasdiego.com

AI & ML interests

LLMs for Code Generation

Recent Activity

liked a model about 1 month ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

liked a dataset 6 months ago

HuggingFaceFW/finepdfs

upvoted a paper 7 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

View all activity

Organizations

liked a model about 1 month ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

Text Generation • 18B • Updated 12 days ago • 490k • 105

liked a dataset 6 months ago

HuggingFaceFW/finepdfs

Viewer • Updated Jan 9 • 476M • 28.4k • 818

upvoted a paper 7 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

liked a dataset 7 months ago

HuggingFaceTB/smoltalk

Viewer • Updated Feb 10, 2025 • 2.2M • 6.56k • 392

liked a model 7 months ago

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 42.6k • • 1.4k

liked a Space 8 months ago

GLM-4.1V-9B-Thinking-Demo

🐢

THUDM/GLM-4.1V-9B-Thinking Demo

liked 3 datasets 9 months ago

liked a dataset 10 months ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 17.7k • 227

liked 3 models 11 months ago

zai-org/GLM-4-32B-0414

Text Generation • Updated May 1, 2025 • 1.6k • • 484

zai-org/GLM-Z1-Rumination-32B-0414

Text Generation • 33B • Updated Apr 15, 2025 • 94 • 116

GSAI-ML/LLaDA-8B-Instruct

Text Generation • Updated Oct 21, 2025 • 376k • 350

upvoted a paper 11 months ago

Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding

Paper • 2504.06719 • Published Apr 9, 2025 • 8

liked a model 12 months ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 1.52M • • 1.91k

updated a model 12 months ago

rojasdiego/Qwen2.5-Coder-7B-Next-Action-Prediction

Text Generation • 8B • Updated Mar 10, 2025 • 1

published a model 12 months ago

rojasdiego/Qwen2.5-Coder-7B-Next-Action-Prediction

Text Generation • 8B • Updated Mar 10, 2025 • 1

upvoted 2 papers 12 months ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published Feb 27, 2025 • 27

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6, 2025 • 21

upvoted a paper about 1 year ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published Feb 9, 2025 • 40

罗杰斯

AI & ML interests

Recent Activity

Organizations

rojasdiego's activity

GLM-4.1V-9B-Thinking-Demo