Ruben Roy

rubenroy

https://www.ruben-roy.com

AI & ML interests

OCR, CV, Text-to-Image, Text-to-Video, Text-to-3D, NLP, Text Generation, AGI

Recent Activity

liked a model about 2 months ago

GSAI-ML/ReFusion

liked a Space about 2 months ago

MCP-1st-Birthday/MCP-birthday-hackathon-certificate-generator

liked a model 7 months ago

rubenroy/GPT2-GCv2-100k

upvoted an article 7 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025 • 777

upvoted 2 papers 8 months ago

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Paper • 2506.17201 • Published Jun 20, 2025 • 57

Video World Models with Long-term Spatial Memory

Paper • 2506.05284 • Published Jun 5, 2025 • 55

upvoted a paper 9 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

upvoted a paper 10 months ago

WORLDMEM: Long-term Consistent World Simulation with Memory

Paper • 2504.12369 • Published Apr 16, 2025 • 35

upvoted 4 collections 12 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 182

Deepseek Papers

Deepseek papers collection • 29 items • Updated 3 days ago • 319

favs

my favorite models • 9 items • Updated Feb 2, 2025 • 4

DeepSeek-R1-abliterated

9 items • Updated May 30, 2025 • 124

upvoted a paper 12 months ago

Weak-to-Strong Diffusion with Reflection

Paper • 2502.00473 • Published Feb 1, 2025 • 24

upvoted a collection 12 months ago

DistilBERT release

Original DistilBERT model, checkpoints obtained from using teacher-student learning from the original BERT checkpoints. • 6 items • Updated Apr 17, 2024 • 36

upvoted 2 papers 12 months ago

DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis

Paper • 2309.12792 • Published Sep 22, 2023 • 1

MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 48

upvoted 7 collections about 1 year ago

Zurich 1.5B (GGUF)

Quantized versions of Zurich 1.5B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Qwen 2.5 1.5B Instruct • 12 items • Updated Feb 15, 2025 • 3

Geneva 12B (GGUF)

Quantized versions of Geneva 12B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Mistral NeMo Instruct 2407 • 12 items • Updated Feb 15, 2025 • 3

Zurich 14B (GGUF)

Quantized versions of Zurich 14B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Qwen 2.5 14B Instruct • 12 items • Updated Feb 15, 2025 • 3

Zurich 7B (GGUF)

Quantized versions of Zurich 7B Model Collection, compatible with llama.cpp. Quantized by mradermacher - Fine-tuned from Qwen 2.5 7B Instruct • 12 items • Updated Feb 15, 2025 • 4

Zurich 1.5B

The Zurich 1.5B Model Collection - Fine-tuned from Qwen 2.5 1.5B Instruct with GammaCorpus v2. • 6 items • Updated Feb 4, 2025 • 3

Gilgamesh

The Gilgamesh Model Collection, by Ruben Roy • 1 item • Updated Feb 4, 2025 • 3

GammaCorpus (CoT)

The GammaCorpus Dataset Collection for CoT (Chain of Thought) • 1 item • Updated Feb 4, 2025 • 9