Youmi Ma

maym15

AI & ML interests

None yet

Recent Activity

published a model about 1 month ago

maym15/Olmo-3-7B-Think-RetMask

published a model about 1 month ago

maym15/Olmo-3-7B-Instruct-RetMask

published a model about 1 month ago

maym15/Qwen3-8B-RetMask

Organizations

published 4 models about 1 month ago

updated a model about 1 month ago

maym15/Olmo-3-7B-Think-RetMask

Text Generation • 7B • Updated Apr 21 • 3

updated a collection about 1 month ago

RetMask

Collection

Trained checkpoints for the paper "From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models" • 4 items • Updated Apr 21

updated 3 models about 1 month ago

maym15/Olmo-3-7B-Instruct-RetMask

Text Generation • 7B • Updated Apr 21 • 4

maym15/Llama-3.1-8B-Instruct-RetMask

Text Generation • 8B • Updated Apr 20 • 3

maym15/Qwen3-8B-RetMask

Text Generation • 8B • Updated Apr 20 • 6

published a model 11 months ago

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.5

Text Generation • 8B • Updated Jun 25, 2025 • 2.09k • 19

updated 2 models 11 months ago

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.5

Text Generation • 8B • Updated Jun 25, 2025 • 2.09k • 19

tokyotech-llm/Llama-3.1-Swallow-8B-v0.5

8B • Updated Jul 1, 2025 • 396 • 9

updated a Space 11 months ago

README

🌍

updated 4 models about 1 year ago

tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4

Text Generation • 71B • Updated Jul 1, 2025 • 170 • 13

tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3

Text Generation • 71B • Updated Apr 2, 2025 • 845 • 13

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3

Text Generation • 8B • Updated Apr 2, 2025 • 7.78k • 24

tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.2

Text Generation • 8B • Updated Apr 2, 2025 • 36 • 16
