Kamesh R (Kameshr) PRO
AI & ML interests

None yet

Recent Activity

updated a model 28 days ago
Kameshr/reasoning-small-1B
published a model 28 days ago
Kameshr/reasoning-small-1B
liked a dataset 28 days ago
Kameshr/tamil-sangam-text-excerpt

Organizations

Stanford AI, Gradio-Blocks-Party, ICML2023, CodeWiz, Sathyabama Institute of Science and Technology, AI Starter Pack, CoBuild Tech

Kameshr's activity

reacted to KaiChen1998's post with šŸ”„ about 1 month ago
šŸ“¢ Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!

šŸ¤— EMOVA is a novel end-to-end omni-modal LLM that can see, hear, and speak. Given omni-modal (i.e., textual, visual, and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional control via its speech decoder and style controller.

✨ EMOVA Highlights
āœ… State-of-the-art omni-modality: EMOVA achieves results comparable to the state of the art on both vision-language and speech benchmarks simultaneously.
āœ… Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
āœ… Modular design: we integrate multiple implementations of the vision encoder, vision projector, and language model, including the most recent DeepSeekMoE-tiny!

šŸ”„ You are all welcome to try and star!
- Project page: https://emova-ollm.github.io/
- Github: https://github.com/emova-ollm/EMOVA
- Demo: Emova-ollm/EMOVA-demo