Mohammed Machrouh

medmac01

AI & ML interests

NLP, Cyber Security

Organizations

- College of Computing at UM6P, Ben Guerir, Morocco
- Arabic Machine Learning
- Stable Diffusion Dreambooth Concepts Library
- Blog-explorers
- ASAS AI
- Nt3awnou
- Mixed Arabic Datasets
- AIniacs
- ZeroGPU Explorers
- 2A2I Legacy Models & Datasets
- 2A2I
- MLX Community
- Moroccan Data Scientists
- smart-فلاح
- Social Post Explorers
- Dev Mode Explorers
- Hugging Face Discord Community
- Data Is Better Together Contributor

medmac01's activity

reacted to alvarobartt's post with 🔥 about 1 hour ago
🔥 Agents can do anything! @microsoft Research just announced the release of Magma 8B!

Magma is a new Visual Language Model (VLM) with 8B parameters for multi-modal agents, designed to handle complex interactions across virtual and real environments, and it's MIT-licensed!

Magma comes with exciting new features such as:
- Introduces the Set-of-Mark and Trace-of-Mark techniques for fine-tuning
- Leverages a large amount of unlabeled video data to learn spatial-temporal grounding and planning
- Strong generalization and the ability to be fine-tuned for other agentic tasks
- SOTA on multi-modal benchmarks spanning UI navigation, robotics manipulation, image / video understanding, and spatial understanding and reasoning
- Generates goal-driven visual plans and actions for agentic use cases

Model: microsoft/Magma-8B
Technical Report: Magma: A Foundation Model for Multimodal AI Agents (2502.13130)
upvoted an article about 3 hours ago
FastRTC: The Real-Time Communication Library for Python

upvoted an article 23 days ago