Mohammed Mohammed Ali's picture

10 2

Mohammed Mohammed Ali

MohammedEltoum

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

upvoted a paper 2 months ago

Natural Language Reinforcement Learning

upvoted a paper 4 months ago

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

View all activity

Organizations

MohammedEltoum's activity

upvoted a paper about 18 hours ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published 5 days ago • 15

upvoted a paper 2 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

upvoted a paper 4 months ago

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4, 2024 • 36

upvoted a collection 4 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 26 days ago • 294

upvoted a paper 4 months ago

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28, 2024 • 19

upvoted a collection 4 months ago

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated 20 days ago • 69

upvoted a paper 4 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 106

upvoted 3 papers 5 months ago

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Paper • 2409.08513 • Published Sep 13, 2024 • 12

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 12

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Paper • 2408.02555 • Published Aug 5, 2024 • 29