Mohammed Brıman

mohammedbriman

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision

Recent Activity

updated a collection 3 days ago

To read... eventually

upvoted a paper 3 days ago

Command A: An Enterprise-Ready Large Language Model

upvoted an article 10 days ago

Training and Finetuning Reranker Models with Sentence Transformers v4

Organizations

None yet

mohammedbriman's activity

upvoted a paper 3 days ago

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published 4 days ago • 20

upvoted an article 10 days ago

Training and Finetuning Reranker Models with Sentence Transformers v4

Article • 11 days ago • 96

upvoted 2 papers 12 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 16 days ago • 46

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published 16 days ago • 82

upvoted an article 16 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 25 days ago • 371

upvoted 3 papers 20 days ago

upvoted 2 papers 2 months ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 28

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 373

upvoted 9 papers 3 months ago

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 23

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 48

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 22

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 54

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 273

Monolith: Real Time Recommendation System With Collisionless Embedding Table

Paper • 2209.07663 • Published Sep 16, 2022 • 1

Human-Timescale Adaptation in an Open-Ended Task Space

Paper • 2301.07608 • Published Jan 18, 2023 • 1

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 73