Arash Ahmadian

ArashAhmadian

aahmadian_

AI & ML interests

None yet

Recent Activity

authored a paper 21 days ago

Command A: An Enterprise-Ready Large Language Model

published an article about 2 months ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

authored a paper 4 months ago

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

View all activity

Organizations

ArashAhmadian's activity

authored a paper 21 days ago

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published 22 days ago • 25

published an article about 2 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

and 3 others •

Mar 4

• 73

authored a paper 4 months ago

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Paper • 2412.04144 • Published Dec 5, 2024 • 5

published an article 6 months ago

Article

A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality

and 3 others •

Oct 24, 2024

• 60

authored 2 papers 10 months ago

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

Paper • 2407.02552 • Published Jul 2, 2024 • 4

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

Paper • 2309.05444 • Published Sep 11, 2023 • 1

published an article 11 months ago

Article

Putting RL back in RLHF

and 1 other •

Jun 12, 2024

• 87

updated 3 models 11 months ago

upvoted a paper 11 months ago

Self-Improving Robust Preference Optimization

Paper • 2406.01660 • Published Jun 3, 2024 • 20

updated 4 models 11 months ago

ArashAhmadian/ppo_6.9b_new

Text Generation • Updated Jun 7, 2024

ArashAhmadian/rloo_6.9b_new

Text Generation • Updated Jun 7, 2024

ArashAhmadian/rloo_7b_f

Feature Extraction • Updated Jun 6, 2024 • 5

ArashAhmadian/ppo_rloo_bp_7b

Feature Extraction • Updated Jun 6, 2024 • 3

authored a paper 11 months ago

Self-Improving Robust Preference Optimization

Paper • 2406.01660 • Published Jun 3, 2024 • 20

updated 4 models 11 months ago

ArashAhmadian/rloo_tldr_6.9b_defaultclip_512bs_05kl

Text Generation • Updated Jun 4, 2024

ArashAhmadian/rloo_tldr_6.9b_noratioclip

Text Generation • Updated Jun 1, 2024 • 11

ArashAhmadian/rloo_tldr_6.9b_ds2

Text Generation • Updated May 30, 2024 • 1

ArashAhmadian/rloo_tldr_6.9b

Text Generation • Updated May 27, 2024