Shayekh Islam's picture

Shayekh Islam

shayekh

·

https://shayekhbinislam.github.io/

AI & ML interests

Natural Language Processing, Reinforcement Learning

Recent Activity

upvoted a collection 9 days ago

upvoted a collection 9 days ago

C4AI Aya Vision

updated a collection 9 days ago

Global Exams: Bangladesh (Localized MMLU) [ICLR'25]

View all activity

Organizations

shayekh's activity

upvoted 2 collections 9 days ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 3 days ago • 96

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 12 days ago • 64

upvoted a collection 13 days ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 8 items • Updated 14 days ago • 25

upvoted a collection about 1 month ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 5 days ago • 105

upvoted 2 collections 2 months ago

Global Exams: Bangladesh (Localized MMLU) [ICLR'25]

Exams dataset in Bangladesh (Bengali, English) • 4 items • Updated 9 days ago • 1

Retrieval-Augmented Generation [EMNLP'24]

Artifacts for "Open-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models" [EMNLP 2024 Findings] • 5 items • Updated 24 days ago • 2

upvoted 3 papers 3 months ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published Dec 10, 2024 • 27

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 48

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published Nov 29, 2024 • 12

upvoted a collection 5 months ago

Multilingual RewardBench (M-RewardBench)

Multilingual Reward Model Evaluation Dataset and Results • 3 items • Updated 11 days ago • 4

upvoted 4 papers 5 months ago

How to Evaluate Reward Models for RLHF

Paper • 2410.14872 • Published Oct 18, 2024 • 1

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17, 2024 • 1

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20, 2024 • 12