YASH AKHAURI's picture

9 1

YASH AKHAURI

akhauriyash

AI & ML interests

None yet

Recent Activity

authored a paper about 14 hours ago

ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models

authored a paper about 14 hours ago

Attamba: Attending To Multi-Token States

authored a paper about 14 hours ago

TokenButler: Token Importance is Predictable

View all activity

Organizations

None yet

akhauriyash's activity

authored 3 papers about 14 hours ago

ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models

Paper • 2406.16635 • Published Jun 24, 2024

Attamba: Attending To Multi-Token States

Paper • 2411.17685 • Published Nov 26, 2024

TokenButler: Token Importance is Predictable

Paper • 2503.07518 • Published 2 days ago

New activity in akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler 1 day ago

Remove file information

#2 opened 1 day ago by

updated a collection 1 day ago

TokenButler

TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity! • 6 items • Updated 1 day ago • 2

New activity in akhauriyash/Llama-3.2-3B-Butler 1 day ago

Remove file information section

#3 opened 1 day ago by

New activity in akhauriyash/Llama-3.2-1B-Butler 1 day ago

Add text-generation pipeline tag

#2 opened 1 day ago by

Adding `safetensors` variant of this model

#1 opened 2 days ago by

New activity in akhauriyash/Llama-3.2-3B-Butler 1 day ago

Add pipeline tag, correct base model name, and link Wandb logs

#2 opened 1 day ago by

Adding `safetensors` variant of this model

#1 opened 2 days ago by

New activity in akhauriyash/Llama-3.1-8B-Butler 1 day ago

Add pipeline tag and link to Github repository

#1 opened 1 day ago by

New activity in akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler 1 day ago

Add pipeline tag and link to project page

#1 opened 1 day ago by

New activity in akhauriyash/Llama-2-7b-hf-Butler 1 day ago

Add pipeline tag: text-generation

#1 opened 1 day ago by

updated 5 models 2 days ago

akhauriyash/Llama-3.2-1B-Butler

Text Generation • Updated 1 day ago • 44

akhauriyash/Llama-3.2-3B-Butler

Text Generation • Updated 1 day ago • 10

akhauriyash/Llama-2-7b-hf-Butler

Text Generation • Updated 1 day ago • 19

akhauriyash/Llama-3.1-8B-Butler

Text Generation • Updated 1 day ago • 10

akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler

Text Generation • Updated 1 day ago • 16

upvoted a collection 2 days ago

TokenButler

TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity! • 6 items • Updated 1 day ago • 2

updated a collection 2 days ago

TokenButler

TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity! • 6 items • Updated 1 day ago • 2