11 8

Jonathan Mamou

jmamou

jmamou

AI & ML interests

None yet

Recent Activity

published an article 12 days ago

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

upvoted an article 16 days ago

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

upvoted a paper about 2 months ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

View all activity

Organizations

jmamou's activity

published an article 12 days ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

and 8 others •

12 days ago

• 17

upvoted an article 16 days ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

and 8 others •

12 days ago

• 17

upvoted a paper about 2 months ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published Feb 13 • 16

updated a dataset 3 months ago

jmamou/augmented-glue-sst2

Viewer • Updated Jan 19 • 796k • 119

upvoted a paper 4 months ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 10

published an article 5 months ago

Article

Universal Assisted Generation: Faster Decoding with Any Assistant Model

and 7 others •

Oct 29, 2024

• 55

upvoted an article 6 months ago

Article

Faster Assisted Generation with Dynamic Speculation

and 6 others •

Oct 8, 2024

• 46

New activity in huggingface/documentation-images 6 months ago

Create dynamic_speculation_lookahead/

#373 opened 6 months ago by

jmamou

Upload 3 files

#374 opened 6 months ago by

jmamou

published an article 6 months ago

Article

Faster Assisted Generation with Dynamic Speculation

and 6 others •

Oct 8, 2024

• 46

upvoted a paper 8 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 37

upvoted 2 papers 10 months ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 18

authored 5 papers 11 months ago

Term Set Expansion based on Multi-Context Term Embeddings: an End-to-end Workflow

Paper • 1807.10104 • Published Jul 26, 2018 • 1

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

published an article about 1 year ago

Article

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

and 10 others •

Jan 30, 2024

• 9

updated a dataset over 1 year ago

jmamou/dolly_hhrlhf_with_dev

Viewer • Updated Jul 25, 2023 • 64.4k • 78