Arthur Douillard

ArthurDouillard

https://arthurdouillard.com/

AI & ML interests

Continual Learning, Computer Vision, Transformers

Organizations

None yet

ArthurDouillard's activity

commented on a paper 1 day ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 2 days ago • 20

upvoted a paper 2 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 2 days ago • 20

commented on a paper 2 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 2 days ago • 20

upvoted 2 papers 6 months ago

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 39

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 19

upvoted a paper 7 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

authored 4 papers 7 months ago

DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 15

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Paper • 2211.11747 • Published Nov 15, 2022

PODNet: Pooled Outputs Distillation for Small-Tasks Incremental Learning

Paper • 2004.13513 • Published Apr 28, 2020

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24, 2024 • 23

upvoted a paper 7 months ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24, 2024 • 23

authored a paper 11 months ago

DiPaCo: Distributed Path Composition

Paper • 2403.10616 • Published Mar 15, 2024 • 13

upvoted a paper about 1 year ago

Asynchronous Local-SGD Training for Language Modeling

Paper • 2401.09135 • Published Jan 17, 2024 • 11

authored a paper about 1 year ago

Asynchronous Local-SGD Training for Language Modeling

Paper • 2401.09135 • Published Jan 17, 2024 • 11

upvoted a paper about 1 year ago

DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 15