Deqing Fu PRO

deqing

https://deqingfu.github.io

AI & ML interests

None yet

Recent Activity

updated a model about 16 hours ago

deqing/fone-llama-3.2-1B-dclm-100BT-fone3d-hybrid-tile-v1

1B • Updated about 1 hour ago • 970

published a model 12 days ago

deqing/fone-llama-3.2-1B-dclm-100BT-fone3d-hybrid-tile-v1

1B • Updated about 1 hour ago • 970

updated a model 13 days ago

deqing/vanilla-llama-3.2-1B-dclm-100BT-v1

1B • Updated 12 days ago • 2.08k • 1

authored a paper 14 days ago

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Paper • 2604.20817 • Published 22 days ago • 7

upvoted a collection 15 days ago

Convergent Evolution

4 items • Updated 21 days ago • 1

upvoted a paper 15 days ago

Pre-trained Large Language Models Use Fourier Features to Compute Addition

Paper • 2406.03445 • Published Jun 5, 2024 • 1

liked a model 17 days ago

deqing/vanilla-llama-3.2-1B-dclm-100BT-v1

1B • Updated 12 days ago • 2.08k • 1

updated a model 18 days ago

deqing/vanilla-llama-3.2-1B-fineweb-sample-100BT-v4

1B • Updated 18 days ago • 549 • 1

upvoted a paper 20 days ago

Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks

Paper • 2604.11610 • Published about 1 month ago • 7

published a model 21 days ago

deqing/vanilla-llama-3.2-1B-dclm-100BT-v1

1B • Updated 12 days ago • 2.08k • 1

upvoted a paper 21 days ago

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Paper • 2604.20817 • Published 22 days ago • 7

updated a collection 21 days ago

Convergent Evolution

4 items • Updated 21 days ago • 1

submitted a paper to Daily Papers 21 days ago

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Paper • 2604.20817 • Published 22 days ago • 7

updated a model 28 days ago

deqing/fone-llama-3.2-1B-fineweb-sample-100BT-fone3d-hybrid-tile-v4

Updated Apr 3 • 597 • 3

liked a model about 1 month ago

deqing/vanilla-llama-3.2-1B-fineweb-sample-100BT-v4

1B • Updated 18 days ago • 549 • 1

updated 2 collections about 1 month ago

Convergent Evolution (Architecture and Optimizer)

8 items • Updated Apr 10

Convergent Evolution (Data)

10 items • Updated Apr 10