- Cached Transformers: Improving Transformers with Differentiable Memory Cache
  Paper • 2312.12742 • Published • 14
- ProTIP: Progressive Tool Retrieval Improves Planning
  Paper • 2312.10332 • Published • 8
- Paloma: A Benchmark for Evaluating Language Model Fit
  Paper • 2312.10523 • Published • 13
- The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
  Paper • 2406.17557 • Published • 96
daje kang
daje
AI & ML interests
None yet
Recent Activity
updated a dataset 5 days ago: daje/synthetic-ko-sql-hard-add-llm-result
published a dataset 5 days ago: daje/synthetic-ko-sql-hard-add-llm-result
updated a dataset 6 days ago: daje/synthetic-ko-sql-hard
Organizations
None yet
Collections
1
models
39
daje/Meta-Llama-3.1-8B-Instruct-de-identification • Updated • 1
daje/Qwen2.5-14B-Instruct-tools • Text Generation • Updated • 1
daje/model_0.0002_alpha-32_r-64 • Updated • 8
daje/model_0.0002_alpha-8_r-16 • Updated • 8
daje/model_5e-05_alpha-128_r-256 • Updated • 7
daje/model_2e-4_alpha-8_r-16 • Updated • 4
daje/model_Lora • Updated • 4
daje/model_2e-4 • Updated • 3
daje/model • Updated • 4
daje/Qwen2-7B-Instruct-harmful_detector_2000-H100_1 • Updated • 4
datasets
16
daje/synthetic-ko-sql-hard-add-llm-result • Viewer • Updated • 1.68k • 41
daje/synthetic-ko-sql-hard • Viewer • Updated • 1.68k • 20 • 1
daje/kotext-to-sql-v1-hard • Viewer • Updated • 2k • 36
daje/de-identify-chat-ko • Viewer • Updated • 9.92k • 50
daje/ko-hatefulmemes_train_8500 • Viewer • Updated • 8.2k • 38
daje/ko-hatefulmemes_train_8500_kmhas • Viewer • Updated • 95.3k • 61
daje/ko-hatefulmemes_train_2000 • Viewer • Updated • 1.91k • 23
daje/Ko-SciecneQA • Viewer • Updated • 12.7k • 28
daje/keyword_summary • Viewer • Updated • 1k • 70
daje/kotext-to-sql-v1 • Viewer • Updated • 262k • 62 • 2