-
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 12 -
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 7 -
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 12 -
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 90
daje kang
daje
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 8 hours ago
daje/ko-hatefulmemes_train_8500
updated
a model
about 10 hours ago
daje/qwen2-7b-instruct-hamful-detector
updated
a dataset
about 12 hours ago
daje/ko-hatefulmemes_train_2000
Organizations
None yet
Collections
1
models
27
daje/qwen2-7b-instruct-hamful-detector
Image-Text-to-Text
•
Updated
daje/Qwen2.5-coder-7B-en-all-merged
Text Generation
•
Updated
•
8
daje/Qwen2.5-coder-7B-ko-all
Updated
daje/llama3-8B-ko-all
Updated
daje/Qwen2.5-coder-7B-en-all
Updated
daje/ko-sql-Qwen-2.5-coder-7B-instruct-15000
Updated
daje/ko-sql-Qwen-2.5-coder-7B-instruct-all
Updated
daje/ko-sql-Qwen-2.5-coder-7B-instruct
Updated
daje/Qwen2-VL-72B-instruct-ScienceQA
Updated
•
3
daje/Qwen2-VL-72B-instruct-ScienceQA-LoRA
Updated
datasets
11
daje/ko-hatefulmemes_train_8500
Updated
daje/ko-hatefulmemes_train_2000
Updated
daje/Ko-SciecneQA
Viewer
•
Updated
•
12.7k
•
54
daje/keyword_summary
Viewer
•
Updated
•
1k
•
102
daje/kotext-to-sql-v1
Viewer
•
Updated
•
262k
•
76
•
1
daje/mistral_tokenized_en_wiki
Viewer
•
Updated
•
16.1M
•
44
daje/mistral_tokenized_ko_wiki
Viewer
•
Updated
•
1.7M
•
28
daje/tokenized_enwiki
Viewer
•
Updated
•
16.4M
•
112
daje/tokenized_kowiki
Viewer
•
Updated
•
1.71M
•
29
daje/en_wiki
Viewer
•
Updated
•
5.09M
•
31