arxiv:2412.04626
Torsten Scholak
tscholak
AI & ML interests
NLP, semantic parsing, program synthesis, deep learning for code
Recent Activity
authored
a paper
7 days ago
BigDocs: An Open and Permissively-Licensed Dataset for Training
Multimodal Models on Document and Code Tasks
Organizations
Papers
3
models
8
tscholak/2jrayxos
Text2Text Generation
•
Updated
•
60
•
2
tscholak/2e826ioa
Text2Text Generation
•
Updated
•
9
•
7
tscholak/1wnr382e
Text2Text Generation
•
Updated
•
12
•
3
tscholak/1zha5ono
Text2Text Generation
•
Updated
•
22
•
4
tscholak/cxmefzzi
Text2Text Generation
•
Updated
•
195
•
30
tscholak/3vnuv1vf
Text2Text Generation
•
Updated
•
225
•
10
tscholak/t5.1.1.lm100k.base
Text2Text Generation
•
Updated
•
49
tscholak/t5.1.1.lm100k.large
Text2Text Generation
•
Updated
•
20
•
1
datasets
None public yet