Magnushammer: A Transformer-based Approach to Premise Selection Paper • 2303.04488 • Published Mar 8, 2023
Structured Packing in LLM Training Improves Long Context Utilization Paper • 2312.17296 • Published Dec 28, 2023 • 2
Hierarchical Transformers Are More Efficient Language Models Paper • 2110.13711 • Published Oct 26, 2021
Analysing The Impact of Sequence Composition on Language Model Pre-Training Paper • 2402.13991 • Published Feb 21, 2024 • 1