Stanisław Szymczyk
sszymczyk
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
perplexity-ai/r1-1776:This model performs worse in complex problems compared to the DeepSeek R1
new activity
about 1 month ago
perplexity-ai/r1-1776:This model performs worse in complex problems compared to the DeepSeek R1
new activity
3 months ago
MiniMaxAI/MiniMax-Text-01:Requesting Support for GGUF Quantization of MiniMax-Text-01 through llama.cpp
Organizations
None yet
sszymczyk's activity
This model performs worse in complex problems compared to the DeepSeek R1
17
#254 opened about 2 months ago
by
sszymczyk
Requesting Support for GGUF Quantization of MiniMax-Text-01 through llama.cpp
19
4
#1 opened 3 months ago
by
Doctor-Chad-PhD

Missing tokenizer.json and tokenizer_config.json files
1
#2 opened 3 months ago
by
sszymczyk
Please add the "tokenizer.model" file
2
#3 opened 3 months ago
by
ken133
Hardware Requirements
6
#1 opened 5 months ago
by
Lightchain
Can you provide code for inference with MCTS?
6
6
#3 opened 5 months ago
by
sszymczyk
Reason behind not using special tokens in the prompt format?
2
#2 opened 5 months ago
by
Doctor-Shotgun

The curse of the Consolidated Safetensors strikes again...
2
#4 opened 5 months ago
by
jukofyork

The model often enters infinite generation loops
5
13
#32 opened 9 months ago
by
sszymczyk
Calculation of _mscale during YARN RoPE scaling
1
#4 opened 11 months ago
by
sszymczyk
Wrong BOS and EOS tokens in tokenizer.model file
1
1
#12 opened 12 months ago
by
sszymczyk
Problem with repeated generation of newline characters
2
#3 opened 12 months ago
by
sszymczyk
Possible error in tokenizer.json
4
6
#6 opened about 1 year ago
by
sszymczyk
Model is paraphrasing text instead of citing it verbatim
3
#7 opened about 1 year ago
by
sszymczyk