tdh111
AI & ML interests
None yet
Recent Activity
new activity
about 2 hours ago
microsoft/bitnet-b1.58-2B-4T-gguf:Chat template issue
updated a model
about 3 hours ago
tdh111/bitnet-b1.58-2B-4T-GGUF
new activity
about 6 hours ago
microsoft/bitnet-b1.58-2B-4T-gguf:TQ1 quant version
Organizations
None yet
tdh111's activity
Chat template issue
#8 opened about 2 hours ago
by
tdh111
TQ1 quant version
3
#7 opened about 14 hours ago
by
TobDeBer
Whimsical Waffle: The Curious Case of LLMs and Their Linguistic Shenanigans
331
#4 opened about 1 month ago
by
mradermacher
Token issue
1
2
#3 opened 12 days ago
by
tdh111
public mradermacher discussions
5
#5 opened 22 days ago
by
mradermacher
https://huggingface.co/deepseek-ai/DeepSeek-V3-0324
20
#797 opened 29 days ago
by
nicoboss
Quantum Entanglement and the Sentient Toaster: Revolutionizing LLM Training
573
#3 opened 5 months ago
by
mradermacher
Hope you are okay
3
#11 opened 5 months ago
by
BigHuggyD

Quantizer Tool
2
#14 opened 3 months ago
by
TobDeBer
imatrix.dat missing output.weight and token_embd.weight
7
#1 opened 3 months ago
by
tdh111
https://huggingface.co/deepseek-ai/DeepSeek-V3-Base
21
#515 opened 4 months ago
by
nicoboss
What is the context size this model was trained on?
2
#23 opened 4 months ago
by
treehugg3
2 abc or not 2 abc
386
#2 opened 8 months ago
by
mradermacher
Nemotron 51B
15
#436 opened 5 months ago
by
AIGUYCONTENT

Disable all languages except English
1
#3 opened 7 months ago
by
codewalker7

https://huggingface.co/Downtown-Case/Star-Command-R-Lite-32B-v1
8
#417 opened 6 months ago
by
DazzlingXeno