tdh111
tdh111
AI & ML interests
None yet
Recent Activity
new activity
3 days ago
mradermacher/DeepSeek-V3-i1-GGUF:imatrix.dat missing output.weight and token_embd.weight
new activity
11 days ago
mradermacher/model_requests:https://huggingface.co/deepseek-ai/DeepSeek-V3-Base
Organizations
None yet
tdh111's activity
imatrix.dat missing output.weight and token_embd.weight
3
#1 opened 3 days ago
by
tdh111
Quantum Entanglement and the Sentient Toaster: Revolutionizing LLM Training
209
#3 opened about 1 month ago
by
mradermacher
https://huggingface.co/deepseek-ai/DeepSeek-V3-Base
21
#515 opened 20 days ago
by
nicoboss
Can Llama-3.1- Nemotron-40B-Instruct be released as well?
#24 opened 13 days ago
by
tdh111
What is the context size this model was trained on?
2
#23 opened 17 days ago
by
treehugg3
2 abc or not 2 abc
386
#2 opened 5 months ago
by
mradermacher
Nemotron 51B
15
#436 opened about 2 months ago
by
AIGUYCONTENT
Disable all languages except English
1
#3 opened 4 months ago
by
codewalker7
https://huggingface.co/Downtown-Case/Star-Command-R-Lite-32B-v1
8
#417 opened 2 months ago
by
DazzlingXeno
https://huggingface.co/Downtown-Case/Star-Command-R-Lite-32B-v1
8
#417 opened 2 months ago
by
DazzlingXeno