Michael PRO
michaelfeil
AI & ML interests
ML Inference
Recent Activity
liked
a model
about 20 hours ago
meta-llama/Llama-4-Scout-17B-16E-Instruct
published
a model
3 days ago
michaelfeil/llama-405B-tllm-tp8-fp8kv
published
a model
3 days ago
michaelfeil/llama-405B-engine-h200-tp8-fp8kv-17-post1
Organizations
michaelfeil's activity
Please upload a "merged" version.
3
#1 opened 4 days ago
by
michaelfeil

How to combine `thinking on/off` prompt with existing system prompt.
1
#8 opened 5 days ago
by
michaelfeil

Add padding token to config (fix batched generation)
13
#1 opened 12 days ago
by
rawsh
infinity rerank inference
2
#3 opened 23 days ago
by
qdrddr
Whats the closest modeling code?
3
#9 opened 22 days ago
by
michaelfeil

Add yarn scaling
#1 opened about 1 month ago
by
michaelfeil

Adding `safetensors` variant of this model
#1 opened about 1 month ago
by
SFconvertbot

KaLM-embedding-multilingual-max-v1
4
#5 opened 2 months ago
by
gururaser
type mismatch
5
#6 opened about 2 months ago
by
michaelfeil

Update sentence_bert_config.json
#2 opened 3 months ago
by
michaelfeil

Update sentence bert config to 4096
#7 opened about 2 months ago
by
michaelfeil

fix: tokenizer fast
#1 opened 2 months ago
by
michaelfeil

fix: tokenizer
#1 opened 2 months ago
by
michaelfeil

dummy pr, do not merge! replace with dummy fast-tokenizer
1
#1 opened 2 months ago
by
michaelfeil

Update config.json to llama for easy loading
#2 opened 2 months ago
by
michaelfeil

update config on right class
#30 opened 2 months ago
by
michaelfeil

Update config.json
#16 opened 2 months ago
by
michaelfeil

do not merge! mistral example
#1 opened 3 months ago
by
michaelfeil
