Michael PRO

michaelfeil

https://michaelfeil.eu

AI & ML interests

ML Inference

Recent Activity

new activity 27 days ago

microsoft/harrier-oss-v1-270m:b10-poc

new activity about 1 month ago

Skywork/Skywork-Reward-Llama-3.1-8B-v0.2:Update config.json

Organizations

New activity in microsoft/harrier-oss-v1-270m 27 days ago

b10-poc

#3 opened 27 days ago by

michaelfeil

New activity in Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 about 1 month ago

Update config.json

#4 opened about 1 month ago by

michaelfeil

New activity in baseten-admin/bert-base-ner-uncased 5 months ago

Create modules.json

#1 opened 5 months ago by

michaelfeil

New activity in voyageai/voyage-4-nano 5 months ago

Alt modeling code

#5 opened 6 months ago by

michaelfeil

New activity in gradientai/Llama-3-8B-Instruct-Gradient-1048k 5 months ago

Update Readme link

#30 opened 5 months ago by

michaelfeil

New activity in nvidia/llama-embed-nemotron-8b 5 months ago

Upstream transformers support with `use_bidirectional_attention`

#13 opened 6 months ago by

michaelfeil

New activity in voyageai/voyage-4-nano 6 months ago

Add a config hint for use_linear_output_projection

#6 opened 6 months ago by

michaelfeil

NIT Update config.json

#4 opened 6 months ago by

michaelfeil

framework support via use_bidirectional_attention - Cheers from your friends at Baseten

#3 opened 6 months ago by

michaelfeil

New activity in nvidia/llama-nemotron-embed-1b-v2 6 months ago

"use_bidirectional_attention": true flag

#13 opened 6 months ago by

michaelfeil

New activity in Qwen/Qwen3-30B-A3B-Instruct-2507 6 months ago

dummy config.json

#26 opened 6 months ago by

michaelfeil

New activity in jinaai/jina-code-embeddings-0.5b 8 months ago

missing tokenizer

#3 opened 8 months ago by

michaelfeil

New activity in baseten/Llama-3.2-3B-Instruct-pythonic 9 months ago

Update chat_template.jinja

#1 opened 9 months ago by

baseten-admin

New activity in Snowflake/snowflake-arctic-embed-l-v2.0 10 months ago

Set CTX Length to 2048

#19 opened 10 months ago by

michaelfeil

New activity in huggingface/InferenceSupport 10 months ago

moonshotai/Kimi-K2-Instruct-0905

#4638 opened 10 months ago by

michaelfeil

New activity in sentence-transformers/all-MiniLM-L6-v2 10 months ago

Update config.json

#130 opened 10 months ago by

michaelfeil

New activity in baseten/Kimi-K2-Instruct-FP4 11 months ago

remove-tiktoken

🔥 1

#1 opened 12 months ago by

bdubayah

New activity in nomic-ai/nomic-embed-text-v1.5 11 months ago

Update usage with infinity

#36 opened over 1 year ago by

michaelfeil

New activity in zeroentropy/zerank-1-small-reranker 12 months ago

Missing weights in architecture

👀 1

#2 opened 12 months ago by

m-ric

New activity in mistralai/Mistral-Small-3.2-24B-Instruct-2506 12 months ago

Add-huggingface-tokenizer

👍 10

#24 opened 12 months ago by

michaelfeil
