
Max Rubin

maxrubin629

AI & ML interests

None yet

Recent Activity

updated a model about 14 hours ago
maxrubin629/Phi-4-mini-instruct-Q8-mlx
published a model about 14 hours ago
maxrubin629/Phi-4-mini-instruct-Q8-mlx
updated a model 9 days ago
maxrubin629/Arcee-Blitz-Q4-mlx

Organizations

None yet

maxrubin629's activity

reacted to mkurman's post with 👍 20 days ago
Blurred-Thoughts Supervised-Finetuning 🙈

After hours of working with GitHub Copilot to organize the code, I'm excited to announce the release of Blurred Thoughts Supervised-Finetuning (BT-SFT), a new method for fine-tuning LLMs to produce more diverse and creative responses.

BT-SFT introduces:
✅ A smart tokenization method that randomly masks tokens within <think> ... </think> tags, encouraging the model to generate diverse responses that align better with its own probability distribution instead of memorizing the thought process from distilled data.
✅ A reward function that ensures responses are well-structured.
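
To make the two ideas above concrete, here is a rough Python sketch. It is not taken from the BT-SFT repository: the function names `blur_think_labels` and `structure_reward`, the `mask_prob` parameter, the whitespace-level tokens, and the choice to mask training labels (rather than input tokens) are all assumptions made for illustration.

```python
import random
import re

# -100 is the default ignore_index of PyTorch's cross-entropy loss;
# using it as the "masked" label value is an assumption of this sketch.
IGNORE_INDEX = -100


def blur_think_labels(tokens, labels, mask_prob=0.3, seed=None):
    """Return a copy of `labels` where a random subset of positions that lie
    between <think> and </think> is replaced by IGNORE_INDEX.

    The tags themselves and everything outside them are left untouched.
    """
    rng = random.Random(seed)
    out = list(labels)
    inside = False
    for i, tok in enumerate(tokens):
        if tok == "<think>":
            inside = True
            continue
        if tok == "</think>":
            inside = False
            continue
        if inside and rng.random() < mask_prob:
            out[i] = IGNORE_INDEX
    return out


# Hypothetical structure check: one non-empty <think> block, then an answer.
THINK_THEN_ANSWER = re.compile(r"^\s*<think>.+?</think>\s*\S.*$", re.DOTALL)


def structure_reward(text):
    """Return 1.0 if `text` is exactly one non-empty <think>...</think> block
    followed by a non-empty answer, otherwise 0.0."""
    if text.count("<think>") != 1 or text.count("</think>") != 1:
        return 0.0
    return 1.0 if THINK_THEN_ANSWER.match(text) else 0.0


if __name__ == "__main__":
    toks = "<think> first check the units </think> 42 km".split()
    print(blur_think_labels(toks, list(range(len(toks))), mask_prob=0.5, seed=0))
    print(structure_reward("<think> first check the units </think> 42 km"))
```

Masked positions contribute nothing to the loss, so the model is not pushed to reproduce every token of the distilled reasoning trace, while the answer after </think> is still fully supervised.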

Explore and contribute to the project available in my GitHub repository:
https://github.com/mkurman/blurred-thoughts-SFT

Keep me updated on your experiments with BT-SFT!