Victor Mustar's picture

Victor Mustar PRO

victor

AI & ML interests

Building the UX of this website

Recent Activity

updated a Space 39 minutes ago
victor/spaces-trending
published a Space 40 minutes ago
victor/spaces-trending
published a Space about 1 hour ago
victor/azeeaze
View all activity

Organizations

Hugging Face's profile picture Google's profile picture Safetensors's profile picture Competitions's profile picture 21 RNN's profile picture Spaces-explorers's profile picture Text Generation Inference's profile picture Spaces Examples's profile picture CVPR Demo Track's profile picture Hugging Chat's profile picture Webhooks Explorers (BETA)'s profile picture lora concepts library's profile picture Huggingface Projects's profile picture Scanned Tokens's profile picture hf admins's profile picture Hugging Face OSS Metrics's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Core ML Projects's profile picture temp-org's profile picture Blog-explorers's profile picture Mustarz's profile picture Open LLM Leaderboard's profile picture Enterprise Explorers's profile picture The Collectionists's profile picture ZeroGPU Explorers's profile picture Hugging Face Tools's profile picture TstOrg141's profile picture Stable Video benchmark's profile picture Social Post Explorers's profile picture Dev Mode Explorers's profile picture LLHF's profile picture SLLHF's profile picture Self-serve FTW's profile picture Inference Explorers's profile picture

victor's activity

reacted to nyuuzyou's post with ๐Ÿ‘ about 3 hours ago
view post
Post
956
I am planning to release *something big* this week, but in the meantime I was bored, so I quickly made a small dataset in as-is format.

๐Ÿ“ฑ Sponsr.ru Dataset - nyuuzyou/sponsr

Collection of 44,138 posts from Sponsr.ru, a Russian content subscription platform featuring:
- Comprehensive metadata including project details, post information, and pricing
- Detailed content categorization with images, videos, and text formats
- Monolingual Russian content from diverse creator projects
reacted to csabakecskemeti's post with ๐Ÿ‘ about 20 hours ago
view post
Post
2635
I'm collecting llama-bench results for inference with a llama 3.1 8B q4 and q8 reference models on varoius GPUs. The results are average of 5 executions.
The system varies (different motherboard and CPU ... but that probably that has little effect on the inference performance).

https://devquasar.com/gpu-gguf-inference-comparison/
the exact models user are in the page

I'd welcome results from other GPUs is you have access do anything else you've need in the post. Hopefully this is useful information everyone.
reacted to onekq's post with ๐Ÿ‘ about 20 hours ago
reacted to smirki's post with ๐Ÿ‘ about 20 hours ago
view post
Post
1453
Introducing a SMALL Reasoning React Model with State!
We did this by introducing a new form of reasoning that aligns with UI principles to do a layer of testing. For example:
"Looking back at all these pieces, we've considered state management, data structures, core functionalities etc"
And it comes in all sizes. Great for agents!
Tesslate/tessa-t1-react-reasoning-model-67e0fb72ca23e04473885c0e
Tesslate/Tessa-T1-14B
https://huggingface.co/smirki/Tessa-T1-14B-Q8_0-GGUF
reacted to MikeDoes's post with ๐Ÿ”ฅ about 20 hours ago
reacted to etemiz's post with ๐Ÿ‘€ 5 days ago
view post
Post
1657
Started fine tuning Gemma 3 using evolutionary approach. It is not the worst model according to AHA leaderboard and it is one of the smart according to lmarena.ai. My objective is to make it based, anti woke, wise, beneficial and then some.

Several GPUs are fine tuning it at the same time, each using a different dataset and using QLoRA and the successful ones are merged later. Compared to LoRa this allows faster training and also reduced overfitting because the merge operation heals overfitting. The problem with this could be the 4 bit quantization may make models dumber. But I am not looking for sheer IQ. Too much mind is a problem anyway :)

Has anyone tried parallel QLoRa and merge before?

I also automated the dataset selection and benchmarking and converging to objectives (the fit function, the reward). It is basically trying to get higher score in AHA Leaderboard as fast as possible with a diverse set of organisms that "evolve by training".

I want to release some cool stuff when I have the time:
- how an answer to a single question changes over time, with each training round or day
- a chart to show AHA alignment over training rounds
  • 3 replies
ยท
reacted to chansung's post with โค๏ธ 5 days ago
view post
Post
2434
Mistral AI Small 3.1 24B is not only commercial free but also the best model in a single GPU deployment.

I packed up all the information you need to know in a single picture. Hope this helps! :)
  • 1 reply
ยท
reacted to MohamedRashad's post with ๐Ÿ‘€ 5 days ago
reacted to sharpenb's post with ๐Ÿ”ฅ 5 days ago
view post
Post
3009
We open-sourced the pruna package that can be easily installed with pip install pruna :) It allows to easily ccompress and evaluate AI models including transformers and diffusers.

- Github repo: https://github.com/PrunaAI/pruna
- Documentation: https://docs.pruna.ai/en/stable/index.html

With open-sourcing, people can now inspect and contribute to the open code. Beyond the code, we provide detailed readme, tutorials, benchmarks, and documentation to make transparent compression, evaluation, and saving/loading/serving of AI models.

Happy to share it with you and always interested in collecting your feedback :)
  • 1 reply
ยท
reacted to daavoo's post with ๐Ÿ”ฅ 5 days ago
view post
Post
1965
๐Ÿค– ๐Ÿ—บMapped all(?) the swimming pools ๏ธ๐ŸŠ around another town with https://github.com/mozilla-ai/osm-ai-helper.

This time, I have mapped and contributed to https://www.openstreetmap.org more than 100 swimming pools around my wife's hometown. Only took about 20min to find them all (+~3 min verification) in a free Colab GPU๐Ÿš€

Try it yourself around a single point: mozilla-ai/osm-ai-helper
reacted to clem's post with ๐Ÿ”ฅ 5 days ago
view post
Post
2428
Nice new space to see how fast your personal or organization followers are growing on HF:
julien-c/follow-history

As you can see, I still have more followers than @julien-c even if he's trying to change this by building such cool spaces ๐Ÿ˜๐Ÿ˜๐Ÿ˜
reacted to csabakecskemeti's post with ๐Ÿ˜Ž 6 days ago
reacted to MikeDoes's post with ๐Ÿ‘€ 6 days ago
view post
Post
2058
#PII Masking Tech that does not **** around!

We are happy to release the OpenPII English Anonymiser โ€”the most powerful open-source tool for redacting sensitive info from English text.

Fine-tuned Modernbert on 5.7 million+ PII examples, itโ€™s clocking 99%+ accuracy across emails, dates, social numbers, and more!

Why itโ€™s a big deal:
โœ… Top-tier precision: 100% for passport numbers, 99.96% for emails*.
โœ… Totally free: MIT license for personal or commercial use.
โœ… No secrets: Full metrics shared on Hugging Face.

#AI #OpenSource #DataSecurity @huggingface

Day 2 out 7 of PII-Masking-1M Announcements Complete!

*Accuracies reported from the new OpenPII-500k dataset

ai4privacy/llama-ai4privacy-english-anonymiser-openpii
reacted to AdinaY's post with ๐Ÿ”ฅ 6 days ago
reacted to Jaward's post with ๐Ÿš€ 6 days ago
view post
Post
2060
Nvidia brings blue (from starwars droids) to life ๐Ÿคฏ, supercute with flawless dexterity and droid voice. It's the result of their colab research with Google DeepMind and Disney, revealed as part of their new opensource physics engine for robotics simulation: NEWTON - which enables robots to learn how to complete complex tasks with greater precision.

ReadMore: https://developer.nvidia.com/blog/announcing-newton-an-open-source-physics-engine-for-robotics-simulation?ncid=so-twit-820797-vt48
reacted to mrfakename's post with ๐Ÿ‘ 6 days ago
reacted to etemiz's post with ๐Ÿ‘ 7 days ago
replied to onekq's post 7 days ago
reacted to onekq's post with ๐Ÿš€ 7 days ago
view post
Post
2248
Introducing ๐ŸŽ‰ OneSQL-v0.1๐Ÿฅณ, our first text-to-SQL model based on Qwen2.5-Coder. This model has achieved an EX score of 63.33 on the BIRD leaderboard (https://bird-bench.github.io/).

The model family includes 7B and 32B,
onekq-ai/onesql-v01-qwen-67d8e3eb1611c5532bb90c5f
and can be also found on Ollama (https://ollama.com/onekq/OneSQL-v0.1-Qwen)

My goal is to make OneSQL the most usable open-weights model for text-to-SQL. I'm currently working on best practices to help users use this model the right away and avoid pitfalls. After that, I plan to train the next version to push for a higher EX score.

Enjoy this model and feel free to share comments/questions ๐Ÿค—
  • 1 reply
ยท
reacted to AdinaY's post with ๐Ÿš€ 7 days ago