Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
7
51
William Suffill
wsuff
Follow
shtefcs's profile picture
1 follower
ยท
58 following
wsuff
AI & ML interests
None yet
Recent Activity
reacted
to
MikeDoes
's
post
with ๐ฅ
10 days ago
๐ We are quite excited to announce the Ai4Privacy Python library! ๐ pip install ai4privacy to anonymize short english text with OpenPII Masking 500k labels ๐ Day 5/7 of PII Masking 1M announcements complete! โฐ
reacted
to
m-ric
's
post
with โค๏ธ
10 days ago
๐ DeepSeek R1 moment has come for GUI agents: Rule-based Reinforcement Learning gives better results than SFT with 500x smaller datasets! Traditionally (by which I mean "in the last few months"), GUI agents have been trained with supervised fine-tuning (SFT). This meant, collecting huge datasets of screen captures from people using computers, and using these to fine-tune your model. ๐ ๐ But last week, a new paper introduced UI-R1, applying DeepSeek's R1-style rule-based reinforcement learning (RL) specifically to GUI action prediction tasks. This is big news: with RL, maybe we could build good agents without the need for huge datasets. UI-R1 uses a unified reward function that evaluates multiple responses from models, optimizing via policy algorithms like Group Relative Policy Optimization (GRPO). Specifically, the reward function assesses: ๐ฏ Action type accuracy: Does the predicted action match the ground truth? ๐ Coordinate accuracy (specifically for clicks): Is the predicted click within the correct bounding box? ๐ Output format: Does the model clearly articulate both its reasoning and final action? Using just 136 carefully selected mobile tasksโcompared to 76,000 tasks for larger models like OS-AtlasโUI-R1 shows significant efficiency and improved performance: ๐ Boosted action prediction accuracy from 76% to 89% on AndroidControl. ๐ Outperformed larger, SFT-trained models (e.g., OS-Atlas-7B), demonstrating superior results with vastly fewer data points (136 tasks vs. 76K). ๐ Enhanced adaptability and generalization, excelling even in out-of-domain scenarios. The paper tests this RL-based method only in low-level GUI tasks. Could it generalize to more complex interactions? ๐ง Read the full paper here ๐ https://huggingface.co/papers/2503.21620
reacted
to
zamal
's
post
with ๐
10 days ago
DeepGit: Your GitHub Gold Digger! ๐ฐ๐ Hey Hugging Face gang! Meet DeepGitโmy open-source sidekick that rips through GitHub to snag repos that fit you. Done with dead-end searches? Me too. Built it with LangGraph and some dope tricks: Embeddings grab the good stuff (HF magic, baby!) Re-ranking nails the best picks Snoops docs, code, and buzz in one slick flow Drops a clean list of hidden gems ๐ Unearth that sneaky ML lib or Python gemโrun python app.py or langgraph dev and boom! Peek it at https://github.com/zamalali/DeepGit. Fork it, tweak it, love itโDockerโs in, HF vibes are strong. Drop a ๐ or a crazy ideaโIโm pumped to jam with you all! ๐ช
View all activity
Organizations
None yet
wsuff
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
allenai/tulu-2-dpo-7b
about 1 year ago
GitHub Model Source Repository Link
#1 opened about 1 year ago by
wsuff