Soonjae Lee
SoonjaeLee
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
replied to maxiw's post about 1 month ago
You can now try out computer use models from the hub to automate your local machine with https://github.com/askui/vision-agent. 💻

```
import time
from askui import VisionAgent

with VisionAgent() as agent:
    agent.tools.webbrowser.open_new("http://www.google.com")
    time.sleep(0.5)
    agent.click("search field in the center of the screen", model_name="Qwen/Qwen2-VL-7B-Instruct")
    agent.type("cats")
    agent.keyboard("enter")
    time.sleep(0.5)
    agent.click("text 'Images'", model_name="AskUI/PTA-1")
    time.sleep(0.5)
    agent.click("second cat image", model_name="OS-Copilot/OS-Atlas-Base-7B")
```

Currently these models are integrated with Gradio Spaces API. Also planning to add local inference soon!

Currently supported:
- https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct
- https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct
- https://huggingface.co/AskUI/PTA-1
- https://huggingface.co/OS-Copilot/OS-Atlas-Base-7B
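Since the post passes a different `model_name` to each `agent.click(...)` call, one way to experiment is to try the supported models in turn for a single instruction. The sketch below is only an illustration: `click_with_fallback` is a hypothetical helper, not part of askui, and it assumes that a failed click raises an exception, which the post does not confirm. It relies only on the `click(instruction, model_name=...)` call shape shown in the post.

```python
from typing import Iterable, List

# Model IDs copied from the "Currently supported" list in the post.
SUPPORTED_MODELS: List[str] = [
    "Qwen/Qwen2-VL-7B-Instruct",
    "Qwen/Qwen2-VL-2B-Instruct",
    "AskUI/PTA-1",
    "OS-Copilot/OS-Atlas-Base-7B",
]


def click_with_fallback(agent, instruction: str,
                        models: Iterable[str] = SUPPORTED_MODELS) -> str:
    """Hypothetical helper: try each model until one click does not raise.

    `agent` is any object with click(instruction, model_name=...),
    matching the call shape used in the post. Returns the name of the
    model whose click call completed without an exception.
    """
    errors = []
    for name in models:
        try:
            agent.click(instruction, model_name=name)
            return name
        except Exception as exc:  # assumption: failures surface as exceptions
            errors.append(f"{name}: {exc}")
    raise RuntimeError("no model could perform the click: " + "; ".join(errors))
```

Inside the `with VisionAgent() as agent:` block from the post, this could be called as `click_with_fallback(agent, "text 'Images'")`; the return value tells you which model handled the click.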
Organizations
None yet
SoonjaeLee's activity
replied to maxiw's post about 1 month ago
Thank you