-
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
Paper • 2402.17553 • Published • 26 -
Learning to Generate Unit Tests for Automated Debugging
Paper • 2502.01619 • Published • 4 -
gitbugactions/gitbug-java
Viewer • Updated • 199 • 27 • 2 -
rufimelo/defects4j
Viewer • Updated • 467 • 95 • 3
Moshood Fakorede
thefabdev
·
AI & ML interests
None yet
Recent Activity
liked a dataset about 1 month ago
MobileDev-Bench/mobiledev-bench upvoted a paper about 1 month ago
MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development updated a collection about 1 month ago
Academic Papers