Steven Goldfeather

treehugg3
ยท

AI & ML interests

None yet

Recent Activity

Organizations

None yet

treehugg3's activity

reacted to merterbak's post with ๐Ÿ”ฅ 9 days ago
view post
Post
3000
OpenAI has released BrowseComp an open source benchmark designed to evaluate the web browsing capabilities of AI agents. This dataset comprising 1,266 questions challenges AI models to navigate the web and uncover complex and obscure information. Crafted by human trainers, the questions are intentionally difficult. (unsolvable by another person in under ten minutes and beyond the reach of existing models like ChatGPT with and without browsing and an early version of OpenAI's Deep Research tool.)

Blog Post: https://openai.com/index/browsecomp/
Paper: https://cdn.openai.com/pdf/5e10f4ab-d6f7-442e-9508-59515c65e35d/browsecomp.pdf
Code in simple eval repo: https://github.com/openai/simple-evals
New activity in meta-llama/Llama-4-Scout-17B-16E-Instruct 10 days ago

Object Detection?

5
#23 opened 14 days ago by
buckeye17-bah
New activity in mistral-community/pixtral-12b 10 days ago
New activity in mradermacher/Meta-Llama-3.1-70B-i1-GGUF about 2 months ago

Where does the imatrix come from?

11
#1 opened about 2 months ago by
treehugg3
New activity in bartowski/Mistral-Large-Instruct-2407-GGUF 8 months ago

Chat template

3
#3 opened 9 months ago by
sydneyfong