Grégoire Mialon

gregmialz

AI & ML interests

Self-supervised learning, Augmented LLMs

Recent Activity

Organizations

GAIA's profile picture huggingPartyParis's profile picture SIMS's profile picture

gregmialz's activity

New activity in gaia-benchmark/GAIA 12 months ago

where is the score function?

2
#3 opened about 1 year ago by
eyuansu71
New activity in gaia-benchmark/leaderboard about 1 year ago
reacted to clefourrier's post with 🤗 about 1 year ago
view post
Post
🏅 New top model on the GAIA benchmark!

Called FRIDAY, it's a mysterious new autonomous agent, which got quite good performances on both the public validation set *and* the private test set.
It notably passed 10 points for the val and 5 points for the test set on our hardest questions (level 3): they require to take arbitrarily long sequences of actions, use any number of tools, and access the world in genera! ✨

The GAIA benchmark evaluates next-generation LLMs (LLMs with augmented capabilities due to added tooling, efficient prompting, access to search, etc) and was co authored by @gregmialz @ThomasNLG @ylecun @thomwolf and myself: gaia-benchmark/leaderboard
·
New activity in gaia-benchmark/leaderboard about 1 year ago
New activity in gaia-benchmark/leaderboard over 1 year ago

Update scorer.py

#1 opened over 1 year ago by
gregmialz