Benchmarks are 💩

by MrDevolver - opened 22 days ago

22 days ago

Benchmarks are 💩. For the love of God please stop claiming that small models like this one can compete with big models such as ChatGPT or Claude if it can't even fix small issues such as missing paddle movement logic in a simple pong game code written in javascript!

JRZ

13 days ago

Agreed. For my use case the benchmarks and leader-boards seem very misleading most of the time.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment