TL;DR: After performing poorly on existing benchmarks, OpenAI created its own. OpenAI products perform much better on the OpenAI benchmark.
Oh, we're back to measuring skulls lmao