AI Flunks SimpleBench, Teens Triumph
·51 words·1 min
New SimpleBench is super interesting. It claims to be the only benchmark where human high schoolers get 90% but all frontier models get less than 30%. Furthermore it’s private so folks can’t game it. Even more interesting is what I feel was long time coming: A “business model” of… continue reading