One Neat trick in FrontierMath
·42 words·1 min
Benchmarks previously used multi-choice format or scaler values to easily check ground truth.
One cool innovation in new FrontierMath benchmark is to use SymPy objects to represent mathematical structures so they can be easily checked.
This is actually a big unlock!