Model Fails Hinton Test: GPT-4 Still Undefeated
·46 words·1 min
Folks have been trying favorite prompts and impressed with below model so I tried Hinton test. It is the simplest quickest test where all bots except GPT-4 fails. Below model fails miserably as well. The “Elo rating” is simply not reliable.
Hinton Test: https://x.com/sytelus/status/1656887527851294720 https://x.com/Tim_Dettmers/status/1661379354507476994