When Yellow Fades: GPT-4 Passes Hinton's Paint Test While Bard Falls Short
·96 words·1 min
Hinton test is actually very effective at quickly testing AI. GPT-4 gives very smart answer, GPT 3.5 is so-so and latest Bard fails completely.
Prompt:
I have some rooms in my house painted white, some in yellow and some in blue. Yellow paint fades to white within a year. I…
Update:
Claude V1, Claude+, NeevaAI and Cohere also fails.
Davinci-003 gives trivial answer.
I would have thought more models succeeding at this rather simple puzzle, at least giving trivial solution, given many of them score so well on ARC, Winogrande etc.
Hats off to… continue reading