These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models
In a long-running program known as the Sunday Puzzle, NPR host Will Shortz, the crossword puzzle expert for The New York Times, gets to test thousands of listeners every Sunday. Even experienced participants typically find the brainteasers difficult, despite the fact that they are designed to be solved with no prior information. Because of this, …