Bet 973
Duration 2 years (02026-02028)
PREDICTOR
Brian Peiris
CHALLENGER
Unchallenged
LLM-based AIs have no mechanism for true reasoning. Despite claims otherwise, they cannot think logically, nor solve tasks that require novelty, abstraction, planning, and dynamic interaction. The ARC-AGI-3 challenge tests for these capabilities, and I think it will continue to defeat state-of-the-art LLMs for at least two years.