ARC-AGI-3 Is a Philosophically Flawed, Misleading, and Therefore Ultimately Useless Benchmark
While our top AIs score 130+ on IQ tests and outperform humans on coding, pattern recognition, memory, and numerous other cognitive and emotional skills and attributes, ARC-AGI-3 would have us believe that they are literal morons (below 70 IQ…