We have spent years testing AI models on document extraction. Not edge cases—invoices. The simplest version of the task: read ...
GPT-5.4 Pro cracked a conjecture in number theory that had stumped generations of mathematicians, using a proof strategy that ...