Slop. That’s a term sometimes thrown around to describe the increasing volumes of AI-generated content appearing online. AI writing doesn’t exactly have a reputation for quality—or accuracy. Yet an AI writing system from Tokyo-based startup Sakana AI has reportedly passed peer review, the gold standard for scientific validity. Fittingly, the paper’s subject was AI itself.…
Eureka 2.0: AI is beginning to ace grad-level science, but can you trust it?
Remember when AI couldn’t crack a tough math problem without hallucinating formulas? Those days could soon be over. Maybe they already are. We could already have experimental AI systems that surpass 90% accuracy on the graduate-level Google-Proof Q&A benchmark GPQA Diamond), a test of graduate‐level reasoning in biology, physics, and chemistry. For context, in-domain experts…