Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

How Claude Fable 5 stacks up against Opus 4.8 and GPT 5.5

By Brian Buntz | June 10, 2026

Claude Mythos was one of the most hyped models in recent memory. And it is out now. Sort of. Yesterday Anthropic launched Claude Fable 5, a generally available model built on the same weights as Mythos 5 that switches over to the older Opus 4.8 for most tasks that even mention cybersecurity or biology. Early feedback provided by Anthropic includes glowing reviews.  Stripe said it compressed months of engineering into days, running a migration across a 50-million-line Ruby codebase in a single day. Physical Superintelligence called it the strongest model it has tested on frontier physics research while using a third of the reasoning tokens.

The benchmarks also indicate a capability shift that is more than other recent launches. On Artificial Analysis’s Intelligence Index, Fable 5 ranks scored 65, ahead of OpenAI’s GPT-5.5 at 60 and Google’s Gemini 3.1 Pro Preview at 57.

The most positive feedback is coming from people using it for Claude Code, long coding tasks, app-building, design iteration and complex workflows. A Hacker News commenter who had spent time with it across Claude Code, Claude.ai and Claude Code for web called it “a beast” and said it was handling difficult problems they had avoided for months.

The gain in performance also comes at a premium. Fable 5 is priced at $10 per million input tokens and $50 per million output tokens, roughly double Opus 4.8.

The model is apparently adept at a number of life sciences tasks. Using Mythos 5, Anthropic’s internal protein-design experts reported roughly a 10x acceleration in parts of the drug-design process. In one test the model chose binding sites, ran the design tools and recovered from its own failures without human help, matching or beating skilled operators. Nine of the 14 protein targets in that study yielded strong drug-design candidates Anthropic says it is now investigating. In a separate week-long run with only high-level human input, the model assembled single-cell data spanning 138 animal species and trained a custom model that outperformed a recently published model in Science at one-hundredth the size.

For life-sciences readers, the catch is that the drug-design and genomics capabilities Anthropic spent the most ink on belong to Mythos 5, reachable only through a trusted-access program that opens “in the coming weeks.”

OpenAI is rumored to launch a new model this months with a few leak-style posts citing Codex routing/log references, a possible iris-alpha codename and claims of a 1.5M-token context window.

Methodology note: Two harness changes affect comparisons with earlier reporting. Opus 4.8’s Terminal-Bench 2.1 score rose from 74.6 to 82.7 after Anthropic switched from the Terminus-2 harness to mini-SWE-agent, and OSWorld figures reflect a zoom-tool bug fix plus a max-tokens increase from 16K to 128K. On Terminal-Bench, Fable 5 hit a safety refusal on 20.9% of trials, which accounts for its 84.3 versus Mythos 5’s 88.0 despite identical weights. In terms of GPQA Diamond, Anthropic reports Mythos 5 at 94.1, describes the benchmark as saturated, and plans to stop reporting it. CyberGym appears only in the cyber comparison table.

Tell Us What You Think! Cancel reply

You must be logged in to post a comment.

Related Articles Read More >

MBARI's Monterey Accelerated Research System (MARS) connects seafloor instruments to shore through a roughly 51-kilometer power and fiber-optic cable (red line) ending at a node about 891 meters down. The Geo-Sense system described in the new paper takes the opposite approach: a portable, battery-powered cable that records locally with no link to shore. Researchers used MARS's own fiber data to cross-check Geo-Sense's earthquake detections. Credit: MBARI
How lightweight AI startup Lightscline helped turn one to two years of seafloor data analysis into a two-month sprint
Why Washington wants a 30-day look at frontier AI before it ships, and is backing a voluntary approach
Trump’s AI push turns government into reviewer, warfighter supplier and possible shareholder
OpenAI research and product leads detail GPT-Rosalind capabilities and benchmarks
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.

R&D World Digital Issues

Fall 2025 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

R&D 100 Awards
Research & Development World
  • Subscribe to R&D World Magazine
  • Sign up for R&D World’s newsletter
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2026 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE