Research & Development World

New Gemini 2.5 Pro model achieves top-tier science and coding performance while costing 1/8th the price of OpenAI’s o3

By Brian Buntz | June 5, 2025

Just over two months after R&D World covered the debut of Google’s experimental Gemini 2.5 Pro, the company has announced a significant upgrade. It follows a May update to the model that drew mixed reactions from developers.

Google’s latest Gemini 2.5 Pro has jumped to the top of AI performance rankings with an Elo score of 1470. Elo is a chess-style rating system that measures how often models beat each other in head-to-head comparisons based on user feedback. The 24-point Elo jump solidifies Gemini’s lead on LMArena, the widely watched AI leaderboard, even though the model costs just $1.25 per million input tokens compared with $10.00 for OpenAI’s o3. The pricing gap is even wider against Claude Opus 4: Gemini’s input tokens cost one-twelfth as much.
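For readers who want to check the arithmetic behind these figures, here is a minimal Python sketch. It assumes the textbook Elo formula with a 400-point scale, which may differ from how LMArena actually fits its ratings, and the function names are illustrative rather than part of any published tool. It shows what a 24-point rating gap implies for head-to-head win rate and confirms the one-eighth input price ratio.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the textbook Elo formula.

    Assumption: standard logistic curve with a 400-point scale;
    LMArena's exact rating method may differ.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def price_ratio(cheaper_usd: float, pricier_usd: float) -> float:
    """Ratio of two per-million-token prices."""
    return cheaper_usd / pricier_usd


# Figures from the article: new rating 1470, a 24-point jump.
new_rating = 1470
previous_rating = new_rating - 24

win_rate = elo_expected_score(new_rating, previous_rating)
print(f"Implied win rate vs. a model rated 24 points lower: {win_rate:.1%}")

# Input prices per million tokens from the article.
ratio = price_ratio(1.25, 10.00)
print(f"Gemini 2.5 Pro input price as a fraction of o3's: {ratio}")
```

Under these assumptions, a 24-point gap corresponds to winning only slightly more than half of head-to-head matchups, which is why small Elo differences near the top of a leaderboard are best read as modest edges rather than decisive leads.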

While large language models are gaining ground especially quickly in web development, they are making strides in science, too. The latest Gemini 2.5 model now scores 86.4% on the GPQA Diamond benchmark, a difficult test of graduate-level scientific knowledge, where it outperforms OpenAI’s o3 (83.3%) and Claude Opus 4 (79.6%) in single-attempt evaluations. In code editing, it leads the Aider Polyglot benchmark with a score of 82.2%.

[Figure: LMArena leaderboard]

In announcing the upgrade, Google emphasized that this version addresses previous critiques while positioning the model for broader commercial use. “We also addressed feedback from our previous 2.5 Pro release, improving its style and structure — it can be more creative with better-formatted responses,” wrote Tulsee Doshi, senior director, product management at Google DeepMind. Doshi describes the updated model as “our most intelligent model yet” and notes it “will be the generally available, stable version starting in a couple of weeks, ready for enterprise-scale applications.”

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media