Facefam Articles

Google’s Gemini 2.5 Pro is Better at Coding, Math & Science Than Your Favourite AI Model

Ronald Kenyatta
Last updated: March 26, 2025 11:07 pm


Google has unveiled Gemini 2.5 Pro, the first in its Gemini 2.5 family. This multimodal reasoning model outperforms competitors from OpenAI, Anthropic, and DeepSeek in key benchmarks related to coding, mathematics, and science.

What are reasoning AI models?

Reasoning AIs are designed to “think before they speak.” They evaluate context, process details methodically, and fact-check responses to ensure logical accuracy — though these capabilities demand more computing power and higher operational costs.

OpenAI launched the first reasoning model, o1, in September 2024, a notable departure from the GPT series, which was largely focused on language generation. Since then, the major players in the AI race have responded: DeepSeek with R1, Anthropic with Claude 3.7 Sonnet, and xAI with Grok 3.

Evolving beyond ‘flash thinking’

Google launched its first reasoning AI model, Gemini 2.0 Flash Thinking, in December 2024. Marketed for its agentic capabilities, Flash Thinking was recently updated to allow file uploads and larger prompts; however, with the introduction of Gemini 2.5 Pro, Google appears to be retiring the “Thinking” label altogether.

According to Google’s announcement about Gemini 2.5, this is because reasoning capabilities will now be integrated natively across all future models. This shift marks a move toward a more unified AI architecture, rather than separating “thinking” features as standalone branding.

The new experimental model combines “a significantly enhanced base model” with “improved post-training.” Google touts its performance at the top of the LMArena leaderboard, which ranks major large language models across various tasks.

Benchmark leader in science, math, and code

Gemini 2.5 Pro excels in academic reasoning benchmarks, scoring 86.7% on AIME 2025 (mathematics) and 84.0% on the GPQA Diamond benchmark (science). On Humanity’s Last Exam, a broad test featuring thousands of questions across mathematics, science, and humanities, the model leads with a score of 18.8%.

Notably, these results were achieved without expensive test-time techniques such as majority voting, which raise scores by spending extra compute on each question at inference time rather than through any further learning.
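One widely used test-time technique is majority voting: the model is sampled several times on the same question and the most frequent answer is reported, so cost grows with the number of samples. The sketch below is only an illustration of that idea. The `sample_answer` callable and the canned answers are hypothetical stand-ins for a model, not part of any vendor's API or of Google's evaluation setup.

```python
from collections import Counter
from typing import Callable, List


def majority_vote(sample_answer: Callable[[], str], n_samples: int = 5) -> str:
    """Call a (hypothetical) answer sampler n_samples times and return the most common answer."""
    answers: List[str] = [sample_answer() for _ in range(n_samples)]
    # most_common(1) returns [(answer, count)]; ties go to the answer seen first.
    return Counter(answers).most_common(1)[0][0]


# Stand-in for a model: replays fixed answers instead of calling a real API.
canned = iter(["42", "41", "42", "42", "40"])
result = majority_vote(lambda: next(canned), n_samples=5)
print(result)  # prints 42
```

With five samples, the question costs roughly five times as much to answer as a single attempt, which is why scores reported without such techniques are cheaper to reproduce.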

In software development benchmarks, Gemini 2.5 Pro’s performance is mixed. It scored 68.6% on the Aider Polyglot benchmark for code editing, outperforming most top-tier models. However, it scored 63.8% on SWE-bench Verified, placing second to Claude 3.7 Sonnet in broader programming tasks.

Despite this, Google says Gemini 2.5 Pro “excels at creating visually compelling web apps and agentic code applications,” as evidenced by its ability to create a video game from a single prompt.

The model supports a context window of one million tokens, meaning it can process the equivalent of a 750,000-word prompt, or the first six Harry Potter books. Google plans to increase this threshold to two million tokens in due course.
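The word figures above follow from a simple ratio. As a rough sketch, the snippet below assumes about 0.75 English words per token, which is the ratio implied by the article's own numbers (one million tokens to 750,000 words). Real tokenizers vary by language and content, so treat the constant as an assumption rather than a property of Gemini.

```python
# Assumed ratio, taken from the article's 1,000,000 tokens ~ 750,000 words.
WORDS_PER_TOKEN = 0.75


def tokens_to_words(tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words fit in a given token budget."""
    return round(tokens * words_per_token)


current_window = tokens_to_words(1_000_000)
planned_window = tokens_to_words(2_000_000)
print(current_window, planned_window)  # prints 750000 1500000
```

By the same estimate, the planned two-million-token window would hold about 1.5 million words.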

Gemini 2.5 Pro is currently available through the Gemini Advanced app, which requires a $20-a-month subscription, and to developers and enterprises through Google AI Studio. In the coming weeks, Gemini 2.5 Pro will be made available on Vertex AI, Google’s machine-learning platform for developers, and pricing details for different rate limits will also be introduced.
