Facefam ArticlesFacefam Articles
  • webmaster
    • How to
    • Developers
    • Hosting
    • monetization
    • Reports
  • Technology
    • Software
  • Downloads
    • Windows
    • android
    • PHP Scripts
    • CMS
  • REVIEWS
  • Donate
  • Join Facefam
Search

Archives

  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • January 2025
  • December 2024
  • November 2024

Categories

  • Advertiser
  • AI
  • android
  • betting
  • Bongo
  • Business
  • CMS
  • cryptocurrency
  • Developers
  • Development
  • Downloads
  • Entertainment
  • Entrepreneur
  • Finacial
  • General
  • Hosting
  • How to
  • insuarance
  • Internet
  • Kenya
  • monetization
  • Music
  • News
  • Phones
  • PHP Scripts
  • Reports
  • REVIEWS
  • RUSSIA
  • Software
  • Technology
  • Tips
  • Tragic
  • Ukraine
  • Uncategorized
  • USA
  • webmaster
  • webmaster
  • Windows
  • Women Empowerment
  • Wordpress
  • Wp Plugins
  • Wp themes
Facefam 2025
Notification Show More
Font ResizerAa
Facefam ArticlesFacefam Articles
Font ResizerAa
  • Submit a Post
  • Donate
  • Join Facefam social
Search
  • webmaster
    • How to
    • Developers
    • Hosting
    • monetization
    • Reports
  • Technology
    • Software
  • Downloads
    • Windows
    • android
    • PHP Scripts
    • CMS
  • REVIEWS
  • Donate
  • Join Facefam
Have an existing account? Sign In
Follow US
Technologywebmaster

AI Models Least & Most Likely to Invent Information

Ronald Kenyatta
Last updated: August 12, 2025 9:54 pm
By
Ronald Kenyatta
ByRonald Kenyatta
Follow:
Share
5 Min Read
SHARE

Contents
How the top AI tools stack up when the facts matterOpenAIGoogleMore must-read AI coverageAnthropicMetaxAIKeeping track of truth in the age of AI
AI-generated image of a rocket launch.
Source: Vectara

OpenAI’s latest AI models are outpacing competitors from Google, Anthropic, xAI, and Meta in keeping their facts straight, according to new rankings. The results show stark differences in “hallucination rates,” or how often these AI models invent details.

The results come from Vectara’s Hughes Hallucination Evaluation Model (HHEM) Leaderboard, which measures the “ratio of summaries that hallucinate” across leading large language models. In head-to-head tests, ChatGPT models outperformed Gemini, Claude, Grok, and Meta AI, landing near the top of the accuracy race.

How the top AI tools stack up when the facts matter

Vectara’s HHEM Leaderboard is based on a large-scale test designed to determine whether AI models can adhere to the facts when summarizing real news articles. Each AI model was given the same set of short documents and scored on how often its summaries included information not found in the original text.

Refusal rates were also tracked, capturing how often an AI model declined to answer. With the conditions kept identical across the board, the results reveal which AI tools handle the truth best under the same pressure. Here’s how they performed.

OpenAI

OpenAI holds five of the lowest hallucination rates on the leaderboard, with ChatGPT-o3 mini at 0.795%, followed by ChatGPT-4.5, ChatGPT-5, ChatGPT-o1 mini, and ChatGPT-4o all clustered around the 1.2% to 1.49% mark.

That grounding in facts made the debut of ChatGPT-5 as the default model a strong move for the AI giant, until users pushed back, demanding the return of ChatGPT-4o. CEO Sam Altman relented, letting Plus subscribers choose their model.

But there’s a trade-off. Once free users hit their GPT-5 limit, they’re switched to ChatGPT-5 mini, a sharp drop in accuracy with a 4.9% hallucination rate that’s among the highest in OpenAI’s lineup. That could mean a sudden slide in how much you can trust the answers you get.

Google

Google’s Gemini 2.5 Pro Preview and Gemini 2.5 Flash Lite scored 2.6% and 2.9%, respectively. Not as low as OpenAI’s leaders, but still well clear of the highest-risk models. Pro Preview replaced the now-retired Gemini 2.5 Pro Experimental, which had once posted one of the lowest scores on the board at 1.1%.

More must-read AI coverage

Anthropic

Anthropic’s newest models, Claude Opus 4.1 and Claude Sonnet 4, post hallucination rates of 4.2% and 4.5%. Those scores place both models among the more error-prone models on the board, well behind leaders such as ChatGPT and Gemini.

Meta

Meta’s LLaMA 4 Maverick and LLaMA 4 Scout had 4.6% and 4.7% hallucination rates, putting them in the same ballpark as Claude’s latest models and outside the group of most accurate performers on the board.

xAI

Grok 4 posts a high hallucination rate of 4.8%, placing it among the least accurate models on the leaderboard. Elon Musk has promoted the newly released model as “smarter than almost all graduate students, in all disciplines,” pointing to its 26.9% score on On Humanity’s Last Exam.

The chatbot is also facing criticism for harmful and inappropriate outputs. This combination of a high error rate and ongoing content issues could make Grok a risky choice for fact-reliable answers.

Keeping track of truth in the age of AI

When AI gets it wrong, it can sound right. And when those made-up details slip past unnoticed, bending facts and spreading misinformation, it can lead to serious risks in areas like health, law, finance, and politics. That’s why ongoing, transparent testing is more important than ever.

Vectara’s HHEM Leaderboard updates with every model change, tracking in real time which AIs are improving and which are falling behind. As these systems weave deeper into search, messaging, and everyday tools, knowing which AI model stays closest to the truth is knowing what to trust.

In our closer look at OpenAI’s GPT-5, we focus on the AI model’s health-related benchmarks and guidelines.

TAGGED:InformationInventModels
Share This Article
Facebook Whatsapp Whatsapp Email Copy Link Print
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Trump Says Intel CEO's Success an 'Amazing Story' After Calling for His Resignation Trump Says Intel CEO’s Success an ‘Amazing Story’ After Calling for His Resignation
Next Article How Non-Developers Can Build Apps Faster Than Ever How Non-Developers Can Build Apps Faster Than Ever
Leave a review

Leave a Review Cancel reply

Your email address will not be published. Required fields are marked *

Please select a rating!

Meta Strikes $10 Billion Cloud Deal With Google to Boost AI Capacity
NVIDIA CEO Dismisses Chip Security Allegations as China Orders Firms to Halt Purchases
Anthropic Folds Claude Code Into Business Plans With Governance Tools
Google Claims One Gemini AI Prompt Uses Five Drops of Water
Generate AI Business Infographics without the Fees

Recent Posts

  • Meta Strikes $10 Billion Cloud Deal With Google to Boost AI Capacity
  • NVIDIA CEO Dismisses Chip Security Allegations as China Orders Firms to Halt Purchases
  • Anthropic Folds Claude Code Into Business Plans With Governance Tools
  • Google Claims One Gemini AI Prompt Uses Five Drops of Water
  • Generate AI Business Infographics without the Fees

Recent Comments

  1. https://tubemp4.ru on Best Features of PHPFox Social Network Script
  2. Вулкан Платинум on Best Features of PHPFox Social Network Script
  3. Вулкан Платинум официальный on Best Features of PHPFox Social Network Script
  4. Best Quality SEO Backlinks on DDoS Attacks Now Key Weapons in Geopolitical Conflicts, NETSCOUT Warns
  5. http://boyarka-inform.com on Comparing Wowonder and ShaunSocial

You Might Also Like

IT Leader’s Guide to the Metaverse

August 21, 2025
State of AI Adoption in Financial Services: A TechRepublic Exclusive
Technologywebmaster

State of AI Adoption in Financial Services: A TechRepublic Exclusive

August 21, 2025
AI Underperforms in Reality, and the Stock Market is Feeling It
Technologywebmaster

AI Underperforms in Reality, and the Stock Market is Feeling It

August 21, 2025
Google Shows Off Pixel 10 Series and Pixel Watch 4
Technologywebmaster

Google Shows Off Pixel 10 Series and Pixel Watch 4

August 21, 2025
NVIDIA & NSF to Build Fully Open AI Models for Science
Technologywebmaster

NVIDIA & NSF to Build Fully Open AI Models for Science

August 20, 2025
Previous Next
Facefam ArticlesFacefam Articles
Facefam Articles 2025
  • Submit a Post
  • Donate
  • Join Facefam social
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up