Facefam ArticlesFacefam Articles
  • webmaster
    • How to
    • Developers
    • Hosting
    • monetization
    • Reports
  • Technology
    • Software
  • Downloads
    • Windows
    • android
    • PHP Scripts
    • CMS
  • REVIEWS
  • Donate
  • Join Facefam
Search

Archives

  • May 2025
  • April 2025
  • March 2025
  • January 2025
  • December 2024
  • November 2024

Categories

  • Advertiser
  • AI
  • android
  • betting
  • Bongo
  • Business
  • CMS
  • cryptocurrency
  • Developers
  • Development
  • Downloads
  • Entertainment
  • Entrepreneur
  • Finacial
  • General
  • Hosting
  • How to
  • insuarance
  • Internet
  • Kenya
  • monetization
  • Music
  • News
  • Phones
  • PHP Scripts
  • Reports
  • REVIEWS
  • RUSSIA
  • Software
  • Technology
  • Tips
  • Tragic
  • Ukraine
  • Uncategorized
  • USA
  • webmaster
  • webmaster
  • Windows
  • Women Empowerment
  • Wordpress
  • Wp Plugins
  • Wp themes
Facefam 2025
Notification Show More
Font ResizerAa
Facefam ArticlesFacefam Articles
Font ResizerAa
  • Submit a Post
  • Donate
  • Join Facefam social
Search
  • webmaster
    • How to
    • Developers
    • Hosting
    • monetization
    • Reports
  • Technology
    • Software
  • Downloads
    • Windows
    • android
    • PHP Scripts
    • CMS
  • REVIEWS
  • Donate
  • Join Facefam
Have an existing account? Sign In
Follow US
Technologywebmaster

OpenAI’s New AI Models o3 and o4-mini Can Now ‘Think With Images’

Ronald Kenyatta
Last updated: April 18, 2025 8:12 pm
By
Ronald Kenyatta
ByRonald Kenyatta
Follow:
Share
3 Min Read
SHARE

Contents
How does ‘thinking with images’ work?More must-read AI coverageOutperforms previous models in key benchmarksWho can use OpenAI o3 and o4-mini?
Photo of OpenAI's CEO Sam Altman with the company's logo.
OpenAI’s CEO Sam Altman. Image: Creative Commons

OpenAI has rolled out two new AI models, o3 and o4‑mini, that can literally “think with images,” marking a big step forward in how machines understand pictures. These models, announced in an OpenAI press release, can reason about images the same way they do about text — cropping, zooming, and rotating photos as part of their internal thought process.

At the heart of this update is the ability to blend visual and verbal reasoning.

“OpenAI o3 and o4‑mini represent a significant breakthrough in visual perception by reasoning with images in their chain of thought,” the company said in its press release. Unlike past versions, these models don’t rely on separate vision systems — instead, they natively mix image tools and text tools for richer, more accurate answers.

How does ‘thinking with images’ work?

The models can crop, zoom, rotate, or flip an image as part of their thinking process, just like humans would. They’re not just recognizing what’s in a photo but working with it to draw conclusions.

The company notes that “ChatGPT’s enhanced visual intelligence helps you solve tougher problems by analyzing images more thoroughly, accurately, and reliably than ever before.”

This means if you upload a photo of a handwritten math problem, a blurry sign, or a complicated chart, the model can not only understand it, but also break it down step by step — possibly even better than before.

More must-read AI coverage

Outperforms previous models in key benchmarks

These new abilities aren’t just impressive in theory; OpenAI says both models outperform their predecessors regarding top academic and AI benchmarks.

“Our models set new state-of-the-art performance in STEM question-answering (MMMU, MathVista), chart reading and reasoning (CharXiv), perception primitives (VLMs are Blind), and visual search (V*),” the company noted in a statement. “On V*, our visual reasoning approach achieves 95.7% accuracy, largely solving the benchmark.”

But the models aren’t perfect. OpenAI admits the models can sometimes overthink, leading to prolonged and unnecessary image manipulations. There are also cases where the AI might misinterpret what it sees, despite correctly using tools to analyze the image. The company also warned of reliability issues when trying the same task multiple times.

Who can use OpenAI o3 and o4-mini?

As of April 16, both o3 and o4-mini are available to ChatGPT Plus, Pro, and Team users; they replace older models like o1 and o3-mini. Enterprise and education users will get access next week, and free users can try o4-mini through a new “Think” feature.

TAGGED:aiai modelsartificial intelligenceImagesModelso4miniopenaiopenai o-4 miniopenai o3OpenAIs
Share This Article
Facebook Whatsapp Whatsapp Email Copy Link Print
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Photo of Omdia’s Research Director for Digital Infrastructure Vlad Galabov. How AI is Revolutionizing Data Center Power and Cooling
Next Article Microsoft Releases Largest 1-Bit LLM, Letting Powerful AI Run on Some Older Hardware Microsoft Releases Largest 1-Bit LLM, Letting Powerful AI Run on Some Older Hardware
Leave a review

Leave a Review Cancel reply

Your email address will not be published. Required fields are marked *

Please select a rating!

Feature-by-Feature Comparison: ShaunSocial vs. ColibriPlus – Which Social Network Script Comes Out on Top?
How Enterprise IT Can Achieve Water Sustainability Despite the Demands of AI
AI Benchmark Discrepancy Reveals Gaps in Performance Claims
Huawei Readies Ascend 920 Chip to Replace Restricted NVIDIA H20
‘AI Is Fundamentally Incompatible With Environmental Sustainability’

Recent Posts

  • Feature-by-Feature Comparison: ShaunSocial vs. ColibriPlus – Which Social Network Script Comes Out on Top?
  • How Enterprise IT Can Achieve Water Sustainability Despite the Demands of AI
  • AI Benchmark Discrepancy Reveals Gaps in Performance Claims
  • Huawei Readies Ascend 920 Chip to Replace Restricted NVIDIA H20
  • ‘AI Is Fundamentally Incompatible With Environmental Sustainability’

Recent Comments

  1. https://tubemp4.ru on Best Features of PHPFox Social Network Script
  2. Вулкан Платинум on Best Features of PHPFox Social Network Script
  3. Вулкан Платинум официальный on Best Features of PHPFox Social Network Script
  4. Best Quality SEO Backlinks on DDoS Attacks Now Key Weapons in Geopolitical Conflicts, NETSCOUT Warns
  5. http://boyarka-inform.com on Comparing Wowonder and ShaunSocial

You Might Also Like

Photo of Google
Technologywebmaster

Google is Betting Big on Nuclear Energy – Here’s Why

April 19, 2025
Screenshot from Microsoft
Technologywebmaster

Microsoft’s New Copilot Studio Feature Offers More User-Friendly Automation

April 19, 2025
iot-spy.jpg
Technologywebmaster

US Officials Claim DeepSeek AI App Is ‘Designed To Spy on Americans’

April 19, 2025
Flat vector illustration of the automation concept.
Technologywebmaster

The End of Fragmented Automation

April 18, 2025
Microsoft Releases Largest 1-Bit LLM, Letting Powerful AI Run on Some Older Hardware
Technologywebmaster

Microsoft Releases Largest 1-Bit LLM, Letting Powerful AI Run on Some Older Hardware

April 18, 2025
Previous Next
Facefam ArticlesFacefam Articles
Facefam Articles 2025
  • Submit a Post
  • Donate
  • Join Facefam social
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up