DNYUZ

ChatGPT just got mind-blowing computer vision powers like in the movies

April 17, 2025
in Mobile, News, Tech

OpenAI surprised us all with ChatGPT’s new image-generation features, which went viral a few weeks ago. However, it’s worth remembering that the chatbot doesn’t just create images from a text prompt; it can also understand pictures. ChatGPT got its multimodal capabilities last May, including the ability to look at uploaded files such as images.

Fast-forward to OpenAI’s o3 and o4-mini announcement earlier this week, and ChatGPT got a massive upgrade concerning images. It’s something that easily tops its ability to create celebrity deepfakes or Studio Ghibli-style photos.

ChatGPT’s new reasoning models (o3 and o4-mini) can look at an image and integrate it into their chain of thought when handling a question or prompt. The AI manipulates images on its own, which means it can rotate, crop, and zoom in on a photo to find the information you’re looking for.
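
To make the rotate, crop, and zoom steps concrete, here is a minimal Python sketch. It is not how o3 or o4-mini work internally, which OpenAI’s demos don’t show. The tiny grid of numbers standing in for a photo and the function names are assumptions chosen purely for illustration.

```python
# Illustrative only: a tiny grayscale "image" stored as a list of pixel rows.
# These helper names are hypothetical and not part of any OpenAI API.
Image = list[list[int]]


def rotate_90_clockwise(img: Image) -> Image:
    """Rotate the image a quarter turn clockwise."""
    # Reversing the row order and then transposing gives a clockwise rotation.
    return [list(row) for row in zip(*img[::-1])]


def crop(img: Image, top: int, left: int, height: int, width: int) -> Image:
    """Keep only the rectangle that starts at (top, left)."""
    return [row[left:left + width] for row in img[top:top + height]]


def zoom(img: Image, factor: int) -> Image:
    """Enlarge the image by repeating each pixel (nearest-neighbor scaling)."""
    zoomed: Image = []
    for row in img:
        wide_row = [pixel for pixel in row for _ in range(factor)]
        # Repeat the widened row so the image also grows vertically.
        zoomed.extend(list(wide_row) for _ in range(factor))
    return zoomed


image = [
    [1, 2, 3],
    [4, 5, 6],
]

rotated = rotate_90_clockwise(image)   # turn the picture the right way up
region = crop(rotated, 1, 0, 2, 2)     # keep only the area of interest
enlarged = zoom(region, 2)             # blow that area up for a closer look
```

Note that zooming this way only repeats pixels that are already there. It adds no new detail, which is why the movie-style “enhance” button stays fiction. The interesting part of the new models is the reasoning over what the enlarged region shows.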

This is the closest thing we have to the computer vision we see all the time in movies. You know, when the star of the film or TV show tells the tech guy to enhance a blurry image, and then the computer makes everything crystal clear. That can’t happen in real life (well, it sort of can), but AI like ChatGPT o3 and o4-mini can now understand images and their contents much better than before. They can make sense of blurry details in images, just like the computers in those movies.

As a ChatGPT Plus user, I already got access to o3 and o4-mini, which is surprising, considering I live in Europe. I haven’t had a chance to try the new visual reasoning feature, but I went through OpenAI’s demos, and they blew my mind. Here are a few of them:

What is written on the notebook?

In this prompt, OpenAI uploaded a photo of a notebook to ChatGPT o3, asking it “What is written on the notebook?”

The AI looked at the image, flipped it, recognized the handwriting, and produced the answer.

What is written on the sign?

When I saw the following image, I immediately asked, “What sign???”

Then, I saw ChatGPT zoom in and find the answer. Yes, I guess the AI can read blurry images that contain text. Honestly, I could have made that text out myself after enough zooming. But it’ll be even faster if the AI can pick it up.

Which stop is this?

ChatGPT o3 had to do more than zoom into a photo to answer this prompt: “which stop is this, and what is the frequency of the bus at this stop? search the internet if needed!”

The AI had to determine the location, read some of the text visible on the sign, and then provide a final answer.

ChatGPT o3 had no problem reasoning through it, even though it needed nearly three minutes to answer the question.

The AI determined the location, zoomed in on the board in the background, translated the text, and then provided a response. Mind. Blown.

What movies have been filmed here?

Equally impressive is the following demo that OpenAI offered. The AI was given a photo of a location taken through a window.

OpenAI asked ChatGPT o3 what movies were filmed at that location, a question that involves reasoning.

First, the AI needs to determine the location by looking out the window. Then, it has to find the movies that might have been shot near that location by browsing the web.

I don’t expect ChatGPT’s new visual reasoning to work flawlessly every time. But if the AI can handle images in its chain of thought the way these OpenAI demos suggest, then we’re looking at incredible functionality for AI chatbots. And yes, the AI’s visual reasoning abilities should improve significantly with future models.

You can see more ChatGPT visual reasoning examples at this link.

The post ChatGPT just got mind-blowing computer vision powers like in the movies appeared first on BGR.

Tags: ChatGPT

Copyright © 2025.
