DNYUZ
  • Home
  • News
    • U.S.
    • World
    • Politics
    • Opinion
    • Business
    • Crime
    • Education
    • Environment
    • Science
  • Entertainment
    • Culture
    • Music
    • Movie
    • Television
    • Theater
    • Gaming
    • Sports
  • Tech
    • Apps
    • Autos
    • Gear
    • Mobile
    • Startup
  • Lifestyle
    • Arts
    • Fashion
    • Food
    • Health
    • Travel
No Result
View All Result
DNYUZ
No Result
View All Result
Home Tech Apps

Gemini 2.5 Flash is Google’s cheapest thinking AI: What you need to know

April 18, 2025
in Apps, News, Tech
Gemini 2.5 Flash is Google’s cheapest thinking AI: What you need to know
497
SHARES
1.4k
VIEWS
Share on FacebookShare on Twitter

After launching the Gemini 2.5 Pro model a few weeks ago, Google has a new AI product ready for testing. Gemini 2.5 Flash is supposed to bring more affordable AI reasoning to tasks that require more thinking.

Google lets users specify a budget and turn reasoning on and off depending on the task. Not everything you throw at the AI will require reasoning, so you don’t have to overspend by having the AI “think” when it doesn’t need to.

However, Gemini 2.5 Flash isn’t an AI product targeting regular users. Instead, Gemini 2.5 Flash is a new tool that developers and enterprise customers can use for work. Gemini 2.5 Flash is available in preview via the Gemini API in Google AI Studio and Vertex AI.

Google says Gemini 2.5 Flash is quite formidable. The AI is Google’s lowest latency and most cost-efficient thinking model. That means it’s faster and cheaper than other models.

Gemini 2.5 Flash delivers a “major upgrade in reasoning abilities,” Google said in a blog post. The new AI is Google’s “first fully hybrid reasoning model,” which is how Google describes AI models where developers can turn reasoning on or off.

Interestingly, developers can set up thinking budgets so the AI can perform thinking tasks when they’re required. However, the AI will not consume the entire budget during a single reasoning task if that task doesn’t need it. The model is trained to know how long to think for prompts, so it’ll decide beforehand how much reasoning is required based on the perceived complexity.

Google offers a few prompt examples that explain how much reasoning Gemini 2.5 Flash will perform. For example, asking it to translate a word into a different language requires little reasoning. The same goes for answering questions like “How many provinces does Canada have?”

But more complex math and physics problems will require medium to high reasoning. The AI will spend more time on a prompt, and you’ll pay more money to get your answers.

Developers can set a thinking budget from 0 to 24576 tokens in the API or use a slider in Google AI Studio and Vertex AI.

As for the cost, Google says Gemini 2.5 Flash costs $0.15 per million tokens (input) and $0.60 per million tokens (output). If reasoning is involved for the output, the price goes up sixfold, up to $3.50 per million tokens. These costs make Gemini 2.5 Flash incredibly competitive, as seen in the table at the end of this post.

With thinking turned off, the Gemini 2.5 Flash will be at least as fast as the Gemini 2.0 Flash model.

The speed and competitive pricing for reasoning tasks aren’t Gemini 2.5 Flash’s only advantages. The new model also does very well in benchmarks. According to Google, Gemini 2.5 Flash is second only to Gemini 2.5 Pro in Hard Prompts in LMArena.

In Humanity’s Last Exam, Gemini 2.5 Flash outscored all recent models except ChatGPT o4-mini, which was launched earlier this week. The image below shows more benchmark results.

The post Gemini 2.5 Flash is Google’s cheapest thinking AI: What you need to know appeared first on BGR.

Tags: GeminiGoogle
Share199Tweet124Share
Ordinary Indians Are Feeling Jittery About the Escalating Conflict
News

Ordinary Indians Are Feeling Jittery About the Escalating Conflict

by New York Times
May 9, 2025

The worry is running deep in the parts of Kashmir and the rest of India that are in range of ...

Read more
News

The Supreme Court’s birthright citizenship case isn’t really about birthright citizenship

May 9, 2025
News

What We Know About the Terrorist Groups India Said It Targeted

May 9, 2025
News

Supreme Court Justice Tells Lawyers to ‘Stand Up’ Despite Trump Attacks

May 9, 2025
News

Von der Leyen: I won’t meet Trump in US until we can have ‘concrete’ trade talks

May 9, 2025
Cricket: IPL suspended amid India-Pakistan tensions

Cricket: IPL suspended amid India-Pakistan tensions

May 9, 2025
Grizzly bear killed by vehicle after famous mother met same fate

Grizzly bear killed by vehicle after famous mother met same fate

May 9, 2025
Big-Budget Trump Biopic In The Works From ‘Ferrari’ Producer Andrea Iervolino — Cannes Market

Big-Budget Trump Biopic In The Works From ‘Ferrari’ Producer Andrea Iervolino — Cannes Market

May 9, 2025

Copyright © 2025.

No Result
View All Result
  • Home
  • News
    • U.S.
    • World
    • Politics
    • Opinion
    • Business
    • Crime
    • Education
    • Environment
    • Science
  • Entertainment
    • Culture
    • Gaming
    • Music
    • Movie
    • Sports
    • Television
    • Theater
  • Tech
    • Apps
    • Autos
    • Gear
    • Mobile
    • Startup
  • Lifestyle
    • Arts
    • Fashion
    • Food
    • Health
    • Travel

Copyright © 2025.