DNYUZ

A Word to the Wise: Don’t Trust A.I. to File Your Taxes

March 5, 2026

Artificial intelligence is used by the world’s militaries to operate sophisticated drones. It has replaced thousands of coders at the most advanced technology companies. It is even upending how cancer patients are treated, potentially saving lives.

Just don’t, whatever you do, use it to file your taxes.

To assess the technology’s ability to file a federal income tax return, The New York Times tested four A.I. chatbots — Google’s Gemini, OpenAI’s ChatGPT, Anthropic’s Claude and xAI’s Grok — to see how well they fared with eight fictional tax situations written as part of training materials by TaxSlayer, a tax-filing service.

They struggled, hard, miscalculating the refund or amount owed to the Internal Revenue Service by an average of more than $2,000. Even when provided with all the necessary materials, including all the forms they needed to fill out, the chatbots whiffed on some calculations.

“The problem with taxes is all those very small little details matter, and it’s not going to get every single little detail right,” said Benedict Evans, an analyst who writes a technology newsletter.

“These models get dramatically better over the course of every six months,” he added. “But they still give you what is roughly the right answer, and that’s not what you want.”

(The Times has sued OpenAI and its partner, Microsoft, claiming copyright infringement of news content related to A.I. systems. OpenAI and Microsoft have denied those claims.)

The problem comes down to how A.I. chatbots are fundamentally designed: They do not truly understand the complex relationships among the pieces of information they are processing. Their power to predict the next appropriate word in a sequence makes them smart in some areas — like reading and writing — but leaves them exceptionally weak in others — like actively remembering a lot of interconnected information without errors sneaking into their responses.

Those weaknesses prove tricky for filing taxes, which can require dozens of forms that inform one another and need to be updated in a specific sequence. A.I. tools struggle to follow complex procedures perfectly, and errors can accumulate as a task becomes more complex.

The issue amounts to a “tax-code paradox,” said Erik Brynjolfsson, a senior fellow at the Stanford Institute for Human-Centered A.I. The shortcoming reflects much larger challenges that A.I. companies are facing in expanding the tools into all areas of life.

“Traditional tax software like TurboTax is procedural, following ‘if-then’ logic built for mathematical precision,” Mr. Brynjolfsson said of existing online filing tools. Large language models, by contrast, are prediction engines that “can be superhuman at many tasks yet fail at some that seem simpler to humans.”

The chatbots did better in our tests when we gave the most advanced models a very organized picture of a fictional user’s finances: every piece of information sorted by the I.R.S. document the chatbots needed to use, with those documents uploaded as well.

But most people don’t file their taxes this way; they don’t know what documents to use or what claims to make. Modern tax software asks filers about their life — whether they have children in day care, for example, or use a car for work — then transfers that information directly into the correct forms. Chatbots struggle with this kind of operation. Without specific instructions, they can only surface what is probably the most relevant information, which might not be what you need.

“If you ask it how many R’s are in ‘strawberry,’ it tells you how many R’s are probably in ‘strawberry,’” Mr. Evans said.

A.I. optimists hold a different view about where the tools might go next. They argue that the existing tools may begin to “think” more clearly through complex problems like taxes, applying active reasoning to find their way through I.R.S. documents.

Adding tools on top of the chatbots — like a program that could validate whether the tax return passes all the I.R.S. rules — could give them the help they need to get things right. That is similar to how A.I. chatbots have learned to code: They occasionally program things incorrectly but are good at understanding errors and coming up with fixes.

Claude, the A.I. chatbot from Anthropic, shows its “thought process” in real time. When we asked it to calculate how much a fictional user owed in federal taxes, it determined it needed a form from the I.R.S. that it didn’t have. It described the need to fetch it from the internet and then did just that, downloading the form and completing the math required.

In that case, Claude got the tax refund correct. But it made many errors in other tests, including calculating a lower refund than the fictional person was owed.

Tax experts have suggested that the tools are still a helpful assistant to use alongside manual research. When we asked the chatbots simple tax questions, or asked them to describe a complex I.R.S. form in familiar language, they performed well. Everyday tax filers, and even professionals, can make plenty of mistakes when navigating the tax code on their own, too.

But experts have emphasized keeping humans in the loop.

Stuart A. Thompson writes for The Times about online influence, including the people, places and institutions that shape the information we all consume.

The post A Word to the Wise: Don’t Trust A.I. to File Your Taxes appeared first on New York Times.


DNYUZ © 2026
