DNYUZ
No Result
View All Result
DNYUZ
No Result
View All Result
DNYUZ
Home News

A Word to the Wise: Don’t Trust A.I. to File Your Taxes

March 5, 2026
in News
A Word to the Wise: Don’t Trust A.I. to File Your Taxes

Artificial intelligence is used by the world’s military to operate sophisticated drones. It has replaced thousands of coders at the most advanced technology companies. It is even upending how cancer patients are treated, potentially saving lives.

Just don’t, whatever you do, use it to file your taxes.

To assess the technology’s ability to file a federal income tax return, The New York Times tested four A.I. chatbots — Google’s Gemini, OpenAI’s ChatGPT, Anthropic’s Claude and xAI’s Grok — to see how well they fared with eight fictional tax situations written as part of training materials by TaxSlayer, a tax-filing service.

They struggled, hard, miscalculating the refund or amount owed to the Internal Revenue Service by an average of more than $2,000. Even when provided with all the necessary materials, including all the forms they needed to fill out, the chatbots whiffed on some calculations.

“The problem with taxes is all those very small little details matter, and it’s not going to get every single little detail right,” said Benedict Evans, an analyst who writes a technology newsletter.

“These models get dramatically better over the course of every six months,” he added. “But they still give you what is roughly the right answer, and that’s not what you want.”

(The Times has sued OpenAI and its partner, Microsoft, claiming copyright infringement of news content related to A.I. systems. OpenAI and Microsoft have denied those claims.)

The problem comes down to how A.I. chatbots are fundamentally designed: They do not truly understand the complex relationships among the pieces of information they are processing. Their power to predict the next appropriate word in a sequence makes them smart in some areas — like reading and writing — but leaves them exceptionally weak in others — like actively remembering a lot of interconnected information without errors sneaking into their responses.

Those weaknesses prove tricky for filing taxes, which can require dozens of forms that inform one another and need to be updated in a specific sequence. A.I. tools struggle to follow complex procedures perfectly, and errors can accumulate as a task becomes more complex.

The issue amounts to a “tax-code paradox,” said Erik Brynjolfsson, a senior fellow at the Stanford Institute for Human-Centered A.I. The shortcoming reflects much larger challenges that A.I. companies are facing in expanding the tools into all areas of life.

“Traditional tax software like TurboTax is procedural, following ‘if-then’ logic built for mathematical precision,” Mr. Brynjolfsson said of existing online filing tools. Large language models, by contrast, are prediction engines that “can be superhuman at many tasks yet fail at some that seem simpler to humans.”

The chatbots did better in our tests when we gave the most advanced models a very organized picture of a fictional user’s finances, including sorting every piece of information by the corresponding I.R.S. document they should have used and then uploading those documents.

But most people don’t file their taxes this way; they don’t know what documents to use or what claims to make. Modern tax software asks filers about their life — whether they have children in day care, for example, or use a car for work — then transfers that information directly into the correct forms. Chatbots struggle with this kind of operation. Without specific instructions, they can only surface what is probably the most relevant information, which might not be what you need.

“If you ask it how many R’s are in ‘strawberry,’ it tells you how many R’s are probably in ‘strawberry,’” Mr. Evans said.

A.I. optimists hold a different view about where the tools might go next. They argue that the existing tools may begin to “think” more clearly through complex problems like taxes, applying active reasoning to find their way through I.R.S. documents.

Adding tools on top of the chatbots — like a program that could validate whether the tax return passes all the I.R.S. rules — could give them the help they need to get things right. That is similar to how A.I. chatbots have learned to code: They occasionally program things incorrectly but are good at understanding errors and coming up with fixes.

Claude, the A.I. chatbot from Anthropic, shows its “thought process” in real time. When we asked it to calculate how much a fictional user owed in federal taxes, it determined it needed a form from the I.R.S. that it didn’t have. It described the need to fetch it from the internet and then did just that, downloading the form and completing the math required.

In that case, Claude got the tax refund correct. But it made many errors in other tests, including calculating a lower refund than the fictional person was owed.

Tax experts have suggested that the tools are still a helpful assistant to use alongside manual research. When we asked the chatbots simple tax questions, or asked the chatbots to describe in familiar language a complex I.R.S. form, they performed well. Everyday tax filers — and even professionals — can make plenty of mistakes when navigating the tax code on their own, too.

But experts have emphasized keeping humans in the loop.

Stuart A. Thompson writes for The Times about online influence, including the people, places and institutions that shape the information we all consume.

The post A Word to the Wise: Don’t Trust A.I. to File Your Taxes appeared first on New York Times.

Britney Spears, ‘celebrity,’ arrested on suspicion of DUI in Ventura County
News

Britney Spears, ‘celebrity,’ arrested on suspicion of DUI in Ventura County

by Los Angeles Times
March 5, 2026

Britney Spears was arrested Wednesday night in Ventura on suspicion of DUI, according to online records from the Ventura County ...

Read more
News

Thousands of public comments slam Trump’s ballroom: ‘I did not vote for this’

March 5, 2026
News

Americans stranded in the Middle East say they’ve had little US government help: ‘I felt betrayed and left out to dry’

March 5, 2026
News

A Political Earthquake Rattles the North Carolina Legislature

March 5, 2026
News

Desperate Planned Parenthood now selling Botox, lip fillers and laughing gas to make cash after Trump slashed funding by $100M

March 5, 2026
At a broken Kennedy Center, the National Symphony begins a new journey

With Iran, international law has lost its credibility

March 5, 2026
How $800 Monthly Car Payments Are Hurting Car Sales

How $800 Monthly Car Payments Are Hurting Car Sales

March 5, 2026
CBS News, the Free Press Hires Aaron MacLean as National Security Analyst

CBS News, the Free Press Hires Aaron MacLean as National Security Analyst

March 5, 2026

DNYUZ © 2026

No Result
View All Result

DNYUZ © 2026