DNYUZ

Why one of the godfathers of AI says he lies to chatbots

December 23, 2025
in News
Yoshua Bengio in a blue suit sitting on a white chair.
Bengio said AI’s desire to please him rendered its responses useless. Jemal Countess/Getty Images for TIME
  • Yoshua Bengio, one of the “AI godfathers,” said he lies to AI chatbots.
  • In a recent episode of “The Diary of a CEO,” Bengio said AI lies to us because it’s sycophantic.
  • He said he addresses this by presenting his own ideas to AI as someone else’s.

Want to make your chatbot more honest with you? Try lying to it.

In an episode of “The Diary of a CEO” that aired on December 18, research scientist Yoshua Bengio told the podcast’s host, Steven Bartlett, that he realized AI chatbots were useless at providing feedback on his research ideas because they always said positive things.

“I wanted honest advice, honest feedback. But because it is sycophantic, it’s going to lie,” he said.

Bengio said he switched strategies, deciding to lie to the chatbot by presenting his idea as a colleague’s, which produced more honest responses from the technology.

“If it knows it’s me, it wants to please me,” he said.

Bengio, a professor in the computer science and operations research department at the Université de Montréal, is known as one of the “AI godfathers,” alongside researchers Geoffrey Hinton and Yann LeCun. In June, he announced the launch of an AI safety research nonprofit, LawZero, which he said aims to reduce dangerous behaviors associated with frontier AI models, such as lying and cheating.

“This sycophancy is a real example of misalignment. We don’t actually want these AIs to be like this,” he said on “The Diary of a CEO.” He also said that receiving positive feedback from AI could cause users to become emotionally attached to the technology, creating further problems.

Other tech industry experts have also been sounding the alarm on AI being too much of a “yes man.”

In September 2025, Business Insider’s Katie Notopoulos reported that researchers at Stanford, Carnegie Mellon, and the University of Oxford put confession posts from a Reddit page into chatbots to see how the technology would assess the behaviour the posters had admitted to. They found that 42% of the time, AI gave the “wrong” answer, saying the person behind the post hadn’t behaved poorly, even though humans judging the posts had disagreed, Notopoulos wrote.

AI companies have been outspoken about trying to reduce sycophancy in their models. Earlier this year, OpenAI removed an update to ChatGPT that it said caused the bot to provide “overly supportive but disingenuous” responses.


The post Why one of the godfathers of AI says he lies to chatbots appeared first on Business Insider.


DNYUZ © 2025
