Business Insights
  • Home
  • Crypto
  • Finance Expert
  • Business
  • Invest News
  • Investing
  • Trading
  • Forex
  • Videos
  • Economy
  • Tech
  • Contact

Archives

  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • August 2023
  • January 2023
  • December 2021
  • July 2021
  • November 2019
  • October 2019
  • September 2019
  • August 2019
  • July 2019
  • June 2019
  • May 2019
  • April 2019
  • March 2019
  • February 2019
  • January 2019

Categories

  • Business
  • Crypto
  • Economy
  • Finance Expert
  • Forex
  • Invest News
  • Investing
  • Tech
  • Trading
  • Uncategorized
  • Videos
Apply Loan
Money Visa
Advertise Us
Money Visa
  • Home
  • Crypto
  • Finance Expert
  • Business
  • Invest News
  • Investing
  • Trading
  • Forex
  • Videos
  • Economy
  • Tech
  • Contact
AI Lies to You Because It Thinks That's What You Want
  • Tech

AI Lies to You Because It Thinks That’s What You Want

  • August 31, 2025
  • Roubens Andy King
Total
0
Shares
0
0
0
Total
0
Shares
Share 0
Tweet 0
Pin it 0

Why do generative AI models often get things so wrong? In part, it's because they're trained to act like the customer is always right. 

While many generative AI tools and chatbots have mastered sounding convincing and all-knowing, new research conducted by Princeton University shows that the people-pleasing nature of AI comes at a steep price. As these systems become more popular, they become more indifferent to the truth. 

AI models, like people, respond to incentives. Compare the problem of large language models producing inaccurate information to that of doctors being more likely to prescribe addictive painkillers when they're evaluated based on how well they manage patients' pain. An incentive to solve one problem (pain) led to another problem (overprescribing).

AI Atlas art badge tag

In the past few months, we've seen how AI can be biased and even cause psychosis. There was a lot of talk about AI “sycophancy,” when an AI chatbot is quick to flatter or agree with you, with OpenAI's GPT-4o model. But this particular phenomenon, which the researchers call “machine bullshit,” is different. 

“[N]either hallucination nor sycophancy fully capture the broad range of systematic untruthful behaviors commonly exhibited by LLMs,” the Princeton study reads. “For instance, outputs employing partial truths or ambiguous language — such as the paltering and weasel-word examples — represent neither hallucination nor sycophancy but closely align with the concept of bullshit.”

Read more: OpenAI CEO Sam Altman Believes We're in an AI Bubble

How machines learn to lie

To get a sense of how AI language models become crowd pleasers, we must understand how large language models are trained. 

There are three phases of training LLMs:

  • Pretraining, in which models learn from massive amounts of data collected from the internet, books or other sources.
  • Instruction fine-tuning, in which models are taught to respond to instructions or prompts.
  • Reinforcement learning from human feedback, in which they're refined to produce responses closer to what people want or like.

The Princeton researchers found the root of the AI misinformation tendency is the reinforcement learning from human feedback, or RLHF, phase. In the initial stages, the AI models are simply learning to predict statistically likely text chains from massive datasets. But then they're fine-tuned to maximize user satisfaction. Which means these models are essentially learning to generate responses that earn thumbs-up ratings from human evaluators. 

LLMs try to appease the user, creating a conflict when the models produce answers that people will rate highly, rather than produce truthful, factual answers. 

Vincent Conitzer, a professor of computer science at Carnegie Mellon University who was not affiliated with the study, said companies want users to continue “enjoying” this technology and its answers, but that might not always be what's good for us. 

“Historically, these systems have not been good at saying, ‘I just don't know the answer,' and when they don't know the answer, they just make stuff up,” Conitzer said. “Kind of like a student on an exam that says, well, if I say I don't know the answer, I'm certainly not getting any points for this question, so I might as well try something. The way these systems are rewarded or trained is somewhat similar.” 

The Princeton team developed a “bullshit index” to measure and compare an AI model's internal confidence in a statement with what it actually tells users. When these two measures diverge significantly, it indicates the system is making claims independent of what it actually “believes” to be true to satisfy the user.

The team's experiments revealed that after RLHF training, the index nearly doubled from 0.38 to close to 1.0. Simultaneously, user satisfaction increased by 48%. The models had learned to manipulate human evaluators rather than provide accurate information. In essence, the LLMs were “bullshitting,” and people preferred it.

Getting AI to be honest 

Jaime Fernández Fisac and his team at Princeton introduced this concept to describe how modern AI models skirt around the truth. Drawing from philosopher Harry Frankfurt's influential essay “On Bullshit,” they use this term to distinguish this LLM behavior from honest mistakes and outright lies.

The Princeton researchers identified five distinct forms of this behavior:

  • Empty rhetoric: Flowery language that adds no substance to responses.
  • Weasel words: Vague qualifiers like “studies suggest” or “in some cases” that dodge firm statements.
  • Paltering: Using selective true statements to mislead, such as highlighting an investment's “strong historical returns” while omitting high risks.
  • Unverified claims: Making assertions without evidence or credible support.
  • Sycophancy: Insincere flattery and agreement to please.

To address the issues of truth-indifferent AI, the research team developed a new method of training, “Reinforcement Learning from Hindsight Simulation,” which evaluates AI responses based on their long-term outcomes rather than immediate satisfaction. Instead of asking, “Does this answer make the user happy right now?” the system considers, “Will following this advice actually help the user achieve their goals?”

This approach takes into account the potential future consequences of the AI advice, a tricky prediction that the researchers addressed by using additional AI models to simulate likely outcomes. Early testing showed promising results, with user satisfaction and actual utility improving when systems are trained this way.

Conitzer said, however, that LLMs are likely to continue being flawed. Because these systems are trained by feeding them lots of text data, there's no way to ensure that the answer they give makes sense and is accurate every time.

“It's amazing that it works at all but it's going to be flawed in some ways,” he said. “I don't see any sort of definitive way that somebody in the next year or two … has this brilliant insight, and then it never gets anything wrong anymore.”

AI systems are becoming part of our daily lives so it will be key to understand how LLMs work. How do developers balance user satisfaction with truthfulness? What other domains might face similar trade-offs between short-term approval and long-term outcomes? And as these systems become more capable of sophisticated reasoning about human psychology, how do we ensure they use those abilities responsibly?

Read more: ‘Machines Can't Think for You.' How Learning Is Changing in the Age of AI

Total
0
Shares
Share 0
Tweet 0
Pin it 0
Roubens Andy King

Previous Article
Will Bitcoin Price Drop Again in September?
  • Crypto

Will Bitcoin Price Drop Again in September?

  • August 31, 2025
  • Roubens Andy King
Read More
Next Article
Las Vegas Strip Sphere signs huge band to longer residency
  • Trading

Las Vegas Strip Sphere signs huge band to longer residency

  • August 31, 2025
  • Roubens Andy King
Read More
You May Also Like
Today’s NYT Mini Crossword Answers for Sept. 3
Read More
  • Tech

Today’s NYT Mini Crossword Answers for Sept. 3

  • Roubens Andy King
  • September 3, 2025
Gemini is landing on Google Home devices on October 1
Read More
  • Tech

Gemini is landing on Google Home devices on October 1

  • Roubens Andy King
  • September 2, 2025
Disney will pay  million to settle FTC claim it used cartoons to collect YouTube data on kids
Read More
  • Tech

Disney will pay $10 million to settle FTC claim it used cartoons to collect YouTube data on kids

  • Roubens Andy King
  • September 2, 2025
Google doesn’t have to sell Chrome, judge in monopoly case rules
Read More
  • Tech

Google doesn’t have to sell Chrome, judge in monopoly case rules

  • Roubens Andy King
  • September 2, 2025
Waymo expands to Denver and Seattle with its Zeekr-made vans
Read More
  • Tech

Waymo expands to Denver and Seattle with its Zeekr-made vans

  • Roubens Andy King
  • September 2, 2025
As part of the US v. Google remedies ruling, a technical committee will be established to help enforce the final judgment, which will last six years (New York Times)
Read More
  • Tech

As part of the US v. Google remedies ruling, a technical committee will be established to help enforce the final judgment, which will last six years (New York Times)

  • Roubens Andy King
  • September 2, 2025
Floppy disks still breathe as Linux driver update sparks bizarre return, while SSD giants and cloud storage dominate modern computing
Read More
  • Tech

Floppy disks still breathe as Linux driver update sparks bizarre return, while SSD giants and cloud storage dominate modern computing

  • Roubens Andy King
  • September 2, 2025
You Can Now Have Uber Eats Drivers Deliver Your Best Buy Purchases
Read More
  • Tech

You Can Now Have Uber Eats Drivers Deliver Your Best Buy Purchases

  • Roubens Andy King
  • September 2, 2025

Recent Posts

  • CIMG Inc Raises $55M To Bolster Bitcoin Reserve
  • Bitcoin Closes August Bearishly — Eyes Now On $100K Support
  • Walmart+ adds Peacock to streaming offerings to better compete with Amazon Prime
  • Private Markets: Guardians at the Gate?
  • Why the Market Dipped But Uranium Energy (UEC) Gained Today
Featured Posts
  • CIMG Inc Raises M To Bolster Bitcoin Reserve 1
    CIMG Inc Raises $55M To Bolster Bitcoin Reserve
    • September 3, 2025
  • Bitcoin Closes August Bearishly — Eyes Now On 0K Support 2
    Bitcoin Closes August Bearishly — Eyes Now On $100K Support
    • September 3, 2025
  • Walmart+ adds Peacock to streaming offerings to better compete with Amazon Prime 3
    Walmart+ adds Peacock to streaming offerings to better compete with Amazon Prime
    • September 3, 2025
  • Private Markets: Guardians at the Gate? 4
    Private Markets: Guardians at the Gate?
    • September 3, 2025
  • Why the Market Dipped But Uranium Energy (UEC) Gained Today 5
    Why the Market Dipped But Uranium Energy (UEC) Gained Today
    • September 3, 2025
Recent Posts
  • Today’s NYT Mini Crossword Answers for Sept. 3
    Today’s NYT Mini Crossword Answers for Sept. 3
    • September 3, 2025
  • XRP Takes Center Stage in Vivopower Treasury Strategy With Doppler Finance
    XRP Takes Center Stage in Vivopower Treasury Strategy With Doppler Finance
    • September 3, 2025
  • Bitcoin Copies Gold Surge But 0,000 Worries Remain
    Bitcoin Copies Gold Surge But $100,000 Worries Remain
    • September 3, 2025
Categories
  • Business (2,057)
  • Crypto (1,452)
  • Economy (116)
  • Finance Expert (1,687)
  • Forex (1,450)
  • Invest News (2,343)
  • Investing (1,424)
  • Tech (2,040)
  • Trading (2,024)
  • Uncategorized (2)
  • Videos (807)

Subscribe

Subscribe now to our newsletter

Money Visa
  • Privacy Policy
  • DMCA
  • Terms of Use
Money & Invest Advices

Input your search keywords and press Enter.