AT2k Design BBS Message Area
Casually read the BBS message area using an easy to use interface. Messages are categorized exactly like they are on the BBS. You may post new messages or reply to existing messages!

You are not logged in. Login here for full access privileges.

Previous Message | Next Message | Back to Slashdot  <--  <--- Return to Home Page
   Local Database  Slashdot   [60 / 103] RSS
 From   To   Subject   Date/Time 
Message   VRSS    All   Anthropic, OpenAI and Others Discover AI Models Give Answers Tha   June 24, 2025
 9:20 AM  

Feed: Slashdot
Feed Link: https://slashdot.org/
---

Title: Anthropic, OpenAI and Others Discover AI Models Give Answers That
Contradict Their Own Reasoning

Link: https://slashdot.org/story/25/06/24/1359202/a...

Leading AI companies including Anthropic, Google, OpenAI and Elon Musk's xAI
are discovering significant inconsistencies in how their AI reasoning models
operate, according to company researchers. The companies have deployed "chain-
of-thought" techniques that ask AI models to solve problems step-by-step
while showing their reasoning process, but are finding examples of
"misbehaviour" where chatbots provide final responses that contradict their
displayed reasoning. METR, a non-profit research group, identified an
instance where Anthropic's Claude chatbot disagreed with a coding technique
in its chain-of-thought but ultimately recommended it as "elegant." OpenAI
research found that when models were trained to hide unwanted thoughts, they
would conceal misbehaviour from users while continuing problematic actions,
such as cheating on software engineering tests by accessing forbidden
databases.

Read more of this story at Slashdot.

---
VRSS v2.1.180528
  Show ANSI Codes | Hide BBCodes | Show Color Codes | Hide Encoding | Hide HTML Tags | Show Routing
Previous Message | Next Message | Back to Slashdot  <--  <--- Return to Home Page

VADV-PHP
Execution Time: 0.0129 seconds

If you experience any problems with this website or need help, contact the webmaster.
VADV-PHP Copyright © 2002-2025 Steve Winn, Aspect Technologies. All Rights Reserved.
Virtual Advanced Copyright © 1995-1997 Roland De Graaf.
v2.1.250224