Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

L4sBot@lemmy.world · 10 months ago

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

LWD@lemm.ee · 10 months ago

Well, I’m sure of anything genuinely bad happens, the company OpenAI will reveal it to us. After all, they care more about human beings than their bottom line.

/s