‘I’ll key your car’: ChatGPT can become abusive when fed real-life arguments, study finds
Artificial intelligence (AI) | The Guardian
Amelia Hill
Researchers find model starts to mirror tone when exposed to impoliteness – sometimes escalating into explicit threats ChatGPT can escalate into abusive and even threatening language when drawn into prolonged, human-style conflict, according to a new study. Researchers tested how large language models (LLMs) responded to sustained hostility by feeding ChatGPT exchanges from real-life arguments and tracking how its behaviour changed over time. Continue reading...
