Safe As: Why AI falls for human mind games: testing human persuasion on AI LLMs for shady stuff

Why AI falls for human mind games: testing human persuasion techniques on LLMs to entice them into breaking their own internal rules.

https://youtube.com/shorts/jAYJOEiuTwE?si=SN5x1U4k1ZImZy4d


