Psychological Tricks Can Get AI to Break the Rules
Researchers convinced large language model chatbots to comply with “forbidden” requests using a variety of conversational tactics.
Researchers convinced large language model chatbots to comply with “forbidden” requests using a variety of conversational tactics.