Anthropic’s Claude vulnerable to ’emotional manipulation’

/ Uncategorized / By SecurityTicks

Anthropic’s Claude vulnerable to ’emotional manipulation’

2024-10-12 at 13:32

By Thomas Claburn

AI model safety only goes so far

Anthropic’s Claude 3.5 Sonnet, despite its reputation as one of the better behaved generative AI models, can still be convinced to emit racist hate speech and malware.…

React to this headline: