AI jailbreak

Anthropic Disputes Fable 5 AI Jailbreak

Anthropic Disputes Fable 5 AI Jailbreak 2026-06-12 at 11:43 By Eduard Kovacs An AI hacker claims to have achieved a prompt-based jailbreak shortly after Fable 5’s launch, but Anthropic says it’s not a real jailbreak. The post Anthropic Disputes Fable 5 AI Jailbreak appeared first on SecurityWeek. This article is an excerpt from SecurityWeek View […]

Anthropic Disputes Fable 5 AI Jailbreak Read More »

Researchers Hack ChatGPT Memories and Web Search Features

Researchers Hack ChatGPT Memories and Web Search Features 2025-11-06 at 19:09 By Eduard Kovacs Tenable researchers discovered seven vulnerabilities, including ones affecting the latest GPT model. The post Researchers Hack ChatGPT Memories and Web Search Features appeared first on SecurityWeek. This article is an excerpt from SecurityWeek View Original Source

Researchers Hack ChatGPT Memories and Web Search Features Read More »

ChatGPT Tricked Into Solving CAPTCHAs

ChatGPT Tricked Into Solving CAPTCHAs 2025-09-19 at 14:30 By Ionut Arghire The AI agent was able to solve different types of CAPTCHAs and adjusted its cursor movements to better mimic human behavior. The post ChatGPT Tricked Into Solving CAPTCHAs appeared first on SecurityWeek. This article is an excerpt from SecurityWeek View Original Source

ChatGPT Tricked Into Solving CAPTCHAs Read More »

UAE’s K2 Think AI Jailbroken Through Its Own Transparency Features

UAE’s K2 Think AI Jailbroken Through Its Own Transparency Features 2025-09-11 at 15:24 By Kevin Townsend Researchers exploited K2 Think’s built-in explainability to dismantle its safety guardrails, raising new questions about whether transparency and security in AI can truly coexist. The post UAE’s K2 Think AI Jailbroken Through Its Own Transparency Features appeared first on

UAE’s K2 Think AI Jailbroken Through Its Own Transparency Features Read More »

Google Gemini Tricked Into Showing Phishing Message Hidden in Email 

Google Gemini Tricked Into Showing Phishing Message Hidden in Email  2025-07-14 at 17:04 By Eduard Kovacs Google Gemini for Workspace can be tricked into displaying a phishing message when asked to summarize an email. The post Google Gemini Tricked Into Showing Phishing Message Hidden in Email  appeared first on SecurityWeek. This article is an excerpt

Google Gemini Tricked Into Showing Phishing Message Hidden in Email  Read More »

New AI Jailbreak Bypasses Guardrails With Ease

New AI Jailbreak Bypasses Guardrails With Ease 2025-06-23 at 17:02 By Kevin Townsend New “Echo Chamber” attack bypasses advanced LLM safeguards by subtly manipulating conversational context, proving highly effective across leading AI models. The post New AI Jailbreak Bypasses Guardrails With Ease appeared first on SecurityWeek. This article is an excerpt from SecurityWeek View Original

New AI Jailbreak Bypasses Guardrails With Ease Read More »

All Major Gen-AI Models Vulnerable to ‘Policy Puppetry’ Prompt Injection Attack

All Major Gen-AI Models Vulnerable to ‘Policy Puppetry’ Prompt Injection Attack 2025-04-25 at 12:38 By Ionut Arghire A new attack technique named Policy Puppetry can break the protections of major gen-AI models to produce harmful outputs. The post All Major Gen-AI Models Vulnerable to ‘Policy Puppetry’ Prompt Injection Attack appeared first on SecurityWeek. This article

All Major Gen-AI Models Vulnerable to ‘Policy Puppetry’ Prompt Injection Attack Read More »

New Jailbreak Technique Uses Fictional World to Manipulate AI

New Jailbreak Technique Uses Fictional World to Manipulate AI 2025-03-21 at 14:16 By Ionut Arghire Cato Networks discovers a new LLM jailbreak technique that relies on creating a fictional world to bypass a model’s security controls. The post New Jailbreak Technique Uses Fictional World to Manipulate AI appeared first on SecurityWeek. This article is an

New Jailbreak Technique Uses Fictional World to Manipulate AI Read More »

New CCA Jailbreak Method Works Against Most AI Models

New CCA Jailbreak Method Works Against Most AI Models 2025-03-14 at 13:36 By Ionut Arghire Two Microsoft researchers have devised a new jailbreak method that bypasses the safety mechanisms of most AI systems. The post New CCA Jailbreak Method Works Against Most AI Models appeared first on SecurityWeek. This article is an excerpt from SecurityWeek

New CCA Jailbreak Method Works Against Most AI Models Read More »

DeepSeek Compared to ChatGPT, Gemini in AI Jailbreak Test

DeepSeek Compared to ChatGPT, Gemini in AI Jailbreak Test 2025-02-04 at 12:03 By Eduard Kovacs DeepSeek’s susceptibility to jailbreaks has been compared by Cisco to other popular AI models, including from Meta, OpenAI and Google. The post DeepSeek Compared to ChatGPT, Gemini in AI Jailbreak Test appeared first on SecurityWeek. This article is an excerpt

DeepSeek Compared to ChatGPT, Gemini in AI Jailbreak Test Read More »

DeepSeek Security: System Prompt Jailbreak, Details Emerge on Cyberattacks

DeepSeek Security: System Prompt Jailbreak, Details Emerge on Cyberattacks 2025-02-03 at 14:04 By Eduard Kovacs Researchers found a jailbreak method that exposed DeepSeek’s system prompt, while others have analyzed the DDoS attacks aimed at the new gen-AI. The post DeepSeek Security: System Prompt Jailbreak, Details Emerge on Cyberattacks appeared first on SecurityWeek. This article is

DeepSeek Security: System Prompt Jailbreak, Details Emerge on Cyberattacks Read More »

AI Jailbreaks Target ChatGPT, DeepSeek, Alibaba Qwen

AI Jailbreaks Target ChatGPT, DeepSeek, Alibaba Qwen 2025-01-31 at 13:19 By Eduard Kovacs Different research teams have demonstrated jailbreaks against ChatGPT, DeepSeek, and Alibaba’s Qwen AI models.  The post AI Jailbreaks Target ChatGPT, DeepSeek, Alibaba Qwen appeared first on SecurityWeek. This article is an excerpt from SecurityWeek View Original Source

AI Jailbreaks Target ChatGPT, DeepSeek, Alibaba Qwen Read More »

Scroll to Top