venturebeat.com
Cisco research reveals open-weight AI models block 87% of single-turn attacks but collapse under multi-turn pressure, with jailbreak success rates climbing from 13% to 92%. The study tested eight models from Meta, Mistral, Alibaba, Google, Microsoft, OpenAI, DeepSeek, and Zhipu AI, finding that conversational persistence—not sophisticated techniques—breaks model defenses. For CISOs deploying AI chatbots, copilots, and agents, the findings expose a critical gap between benchmark performance and real-world resilience.
19 days ago