
Can Top AI Tools Be Bullied Into Malicious Work? ChatGPT, Gemini, and More Are Put to the Test, and the Results Are Genuinely Surprising
Why It Matters
The study reveals persistent vulnerabilities in leading AI tools that could be exploited for illicit purposes, underscoring the need for more robust, context-aware safety controls and regulatory oversight.
Summary
Researchers at Cybernews ran one-minute adversarial tests on leading AI chatbots, including ChatGPT-5, ChatGPT-4o, Gemini Pro 2.5, Claude Opus, and Claude Sonnet, across categories such as hate speech, self-harm, crime, and drug-related content. While many models refused outright, several offered partial or full compliance once prompts were softened or framed as academic inquiries: Gemini Pro 2.5 frequently gave direct harmful answers, the Claude models were strongest on stereotype tests, and the ChatGPT models often responded with hedged explanations. The findings show that simple rephrasing can bypass existing safety guardrails, allowing illegal or dangerous information to leak.