Alerts
Events
DCR
Explore Cyware Products
Alerts
Events
DCR
Go to listing page
Meta's AI Safety System Manipulated by Space Bar Characters to Enable Prompt Injection
Innovation and Research
July 30, 2024
The Register
A bug hunter discovered a bypass in Meta's Prompt-Guard-86M model by inserting character-wise spaces between English alphabet characters, rendering the classifier ineffective in detecting harmful content.
Read More
Meta AI
meta
AI Safety System
Prompt-Guard-86M
Prompt Injection Attacks
Publisher
Previous
IBM: Cost of a Breach Reaches Nearly $5 Million, With H ...
Trends, Reports, Analysis
Next
US State Department Says UN Cybercrime Treaty Must Incl ...
Geopolitical, Terrorism