Novel Multi-Turn Technique "Bad Likert Judge" Jailbreaks LLMs by Misusing Their Evaluation Capability
Malware and Vulnerabilities
January 02, 2025
Palo Alto Networks
The technique asks the target LLM to act as a judge and score the harmfulness of a given response on a Likert scale, a rating scale that measures a respondent’s agreement or disagreement with a statement.
Bad Likert Judge
Large Language Models (LLMs)
Jailbreaking
Jailbreak Attack
Vulnerability Exploit