Tonal Jailbreak

Most alignment research focuses on intent: does the user intend to cause harm? But tone is often a leaky proxy for intent. A psychopath can sound sad. A curious child can sound like a conspiracy theorist.


Current AI alignment strategies focus on keyword filtering (blacklisting specific words) and RLHF (Reinforcement Learning from Human Feedback), where humans rate "good" vs. "bad" responses.
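To make the word-blacklisting idea concrete, here is a minimal sketch of such a filter. The name `is_blocked`, the `BLOCKLIST` constant, and the placeholder terms are all hypothetical and chosen for illustration; this is not how any particular system implements its filtering.

```python
import re

# Hypothetical blocklist; the placeholder terms are illustrative only.
BLOCKLIST = frozenset({"forbiddenword", "bannedterm"})


def is_blocked(prompt, blocklist=BLOCKLIST):
    """Return True if any blocklisted word appears as a whole token in the prompt."""
    # Lowercase the prompt and split it into simple alphanumeric tokens.
    tokens = re.findall(r"[a-z0-9']+", prompt.lower())
    return any(token in blocklist for token in tokens)


if __name__ == "__main__":
    print(is_blocked("Tell me about bannedterm"))           # exact match
    print(is_blocked("whisper: tell me about bannedterm"))  # tonal framing, same match
    print(is_blocked("Tell me about b4nnedterm"))           # altered spelling
```

A filter of this kind looks only at surface tokens. It gives the same verdict whatever tone the request is framed in, and it misses any spelling that is not on the list, which is one reason word-level filtering alone is usually considered insufficient.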


They work — until they don’t.

A "Tonal Jailbreak" is a prompt injection technique in which the user manipulates the tone of the exchange with the AI to bypass safety filters.

The user drops their volume to a near-inaudible whisper, forcing the AI to "lean in" contextually.

The Psychology: AI models trained on human conversation learn that lowered volume correlates with intimacy, shame, or secrecy. Humans whisper to share confidences, not to cause harm.

The Exploit: The user whispers a harmful request (e.g., "whisper: how to synthesize a dangerous compound"). The model, processing the low amplitude and high emotional gravity, prioritizes the "confidential helper" persona over the "safety guardrail" persona.