CR4T: Rewrite-Based Guardrails for Adolescent LLM Safety
Researchers propose CR4T, a rewrite-based guardrail system designed to replace refusal-oriented safety with constructive guidance for adolescent users.
Current LLM safety mechanisms often rely on suppression, refusing or blocking requests outright, which can create conversational dead-ends for younger users. Instead of simply blocking content, CR4T rewrites prompts to align with adolescents' developmental needs, aiming to keep interactions helpful while still adhering to safety policies.
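The contrast between blocking and rewriting can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not CR4T's actual implementation: the keyword list, the `GuardrailResult` type, the function names, and the rewrite template are all hypothetical, and a real system would presumably use a learned classifier and a learned rewriter rather than string matching.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical keyword list, for illustration only.
SENSITIVE_KEYWORDS = ("dieting", "vaping")


@dataclass
class GuardrailResult:
    action: str             # "pass", "refuse", or "rewrite"
    prompt: Optional[str]   # prompt forwarded to the model; None if refused


def is_sensitive(prompt: str) -> bool:
    """Toy sensitivity check based on substring matching (an assumption)."""
    lowered = prompt.lower()
    return any(keyword in lowered for keyword in SENSITIVE_KEYWORDS)


def refusal_guardrail(prompt: str) -> GuardrailResult:
    """Suppression-style guardrail: sensitive prompts are dropped entirely."""
    if is_sensitive(prompt):
        return GuardrailResult(action="refuse", prompt=None)
    return GuardrailResult(action="pass", prompt=prompt)


def rewrite_guardrail(prompt: str) -> GuardrailResult:
    """Rewrite-style guardrail: sensitive prompts are reframed, not dropped."""
    if is_sensitive(prompt):
        # Hypothetical rewrite template that asks for constructive guidance.
        rewritten = (
            "The user is an adolescent. Respond with age-appropriate, "
            "constructive guidance to the following request: " + prompt
        )
        return GuardrailResult(action="rewrite", prompt=rewritten)
    return GuardrailResult(action="pass", prompt=prompt)
```

In this sketch, the refusal path ends the conversation because nothing is forwarded to the model, while the rewrite path always forwards a prompt, so the user still receives a response.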