Grader Evasion vs FLIP Vulnerability and Authority Gradient Attack | Research | Failure-First

Adrian Wedd

Report 260 | Research: Empirical Study | 2026-03-25

Summary

This report tests whether FLIP grading is vulnerable to Grader Evasion attacks. It also introduces and tests the Authority Gradient attack, which exploits models’ tendency to calibrate their compliance to the perceived authority level of the requester.