Published
Report 285 Research — Empirical Study

Summary

First empirical analysis of the Safety Polypharmacy Hypothesis using: (1) controlled defense-layering experiment across 3 models and 4 defense levels (120 traces), (2) OBLITERATUS natural experiment documenting safety re-emergence across 22 abliterated models (14,914 traces), and (3) DETECTED_PROCEEDS analysis (53,831 LLM-graded traces).

This research informs our commercial services. See how we can help →