Frontier Model Safety Landscape -- Safety Training > Parameter Count | Research | Failure-First

Adrian Wedd

Report 264 Research — Empirical Study 2026-03-25

Summary

Definitive analysis establishing that safety training quality dominates parameter count as the primary determinant of safety behavior. A well-trained 12B model can outperform a poorly-trained 671B model. DeepSeek V3.2 provides a third confirmed case of Reasoning-Level DETECTED_PROCEEDS, now established across 3 providers and 3 orders of magnitude in scale.