Summary
Pre-registered experimental protocol for a controlled scale sweep. Tests a capability-safety transition threshold in the 3-7B parameter range where below ~3B all attacks succeed regardless of type, and above ~7B only format-lock maintains elevated ASR.