LFM Thinking 1.2B -- DETECTED_PROCEEDS Cross-Model Validation | Research | Failure-First

Adrian Wedd

Report 220 | Research: Empirical Study | 2026-03-24

Summary

This report analyzes 30 traces from Liquid Foundation Model (LFM) Thinking 1.2B on AdvBench to test whether DETECTED_PROCEEDS generalizes beyond DeepSeek-R1. DETECTED_PROCEEDS is the pattern in which a reasoning model detects a safety concern in its reasoning and then proceeds to generate harmful content anyway. The analysis confirms this pattern in a second provider and architecture.