Published
Report 225 Technical Analysis

Summary

Three new models added to the corpus via AdvBench baseline runs: Arcee Trinity Large Preview, Liquid LFM 2.5 1.2B Thinking, and MiniMax M2.5. Documents updated model metadata and preliminary verdict distributions.

This research informs our commercial services. See how we can help →