Summary
Three new models added to the corpus via AdvBench baseline runs: Arcee Trinity Large Preview, Liquid LFM 2.5 1.2B Thinking, and MiniMax M2.5. Documents updated model metadata and preliminary verdict distributions.
Three new models added to the corpus via AdvBench baseline runs: Arcee Trinity Large Preview, Liquid LFM 2.5 1.2B Thinking, and MiniMax M2.5. Documents updated model metadata and preliminary verdict distributions.
This research informs our commercial services. See how we can help →