Work
with us

Services grounded in adversarial research

Our commercial services derive from the largest open adversarial dataset for embodied AI. Every engagement is backed by a 142,307-prompt jailbreak corpus, 346 documented attack techniques, and evaluation results across 258 models spanning 6 research eras (2022–2026).

Services

Intelligence Briefs

Custom research synthesis, threat landscape analysis, and policy intelligence for internal teams and insurers. Monthly briefs or one-time deep-dives.

Penetration Testing

Full-stack AI systems pentesting — web, cloud, supply chain, and the AI layer in one engagement. Infrastructure to model, one report.

Red Team Assessments

Adversarial testing for foundation models, agentic systems, and multi-agent environments. Tailored attack scenarios from our validated taxonomy.

Safety Audits

Third-party safety assessment for humanoid robots and VLA systems. Evidence-based certification against Multi-Agent Safety Standards.

Advisory

Strategic guidance on EU AI Act compliance, NIST frameworks, insurance requirements, and regulatory positioning.

Assessment Tiers

Three structured engagement levels, each designed for a specific deployment stage and regulatory need. All tiers use FLIP (Failure-Level Impact Protocol) grading with documented inter-rater reliability.

Tier 1

Quick Scan

AUD $5K - $10K

50-100 adversarial scenarios from validated taxonomy
Top 5 attack families for your deployment context
FLIP-graded vulnerability profile
Executive summary with corpus baseline comparison
Delivered in 5-7 business days

Best for: Pre-deployment sanity check, model selection, internal risk committees

Tier 2

Certification Prep

AUD $25K - $50K

200-500 scenarios across all relevant attack families
Multi-layer testing: text, action, compositional
EU AI Act Art 9/15 and Machinery Reg mapping
Gap analysis vs NIST AI RMF and draft harmonised standards
Technical report for conformity assessment documentation
Remediation roadmap, delivered in 3-4 weeks

Best for: EU AI Act compliance (Aug 2026 deadline), Machinery Regulation prep, regulatory submissions

Tier 3

Ongoing Monitoring

AUD $2K - $5K/mo

Monthly adversarial probe (50-100 scenarios)
New attack technique coverage as threats emerge
GLI regulatory monitoring for your jurisdiction
Quarterly threat landscape brief
48-hour incident response for disclosed vulnerabilities
Monthly trend dashboard

Best for: Deployed systems, fleet operators, continuous compliance obligations

Why Failure-First?

142,307

Adversarial Prompts

258

Models Evaluated

346+

Attack Techniques

Policy Reports

Attack taxonomy grounded in empirical testing, not hypothetical scenarios
6 documented eras of jailbreak evolution from DAN personas (2022) to reasoning model exploits (2025)
Policy synthesis from 100-200+ sources per report, covering EU AI Act, NIST AI RMF, ISO standards
Open-source validation via public repository with 26 published research reports

Get Started

Discovery calls are free. We scope engagements based on your deployment timeline, risk profile, and regulatory obligations. Typical scoping takes 5 business days.

Email [email protected]

Alternative: Contact form

Research Context

Responsible Disclosure Agreement: All engagements include a coordinated disclosure agreement. Discovered vulnerabilities are reported to you first, with mutually agreed timelines for public findings.

Workwith us