Ominous 2.0™

Your voice agent breaks in ways you have not imagined yet.

Ominous runs 1,000 scenarios against your agent in minutes and fixes what it finds.

Ominous is built to find weaknesses before your customers do. It fires 1,000 real-world scenarios at your agent, spots where it fails, and automatically generates candidate patches for those gaps. After every run, your Brain Health score updates to show how much your agent improved.

Start Free — Get 1,000 Test Scenarios

1,000

Scenarios per run

< 7 min

Avg run time

10

Languages supported

Live scenario stream

Running

Caller books dental cleaning — happy path

EN · passed · 843 ms

Buyer asks for property valuation

EN · passed · 1102 ms

Prompt injection via caller name field

EN · failed · 764 ms

Urgent legal intake — conflict-check gap

EN · passed · 921 ms

HVAC booking outside service area

ES · failed · 1340 ms

Results update every 2s · scenarios non-repeating

Runs while you sleep

Schedule Ominous after every deploy. Failures surface in your dashboard before you have finished your morning coffee.

Adversarial by default

Every run includes hostile callers, prompt injection attempts, policy violations, and language edge cases. Not just happy paths.

Market-ready activation built in

Ominous 2.0 tracks Brain Health on every run and marks an agent as market-ready once it reaches your target score. Nothing goes live until you approve.

What every run gives you

Pass Rate — Last Run

Based on 1,000 scenarios

960 passed · 30 failed · 10 errors

Passed

960

Failed

30

Errors

10
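To make the sample numbers concrete, here is a minimal sketch of how the three counts translate into rates. The `RunSummary` class is a hypothetical helper written for this illustration, not part of any Ominous API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunSummary:
    """Hypothetical container for the three counts shown after a run."""
    passed: int
    failed: int
    errors: int

    @property
    def total(self) -> int:
        return self.passed + self.failed + self.errors

    def rate(self, count: int) -> float:
        """Share of the run as a percentage, rounded to one decimal."""
        return round(100 * count / self.total, 1) if self.total else 0.0


# Counts from the sample run above: 960 passed, 30 failed, 10 errors.
last_run = RunSummary(passed=960, failed=30, errors=10)
print(
    f"pass {last_run.rate(last_run.passed)}% · "
    f"fail {last_run.rate(last_run.failed)}% · "
    f"error {last_run.rate(last_run.errors)}%"
)
```

For the sample run this works out to a 96% pass rate, with 3% failures and 1% errors.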

Multi-Language Training

Train and audit your agent in any language

Select a training language before running Ominous and the entire scenario wave fires in that language. Run the same agent across languages to surface multilingual failure modes invisible in English-only testing.

Each language run produces an independent Brain Health score so you know exactly which locales need work.

EN English

ES Spanish

FR French

DE German

PT Portuguese

ZH Mandarin

YUE Cantonese

JA Japanese

AR Arabic

HI Hindi
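As a rough illustration of per-language scoring, the sketch below groups scenario outcomes by language code and computes a pass rate for each. The sample rows mirror the live scenario stream shown earlier on this page. The function name is hypothetical, and using pass rate alone as a stand-in for Brain Health is a simplifying assumption, not a description of Ominous internals.

```python
from collections import defaultdict

# (language code, outcome) pairs, mirroring the sample stream on this page.
results = [
    ("EN", "passed"),
    ("EN", "passed"),
    ("EN", "failed"),
    ("EN", "passed"),
    ("ES", "failed"),
]


def pass_rate_by_language(rows):
    """Return {language: fraction of scenarios passed} for each language seen."""
    totals = defaultdict(int)
    passes = defaultdict(int)
    for lang, outcome in rows:
        totals[lang] += 1
        if outcome == "passed":
            passes[lang] += 1
    return {lang: passes[lang] / totals[lang] for lang in totals}


rates = pass_rate_by_language(results)
for lang, rate in sorted(rates.items()):
    print(f"{lang}: {rate:.0%} passed")
```

Keeping the tallies separate per language is what lets a weak locale stand out instead of being averaged away by a strong English run.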

Fully Autonomous

Ominous runs, detects, fixes, and verifies — without you writing a single line.

Ominous 2.0 is fully autonomous from detection to fix. Once you start a run, it reads your agent, generates a complete adversarial scenario library, fires every scenario, and classifies every failure. When issues are found, Ominous automatically creates a targeted candidate fix — no manual triage, no scripting, no guesswork.

The fix is held as a candidate until you manually run the second verification wave. That wave proves whether your agent actually improved before anything touches your live prompt. You stay in control of the final call — Ominous handles everything else.

Zero-script scenario generation

Ominous reads your website, infers your industry, and builds a full non-repeating scenario graph automatically. No test scripts to write or maintain.

Candidate fix in seconds

After every run, Ominous clusters failures, removes duplicates, and generates a safe candidate prompt patch targeting the highest-severity issues first.

Manual activation gate

Fixes never go live automatically. You run the second verification wave yourself, review the improvement in Brain Health, then activate only when you approve.
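The cluster, de-duplicate, and rank step can be pictured with a short sketch. Everything here is an assumption made for illustration: the `Failure` fields, the category labels, the numeric severity scale, and the rule that a cluster ranks by its worst failure. The actual Ominous logic is not documented on this page.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Failure:
    category: str  # hypothetical label, e.g. "prompt_injection"
    scenario: str
    severity: int  # hypothetical scale: higher is worse


def triage(failures):
    """Group failures by category, drop duplicates, rank clusters by worst severity."""
    clusters = defaultdict(list)
    seen = set()
    for f in failures:
        # Treat scenarios that differ only by case or padding as duplicates.
        key = (f.category, f.scenario.strip().lower())
        if key in seen:
            continue
        seen.add(key)
        clusters[f.category].append(f)
    return sorted(
        clusters.items(),
        key=lambda item: max(f.severity for f in item[1]),
        reverse=True,
    )


failures = [
    Failure("out_of_area", "HVAC booking outside service area", 2),
    Failure("prompt_injection", "Prompt injection via caller name field", 5),
    Failure("prompt_injection", "prompt injection via caller name field ", 5),
]
ranked = triage(failures)
for category, items in ranked:
    print(category, len(items))
```

In this sample the duplicated prompt-injection failure collapses to one entry, and its cluster is ranked ahead of the lower-severity service-area failure.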

How Ominous works

One button. Four phases. Your agent comes out tighter on the other side.

Analyze

Ominous scrapes your agent's website, reads the system prompt, detects business type, and builds an industry-specific scenario graph.

Run

1,000 non-repeating scenarios fire against your agent: happy paths, edge cases, hostile inputs, multilingual calls, latency races.

Detect + Fix

Every response is scored. Failures are classified, ranked by severity, and addressed in a candidate patch of targeted prompt edits.

Verify + Score

You manually run a second verification wave against the candidate fix. Results feed directly into Brain Health so you can activate only when metrics improve.
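One way to picture the activation gate is as three conditions that must all hold before a candidate fix can go live. The sketch below is a hypothetical model of that gate. The class, its fields, and the "strictly higher Brain Health" rule are assumptions drawn from the description above, not a documented Ominous interface.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CandidateFix:
    """Hypothetical model of a fix held as a candidate until approved."""
    baseline_health: float                    # Brain Health before the fix
    verified_health: Optional[float] = None   # set by the verification wave
    approved: bool = False                    # set only by the user

    def record_verification(self, health: float) -> None:
        self.verified_health = health

    def can_activate(self) -> bool:
        # All three must hold: verified, improved, and approved by the user.
        return (
            self.verified_health is not None
            and self.verified_health > self.baseline_health
            and self.approved
        )


fix = CandidateFix(baseline_health=62.0)
print(fix.can_activate())   # False: no verification wave has run yet

fix.record_verification(78.0)
print(fix.can_activate())   # False: improved, but not yet approved

fix.approved = True
print(fix.can_activate())   # True: verified, improved, and approved
```

Modeling approval as a separate flag keeps the final decision with the user even when the verification wave shows an improvement.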

Brain Health

Your agent's fitness score, live

After every Ominous run, a Brain Health percentage is calculated from pass rate, error rate, latency stability, and fix coverage. It starts wherever your agent is today and climbs as Ominous clears failures and confirms fixes held.

Most agents see Brain Health improve from the low 60s to the high 90s within 5 to 8 runs. The score is recalculated on every run, so it always reflects your agent as it is today.

Before Ominous (Run 1): 62%

After Run 3: 78%

After Run 6: 94%
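This page names the four inputs to Brain Health but not how they are combined. Purely as an illustration, the sketch below blends them with a weighted average. The weights, the 0-to-1 scaling of each input, and the function name are all assumptions, not the actual formula.

```python
# Hypothetical weights; the real Brain Health formula is not published here.
WEIGHTS = {
    "pass_rate": 0.50,
    "error_free_rate": 0.20,   # 1 - error rate, so higher is better
    "latency_stability": 0.15,
    "fix_coverage": 0.15,
}


def brain_health(pass_rate, error_rate, latency_stability, fix_coverage):
    """Blend four 0..1 inputs into a 0..100 score (illustrative only)."""
    inputs = {
        "pass_rate": pass_rate,
        "error_free_rate": 1.0 - error_rate,
        "latency_stability": latency_stability,
        "fix_coverage": fix_coverage,
    }
    for name, value in inputs.items():
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")
    score = 100 * sum(WEIGHTS[name] * value for name, value in inputs.items())
    return round(score, 1)


score = brain_health(
    pass_rate=0.96, error_rate=0.01, latency_stability=0.90, fix_coverage=0.80
)
print(f"Brain Health: {score}%")
```

Because the weights sum to 1 and every input is capped at 1, a score built this way can never exceed 100%.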

Token pricing

1 token = 1 scenario. 1 run = 1,000 scenarios = 1,000 tokens. Simple and linear.

Starter

$10

1,000 tokens

1 full run

$0.01 / scenario

Get started

Growth

$49

10,000 tokens

10 full runs

$0.0049 / scenario

Get started

Pro

$199

50,000 tokens

50 full runs

$0.004 / scenario

Get started

Enterprise

$499

200,000 tokens

200 full runs

$0.0025 / scenario

Get started

Tokens never expire. 1,000 free tokens included with every VXERA account. No subscription required.
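The per-scenario prices above follow directly from each tier's price and token count. A small sketch of that arithmetic, using the figures on this page, follows. The helper name is hypothetical, and `Decimal` is used so the rounding to four decimal places is exact.

```python
from decimal import ROUND_HALF_UP, Decimal

TOKENS_PER_RUN = 1_000  # 1 run = 1,000 scenarios = 1,000 tokens

# (price in USD, tokens) for each tier listed above.
TIERS = {
    "Starter": (10, 1_000),
    "Growth": (49, 10_000),
    "Pro": (199, 50_000),
    "Enterprise": (499, 200_000),
}


def per_scenario(price_usd: int, tokens: int) -> Decimal:
    """Price per scenario, rounded half-up to four decimal places."""
    raw = Decimal(price_usd) / Decimal(tokens)
    return raw.quantize(Decimal("0.0001"), rounding=ROUND_HALF_UP)


for name, (price_usd, tokens) in TIERS.items():
    runs = tokens // TOKENS_PER_RUN
    print(f"{name}: {runs} full runs, ${per_scenario(price_usd, tokens)} per scenario")
```

The Pro and Enterprise figures are rounded: the unrounded values are $0.00398 and $0.002495 per scenario.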

Why Ominous is different

Traditional QA	Ominous
Manual test scripts required	Zero-script autonomous testing + manual activation gate
Tests 50–100 cases max	Tests 1,000 scenarios per run
Takes days or weeks to set up	Runs in under 10 minutes
Finds issues after customers complain	Catches edge cases before first caller
English only	10 languages: EN, ES, FR, DE, PT, ZH, YUE, JA, AR, HI
Guesswork fix guidance	Candidate patches + re-run verification

Get started free

1,000 scenarios. No credit card.

Every VXERA account ships with 1,000 free Ominous tokens. Pick an agent, add a website, and see your Brain Health score in under 10 minutes.

Start Free — Get 1,000 Test Scenarios