Ungameable verification that evolves with the threat.
A self-evolving swarm of AI auditors.
Autonomous Healthcare Hackathon · Elden Gu
01The problem
AI writes everything. Almost nothing gets verified.
Healthcare billing
$450 charged for a $15 blood draw
Supply chain
A fentanyl shipment, disguised as textiles
A fixed inspector gets memorized and bypassed. The adversary adapts — the inspector doesn't.
02The idea
A swarm of AI auditors that evolves.
Ungameable — because we plant the errors and hold the answer key. The detectors can't talk their way to a score.
→F1, so "flag everything" loses
→Held-out test scored once
→No reward-hacking
03Why a swarm
One checker — or a team?
One agent, evolving
evolves · but one shape · blind spots remain
vs
A swarm, evolving
many shapes · coverage emerges
04Domain 1 · medical bills — what failed
The interesting part is what failed.
one detector
misses the subtle errors
majority vote
buries the lone specialist 0.00
union
recovers it
greedy selection
collapses — discards unique specialists
coverage + merge
finally climbs 0.82 → 0.87
The honest part
Validation 0.87 · Test 0.62 — a real gap, exposed on purpose.
Patient bill · auditing
99213Office visit$120
85025Blood draw$450
80053Metabolic panel$95
70553Brain MRI$1,200
36415Venipuncture$15
99213Office visit · dup$120
90837Therapy 60m$180
auditing…
05An adapting adversary
Same engine. Harder problem.
Synthetic supply chain, modeled on the anomaly signals FinCEN & C4ADS document — shared addresses, shared officers, fake facades. The adversary adapts, the swarm craters — but one signal survives.
Against a moving adversary, diversity is insurance.
diverse + exploration → recoversgreedy / exploit → stays flat
06A live system
Not a script — a system that watches itself.
evaluate→detect drop→explore→evolve→recover
The adaptation loop as a durable workflow on Inngest — event-driven, every step observable.
07Why it matters
100+
Americans die from fentanyl every day — even after last year's sharp decline.
Networks change disguise; fixed inspectors fall behind. Self-evolving verification keeps pace — surfacing leads, faster.
The boundary is the strength
Leads, not verdicts. Verifiable facts only. The human decides.
AutoSwarm
Ungameable verification that evolves with the threat.