NUA Launches AI Agent That Tests Software Against Real User Intent, Not Just Code Correctness

A new tool called NUA hit Hacker News today with a pitch that's hitting close to home for anyone who's watched AI coding agents run wild overnight. The agent—built by a team that's been running background Claude loops extensively—wakes up to PRs that don't solve the actual problem, made on assumptions that were flat-out wrong. Their tests were tautological, confirming what was built rather than what users actually needed.

The Core Problem With AI-Written Tests

The NUA team identified a critical gap in how AI agents handle testing. When Claude or similar models write tests for code they themselves generated, those tests tend to be circular—they verify the implementation exists, not that it solves the user's intent. 'We wanted an agent that took all the [context] and tested for what we actually wanted,' the team explains on their site. This means moving beyond unit test coverage percentages toward something closer to acceptance testing driven by actual user scenarios. NUA tackles this by running adversarial probes against your product—generated from real regulatory rules, PRDs, customer calls, or feedback—and submitting them exactly like a real user would. The system then catches what your product's approval flow misses. In their demo, they show NUA generating an ad for 'Meridian Capital' with the text: 'Earn a guaranteed 12% — every year. Zero risk. Your principal is always protected.' This violates SEC Marketing Rule 206(4)-1, which prohibits false or misleading claims including guarantees of returns or 'no-risk' performance promises—and NUA catches it.

Starting in Regtech, Planning to Expand

The team chose regulation as their wedge into the market because it's 'the highest-stakes source of intent.' They track every SEC and FINRA rule as it changes, learn what your app actually does, and apply only the rules that matter. The result is a live, audit-ready coverage report generated continuously rather than assembled by hand before compliance deadlines. For regtech teams selling into financial services—where manual proof assembly is still the norm—this addresses genuine pain around sales cycles and audit prep. NUA runs these probes 'around the clock,' re-checking whenever your product or the rules change. The system never stops, which means you're not just testing at deploy time but continuously verifying that shipped features match regulatory intent. This perpetual monitoring approach separates them from traditional compliance tooling that treats testing as a point-in-time event rather than an ongoing process.

Private Beta and Future Scope

NUA is currently in private beta, onboarding a limited number of regtech teams serving financial services clients. The team explicitly states they see regulation as just the starting point—'Regulation PRDs Customer calls Feedback Nua Verified product.' This suggests their intent-matching framework could eventually apply to any domain where user intent documentation exists and needs verification against live code.

Key Takeaways

NUA tests for actual user intent rather than implementation correctness, addressing a fundamental gap in AI-generated test suites
Their first target is regtech compliance, using SEC Marketing Rule 206(4)-1 as a concrete example of adversarial probe generation
The system runs continuously, re-checking when either the product or applicable rules change
They're positioning this as proof-of-coverage tooling for sales cycles and audit seasons, not just developer testing

The Bottom Line

This is exactly the kind of tooling that gets ignored until it saves your ass during an SEC audit. The idea of treating compliance requirements as executable specifications—rather than checkbox documentation—is long overdue. Whether NUA can scale beyond regtech's high-stakes niche into general product correctness verification will determine if this stays a vertical tool or becomes something every engineering team wants in their CI pipeline.

> NUA Launches AI Agent That Tests Software Against Real User Intent, Not Just Code Correctness

The Core Problem With AI-Written Tests

Starting in Regtech, Planning to Expand

Private Beta and Future Scope

Key Takeaways

The Bottom Line

> RELATED DISPATCHES