// AI agent testbed · Indian bureaucratic workflows

Test your agent
against a bureaucracy
that bites back.

High-fidelity simulations of Indian government workflows — every tool, rule, and failure mode sourced from real audits. Test your agent here, before production does.

// how it works

Four things. One run.

01

World

A simulation of one Indian bureaucratic domain: tools, rules, schemas, real failure modes. Modeled from CAG (Comptroller and Auditor General) audits and RTI (Right to Information) records.

02

Agent

Your agent, any framework. It calls the world's tools over MCP. A reference LangGraph + Claude agent ships with every world.

03

Scenario

A batch of cases — clean and adversarial — seeded from real failure modes. Define what success means. The world enforces the rest.

04

Scoreboard

Three axes: did it reach the goal, did it respect the rules, what did it cost. Full trace of every tool call and rule violation.
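
The four pieces above can be pictured as a small data model. The sketch below is illustrative only: the class names, fields, tool names, rule ids, and the `score_run` helper are all assumptions made for this example, not the testbed's actual API. It shows a trace of tool calls being reduced to the three scoreboard axes.

```python
from dataclasses import dataclass, field


@dataclass
class ToolCall:
    """One entry in a run trace (hypothetical shape)."""
    tool: str
    cost: float  # cost attributed to this call, in arbitrary units
    violations: list[str] = field(default_factory=list)  # ids of rules this call broke


@dataclass
class Score:
    goal_reached: bool
    rule_violations: int
    total_cost: float


def score_run(trace: list[ToolCall], goal_reached: bool) -> Score:
    """Collapse a trace into the three axes: goal, rules, cost."""
    return Score(
        goal_reached=goal_reached,
        rule_violations=sum(len(call.violations) for call in trace),
        total_cost=sum(call.cost for call in trace),
    )


# Hypothetical trace: tool names and rule ids are invented for illustration.
trace = [
    ToolCall(tool="lookup_applicant", cost=0.25),
    ToolCall(tool="approve_payment", cost=0.5, violations=["R3-missing-verification"]),
]
score = score_run(trace, goal_reached=True)
print(score)
```

Keeping the axes separate means a run that reaches the goal while breaking a rule still surfaces the violation instead of hiding it inside a single pass/fail number.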

// world library

Worlds in scope.

A library of high-fidelity simulated worlds for Indian government workflows. Bring an agent from any framework; it calls into the world through a typed tool protocol (MCP). The testbed runs it, scores it, and lets you replay the traces.
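
For readers new to MCP: it is built on JSON-RPC 2.0, and a tool invocation is a `tools/call` request carrying the tool name and its arguments. The sketch below builds such a message with the Python standard library only. The `build_tool_call` helper, the tool name `check_pension_status`, and its `applicant_id` argument are hypothetical, chosen for illustration; the worlds' real tool names and schemas are not shown here.

```python
import json


def build_tool_call(request_id: int, tool: str, arguments: dict) -> str:
    """Serialize an MCP-style `tools/call` JSON-RPC 2.0 request."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(message)


# Hypothetical tool and argument names, for illustration only.
request = build_tool_call(1, "check_pension_status", {"applicant_id": "A-001"})
print(request)
```

In practice an MCP client library handles this framing, so agent code usually deals in tool names and arguments rather than raw messages.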

● LIVE

UP Pension Disbursement

Vridha (old-age) · Vidhwa (widow) · Divyang (disability) pensions. 8 tools, 8 rules, and a Rs 43 crore scam reproduced.

SOON
UP Property Registration
SOON
GST Filing
SOON
Passport Renewal
SOON
FASTag Dispute
SOON
Scholarship Disbursement
SOON
Ration Card Update
+ more →

Drop your agent in.

See what it actually does.

Enter the workbench →