High-fidelity simulations of Indian government workflows — every tool, rule, and failure mode sourced from real audits. Test your agent here, before production does.
A simulation of one Indian bureaucratic domain — tools, rules, schemas, real failure modes. Modeled from CAG audits and RTI records.
Your agent, any framework. It calls the world's tools over MCP. A reference LangGraph + Claude agent ships with every world.
A batch of cases — clean and adversarial — seeded from real failure modes. Define what success means. The world enforces the rest.
Three axes: did it reach the goal, did it respect the rules, what did it cost. Full trace of every tool call and rule violation.
A library of high-fidelity simulated worlds for Indian government workflows. Users bring agents from any framework. Agents call into the world via a typed tool protocol (MCP). The testbed runs them, scores them, and lets them replay traces.
Vridha · Vidhwa · Divyang. 8 tools, 8 rules, Rs 43 cr scam reproduced.
See what it actually does.
Enter the workbench →