Open source mock environments for AI agents

Test your agents
without the chaos

Safe, reproducible, API-identical mock environments so your agents can learn, fail, and improve — without deleting real emails or booking phantom meetings.

terminal
$ pip install smolclaws
$ smolclaw seed --scenario default
  Seeded database with scenario 'default': Users: 1
$ smolclaw serve --port 8001
  Gmail API running at http://localhost:8001
  54 endpoints ready. 0 real APIs harmed. 🦞

Agents break things.
That's how they learn.

Real APIs are unforgiving. One bad function call and your agent has archived your entire inbox, double-booked your Tuesday, or replied-all with hallucinated contract terms.

smolclaw gives agents a safe place to fail. Every environment is API-identical, fully seedable, and instantly resettable — so you can run thousands of trials without consequences.

💥 100% of agents eventually do something destructive in production
🚀 3,000+ seed scenarios per environment for stress testing
0 sec to reset and re-run (instant snapshot restore)

Environments

Each environment is a complete mock of a real API — same endpoints, same schemas, same auth patterns. Swap one URL and your agent can't tell the difference.

Live

claw-gmail

Full Gmail API mock. 54 endpoints including messages, threads, labels, drafts, and history. FastAPI + SQLite with seedable scenarios.

54 endpoints · 3,000+ scenarios · snapshots · admin API
Coming soon

claw-calendar

Google Calendar API mock. Events, recurring schedules, attendees, free/busy queries. Test scheduling agents without real calendars.

events · recurring · free/busy
Coming soon

claw-drive

Google Drive API mock. Files, folders, permissions, sharing. Let agents manage documents without touching real data.

files · permissions · sharing
Coming soon

claw-slack

Slack API mock. Channels, messages, threads, reactions. Test communication agents in isolated workspaces.

channels · threads · reactions

How It Works

Three concepts. Drop-in replacement for real APIs.

1. Seed

Pick a scenario — default, long_context, or a custom task — and populate the environment with realistic data.

2. Run

Point your agent at the mock API. Same endpoints, same auth, same response schemas. Your agent code doesn't change at all.

3. Evaluate

Snapshot diffs show exactly what changed. Deterministic verifiers check if the agent did the right thing. Reset and repeat.
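
To make the evaluate step concrete, here is a minimal Python sketch of a snapshot diff feeding a deterministic verifier. It does not use smolclaw's own snapshot format or verifier API: the snapshot shape (message id mapped to a set of labels) and the names diff_snapshots and verify_archived_only are assumptions made up for illustration. The only Gmail fact it relies on is that archiving a message means removing its INBOX label.

```python
from typing import Dict, FrozenSet

# Hypothetical snapshot shape: message id -> set of label names.
Snapshot = Dict[str, FrozenSet[str]]


def diff_snapshots(before: Snapshot, after: Snapshot) -> dict:
    """Return which messages were added, removed, or relabeled."""
    added = sorted(set(after) - set(before))
    removed = sorted(set(before) - set(after))
    changed = {
        msg_id: {
            "labels_added": sorted(after[msg_id] - before[msg_id]),
            "labels_removed": sorted(before[msg_id] - after[msg_id]),
        }
        for msg_id in sorted(set(before) & set(after))
        if before[msg_id] != after[msg_id]
    }
    return {"added": added, "removed": removed, "changed": changed}


def verify_archived_only(diff: dict, target_id: str) -> bool:
    """Pass only if the sole change is removing INBOX from the target message."""
    expected = {target_id: {"labels_added": [], "labels_removed": ["INBOX"]}}
    return not diff["added"] and not diff["removed"] and diff["changed"] == expected


# Toy state before and after an agent was asked to archive message m1.
before = {"m1": frozenset({"INBOX", "UNREAD"}), "m2": frozenset({"INBOX"})}
after = {"m1": frozenset({"UNREAD"}), "m2": frozenset({"INBOX"})}

diff = diff_snapshots(before, after)
print(diff)
print(verify_archived_only(diff, "m1"))
```

Because the verifier compares the whole diff against one expected change, an agent that archives the right message but also touches anything else fails the check.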

Your Agent (OpenClaw, Pi, any harness) → 🦞 smolclaw (mock API + seed data) → Evaluate (diffs, verifiers, scores)

Want the full optimization loop? envdash connects smolclaw environments with GEPA skill optimization, Harbor task runners, and automated evaluation pipelines.

API-Identical

Same endpoints, same request/response schemas, same error codes. Change one environment variable to switch between mock and production.
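
One way an agent harness might implement that switch in Python is sketched below. GMAIL_API_URL is the variable used in the quickstart; the fallback to Google's production host and the helper names gmail_base_url and messages_list_url are assumptions for illustration, as is the expectation that the mock serves the same /gmail/v1/... paths as the real Gmail REST API.

```python
import os

# Google's production Gmail REST host, used when no override is set.
PRODUCTION_GMAIL_URL = "https://gmail.googleapis.com"


def gmail_base_url() -> str:
    """Use the mock when GMAIL_API_URL is set, otherwise production."""
    return os.environ.get("GMAIL_API_URL", PRODUCTION_GMAIL_URL).rstrip("/")


def messages_list_url(user_id: str = "me") -> str:
    """Build the users.messages.list URL against whichever base is active."""
    return f"{gmail_base_url()}/gmail/v1/users/{user_id}/messages"


print(messages_list_url())
```

Reading the variable at call time rather than at import time keeps the switch effective even if a test sets it after the module has loaded.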

Seedable

Deterministic scenarios, from the 57-email default inbox to 3,000+ complex threads. A content library generates realistic, diverse data, and the same scenario yields the same data on every run.
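
The idea behind seedable data can be sketched in a few lines of Python. This is not smolclaw's content library: the sender and subject pools and the generate_inbox function are hypothetical, and the sketch only shows how a private seeded random generator gives varied but repeatable data.

```python
import random

# Hypothetical content pools; a real content library would be far larger.
SENDERS = ["alice@example.com", "bob@example.com", "carol@example.com"]
SUBJECTS = ["Invoice attached", "Meeting moved", "Quick question", "Weekly report"]


def generate_inbox(seed: int, size: int) -> list:
    """Generate `size` toy emails; the same seed always gives the same inbox."""
    rng = random.Random(seed)  # private RNG, independent of global random state
    return [
        {"id": f"m{i}", "from": rng.choice(SENDERS), "subject": rng.choice(SUBJECTS)}
        for i in range(size)
    ]


inbox = generate_inbox(seed=7, size=57)
print(len(inbox), inbox == generate_inbox(seed=7, size=57))
```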

Snapshotable

Save state, run an agent, diff the results, restore. Run thousands of experiments from the exact same starting point.
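
The save, run, restore loop can be illustrated with Python's standard-library sqlite3 module, whose Connection.backup method copies one database over another. This is a sketch of the idea only, not smolclaw's snapshot implementation: the messages table, its columns, and the stand-in "agent" that archives everything are all made up for the example.

```python
import sqlite3


def snapshot(live: sqlite3.Connection) -> sqlite3.Connection:
    """Copy the live database into a fresh in-memory database."""
    saved = sqlite3.connect(":memory:")
    live.backup(saved)
    return saved


def restore(saved: sqlite3.Connection, live: sqlite3.Connection) -> None:
    """Overwrite the live database with the saved copy."""
    saved.backup(live)


def inbox_count(db: sqlite3.Connection) -> int:
    return db.execute("SELECT COUNT(*) FROM messages WHERE in_inbox = 1").fetchone()[0]


# Toy environment state: two messages, both in the inbox.
live = sqlite3.connect(":memory:")
live.execute("CREATE TABLE messages (id TEXT PRIMARY KEY, in_inbox INTEGER)")
live.executemany("INSERT INTO messages VALUES (?, ?)", [("m1", 1), ("m2", 1)])
live.commit()

saved = snapshot(live)

results = []
for trial in range(3):
    # Stand-in for a destructive agent run that archives every message.
    live.execute("UPDATE messages SET in_inbox = 0")
    live.commit()
    results.append(inbox_count(live))
    restore(saved, live)  # every trial starts from the same state

print(results, inbox_count(live))
```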

Docker-Ready

Every environment ships with a Dockerfile and a docker-compose file. One command spins up an isolated, reproducible test environment.

Admin API

Separate admin endpoints for seeding, snapshots, state inspection, and task management. Full control over the environment.

Open Source

MIT licensed. Contribute environments, seed scenarios, and evaluation tasks. Built by the community, for the community.

Get Started

Up and running in under a minute.

quickstart
# Install
$ pip install smolclaws
# Seed Gmail with default scenario (57 emails)
$ smolclaw seed --scenario default
# Start the mock API server
$ smolclaw serve --port 8001
# Point your agent at it
$ export GMAIL_API_URL=http://localhost:8001
$ your-agent run --task "triage my inbox"