The product

One workspace. Three modes. A skill graph that's honest with you.

aiworklab wraps the agent harness you already use and overlays a teaching layer that nobody else builds. Here's how every piece works, and how they fit together.


01 · the modes

Pick the relationship you want with your agent. Per task.

The agent stays the same: Claude Code, Codex, T3 Code, or OpenCode. Your relationship with it changes. Copilot is the default; the system also suggests a mode based on how novel the files you're touching are relative to your skill graph.

01 / Autopilot

The agent works. You ship.

Standard agent loop: plans, executes, edits, runs commands. No interruptions. Concept exposure is silently logged to your skill graph for later spaced retrieval.

default for · production work · infra chores · deadlines · already-mastered concepts

02 / Copilot

One beat of friction at the right moment.

Same speed as Autopilot, with one 15-second comprehension check before applying any non-trivial diff containing concepts you haven't yet demonstrated. Pass and merge.

default for · real work in unfamiliar code · the bulk of your day

03 / Coach

You write. It questions.

The agent withholds. Reviews your code, points to bugs, asks Socratic questions. Refuses to fix things for you. Heaviest skill-graph updates, deepest learning per minute.

default for · onboarding · weekly fly-solo · deload weeks · interview prep
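How the auto-suggestion weighs novelty isn't spelled out here, so the following is only a minimal sketch. It assumes novelty is the share of concepts in the touched files that you haven't yet demonstrated. The names `suggestMode` and `novelty` and the 0.6 cut-off are hypothetical.

```typescript
// Hypothetical sketch: names and thresholds are illustrative assumptions,
// not aiworklab's actual rule.
type Mode = "autopilot" | "copilot" | "coach";
type NodeState = "encountered" | "explained" | "demonstrated" | "mastered";

// Share of concepts in the touched files not yet demonstrated (0 to 1).
function novelty(concepts: string[], graph: Map<string, NodeState>): number {
  if (concepts.length === 0) return 0;
  const unfamiliar = concepts.filter((c) => {
    const state = graph.get(c); // undefined means never seen before
    return state !== "demonstrated" && state !== "mastered";
  });
  return unfamiliar.length / concepts.length;
}

function suggestMode(concepts: string[], graph: Map<string, NodeState>): Mode {
  if (concepts.length === 0) return "copilot"; // nothing known: keep the default
  const n = novelty(concepts, graph);
  if (n === 0) return "autopilot"; // everything already demonstrated
  if (n > 0.6) return "coach";     // mostly unfamiliar territory (assumed cut-off)
  return "copilot";
}
```

The suggestion stays a suggestion: the mode is still picked per task.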

02 · the skill graph

A graph of the concepts you've actually touched.

Not LeetCode topics. A per-user, per-repo graph of programming concepts that have appeared in your real code. Each node has a state machine that's honest with you.

Encountered

The concept appeared in code you accepted but didn't engage with. Logged silently from Autopilot mode.

Explained

You read or were given an inline explanation card when the concept came up.

Demonstrated

You passed a comprehension check on it, or you wrote it yourself in Coach mode.

Mastered

You demonstrated it across multiple spaced retrievals over 30+ days. Mastered concepts never trigger checks.
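The four states above can be sketched as a small state machine. This is an illustration under stated assumptions: the event names are hypothetical, "multiple" retrievals is taken to mean three, and a node never moves backwards.

```typescript
// Hypothetical sketch of the node state machine; event names and
// MIN_RETRIEVALS are assumptions made for illustration.
type SkillState = "encountered" | "explained" | "demonstrated" | "mastered";

type SkillEvent =
  | { kind: "explained" }                        // an explanation card was shown or read
  | { kind: "check_passed" }                     // comprehension check passed
  | { kind: "written_in_coach" }                 // you wrote it yourself in Coach mode
  | { kind: "retrieval_passed"; day: number };   // spaced retrieval passed on this day

interface SkillNode {
  state: SkillState;
  retrievalDays: number[]; // day numbers of successful spaced retrievals
}

const ORDER: SkillState[] = ["encountered", "explained", "demonstrated", "mastered"];
const MIN_RETRIEVALS = 3; // assumed meaning of "multiple"
const MIN_SPAN_DAYS = 30; // "over 30+ days"

// Move up to `target` if it is higher than the current state; never demote.
function raise(current: SkillState, target: SkillState): SkillState {
  return ORDER.indexOf(target) > ORDER.indexOf(current) ? target : current;
}

function step(node: SkillNode, ev: SkillEvent): SkillNode {
  switch (ev.kind) {
    case "explained":
      return { ...node, state: raise(node.state, "explained") };
    case "check_passed":
    case "written_in_coach":
      return { ...node, state: raise(node.state, "demonstrated") };
    case "retrieval_passed": {
      const days = [...node.retrievalDays, ev.day];
      const span = Math.max(...days) - Math.min(...days);
      const mastered = days.length >= MIN_RETRIEVALS && span >= MIN_SPAN_DAYS;
      return {
        state: raise(node.state, mastered ? "mastered" : "demonstrated"),
        retrievalDays: days,
      };
    }
  }
}
```

A new node would start as `{ state: "encountered", retrievalDays: [] }`, matching the silent logging from Autopilot.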

Skill graph preview · repo backend-api · sample data · legend: encountered, explained, demonstrated, mastered

03 · explain-to-merge

The single most important thing aiworklab does.

For agent-authored diffs above a configurable threshold (lines changed, files touched, or novelty against your skill graph), the merge button is gated. You write 2 to 3 sentences explaining what the diff does and why. An LLM judges the explanation against the diff.

PASS

Verdict pass

The concept advances toward "demonstrated." Merge proceeds. Total cost: about 30 seconds.

SOFT

Soft fail

You see what the judge thought you missed. You can revise, or escalate to Coach mode for that hunk.

SKIP

Force-merge

You can always skip. We log it. Your weekly retention report shows the trade-offs honestly. No nags, no shame.
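The gate and its three outcomes could be modelled roughly as follows. This is a sketch only: the field names, the threshold values used in the usage note, and the `GateOutcome` shape are hypothetical, and the LLM judge itself is left out.

```typescript
// Hypothetical sketch: field names and the outcome shape are illustrative
// assumptions. The judge that produces the verdict is not modelled.
interface DiffStats {
  linesChanged: number;
  filesTouched: number;
  novelConcepts: string[]; // concepts in the diff not yet demonstrated
}

interface GateThresholds {
  maxLines: number;
  maxFiles: number;
  maxNovelConcepts: number;
}

type Verdict = "pass" | "soft_fail" | "skip";

interface GateOutcome {
  merge: boolean;           // does the merge proceed now?
  advanceConcepts: boolean; // do the concepts move toward "demonstrated"?
  logSkip: boolean;         // is the skip recorded for the weekly report?
  offerCoach: boolean;      // offer to revise or escalate the hunk to Coach?
}

// The merge button is gated when any one threshold is exceeded.
function isGated(diff: DiffStats, t: GateThresholds): boolean {
  return (
    diff.linesChanged > t.maxLines ||
    diff.filesTouched > t.maxFiles ||
    diff.novelConcepts.length > t.maxNovelConcepts
  );
}

function resolve(verdict: Verdict): GateOutcome {
  switch (verdict) {
    case "pass":
      return { merge: true, advanceConcepts: true, logSkip: false, offerCoach: false };
    case "soft_fail":
      return { merge: false, advanceConcepts: false, logSkip: false, offerCoach: true };
    case "skip":
      return { merge: true, advanceConcepts: false, logSkip: true, offerCoach: false };
  }
}
```

With assumed thresholds of 50 lines, 3 files and 0 novel concepts, a 16-line diff in one file would still be gated if it introduces a concept you haven't demonstrated.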

workers/retry.ts · +12 -4 exponential backoff

explain to merge · why the jitter, and why re-throw non-transient errors?

Jitter de-synchronises retries so a fleet of workers doesn't hammer the dependency in lockstep after an outage. Re-throwing non-transient errors matters because retrying a 400 just burns the budget; only timeouts and 5xxs deserve another attempt.

PASS exponential-backoff advanced to demonstrated

Product preview with sample data.
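The diff in the preview isn't shown, so here is one way the explanation's two ideas could look in code: a sketch, not the contents of workers/retry.ts. It assumes the "full jitter" variant of exponential backoff and assumes errors carry either a numeric `status` or a `code` of `"ETIMEDOUT"`. The helper names are hypothetical.

```typescript
// Hypothetical sketch: the error shape (status / code) and all names are
// assumptions for illustration.
function httpError(status: number): Error {
  return Object.assign(new Error(`HTTP ${status}`), { status });
}

function timeoutError(): Error {
  return Object.assign(new Error("timed out"), { code: "ETIMEDOUT" });
}

// Only timeouts and 5xx responses are worth another attempt.
function isTransient(err: unknown): boolean {
  if (typeof err !== "object" || err === null) return false;
  const e = err as { status?: unknown; code?: unknown };
  if (e.code === "ETIMEDOUT") return true;
  return typeof e.status === "number" && e.status >= 500 && e.status <= 599;
}

// Full jitter: a random delay in [0, min(cap, base * 2^attempt)), so workers
// that failed together do not retry in lockstep.
function backoffDelayMs(
  attempt: number,
  baseMs = 100,
  capMs = 5000,
  rand: () => number = Math.random,
): number {
  const ceiling = Math.min(capMs, baseMs * 2 ** attempt);
  return Math.floor(rand() * ceiling);
}

async function withRetry<T>(
  op: () => Promise<T>,
  maxAttempts = 4,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await op();
    } catch (err) {
      // Re-throw non-transient errors at once, and give up when out of attempts.
      if (!isTransient(err) || attempt + 1 >= maxAttempts) throw err;
      await sleep(backoffDelayMs(attempt));
    }
  }
}
```

A 400 surfaces after a single call, while a 503 is retried up to the attempt limit with randomised waits in between.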

04 · spaced retrieval

Review prompts pulled from your own past code.

An FSRS-based scheduler picks 3 to 7 concepts due for retrieval each day. Prompts are extracted from code you committed weeks ago, not synthetic exercises. The result: review feels like reviewing your own work.

Free Spaced Repetition Scheduler (FSRS)

The modern, open scheduling algorithm that Anki adopted. Replaces SM-2 and calibrates well from a handful of reviews.

Code-grounded prompts

"Two weeks ago you wrote this query. Without looking, why did you choose a window function over a self-join?"

Fly-solo sessions

An optional weekly window of 30 to 60 minutes where the agent is read-only. Solo throughput is your headline metric, tracked over time.
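The daily pick of 3 to 7 concepts might look something like this. The sketch does not implement FSRS: it assumes each card's `dueDay` was already computed by an FSRS implementation. It also assumes that a thin day is topped up with the nearest upcoming cards. `ReviewCard` and `pickDaily` are hypothetical names.

```typescript
// Hypothetical sketch: selection only. Due dates are assumed to come from an
// FSRS scheduler; topping up to the minimum is an assumption.
interface ReviewCard {
  concept: string;
  dueDay: number;       // day number on which the card becomes due
  intervalDays: number; // current interval, for display
}

function pickDaily(cards: ReviewCard[], today: number, min = 3, max = 7): ReviewCard[] {
  // Most overdue first; cards due today or earlier form a prefix of this list.
  const byDue = [...cards].sort((a, b) => a.dueDay - b.dueDay);
  const dueCount = byDue.filter((c) => c.dueDay <= today).length;
  // At least `min` (if that many cards exist), at most `max`.
  const count = Math.min(max, Math.max(dueCount, Math.min(min, byDue.length)));
  return byDue.slice(0, count);
}
```

Capping at the maximum keeps a backlog from turning one day's review into a chore; the overflow simply stays most-overdue for tomorrow.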

today · 3 concepts due FSRS · sample data

async cancel · interval 14d

Two weeks ago you wrote a request handler that aborts on disconnect. Without looking, what's the difference between the cancellation token and the abort signal you used?

prepared stmts · interval 7d

In user_repo.ts you prepared statements once at module load. What's the failure mode if the connection drops?

CRDT merge · interval 3d

Recall: in a Yjs document, what happens when two clients edit the same line offline and reconnect?

05 · for engineering leaders

The artefact that closes the budget conversation.

An anonymised, aggregated dashboard of skill coverage across your engineering organisation. Metrics are concept-level and never include source code. Available on Team and Enterprise tiers.

142

concepts mastered across the org

+18%

solo throughput, quarter over quarter

bus-factor warnings (concepts held by 2 or fewer engineers)

94%

7-day FSRS retention, team average
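The bus-factor warning is the simplest of these metrics to illustrate. A minimal sketch, assuming the dashboard receives only anonymised (engineer, concept) mastery pairs; the `MasteryRecord` shape and function name are hypothetical.

```typescript
// Hypothetical sketch: the record shape is an assumption. Only anonymised ids
// and concept names are involved, no source code.
interface MasteryRecord {
  engineerId: string; // anonymised identifier
  concept: string;
}

// Concepts mastered by `threshold` or fewer distinct engineers.
function busFactorWarnings(records: MasteryRecord[], threshold = 2): string[] {
  const holders = new Map<string, Set<string>>();
  for (const r of records) {
    const set = holders.get(r.concept) ?? new Set<string>();
    set.add(r.engineerId);
    holders.set(r.concept, set);
  }
  const warnings: string[] = [];
  holders.forEach((set, concept) => {
    if (set.size <= threshold) warnings.push(concept);
  });
  return warnings.sort();
}
```

Counting distinct holders per concept means duplicate records for the same engineer don't hide a warning.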

cohort comparison

Engineers hired in 2026 vs 2024

sample data

2024 hires · concept retention at 30d · 81%

2026 hires · concept retention at 30d · 52%

In this sample, the gap illustrates the cohort effect of unmetered AI assistance. aiworklab is designed to close it without sacrificing throughput.

06 · integrations

Standing on the shoulders of giants.

We don't rebuild the agent loop. We integrate with the agent harnesses you already trust, contribute upstream where the licence allows, and put our work in the layer above.

Claude Code

Anthropic's agent loop, integrated through the official Claude Agent SDK. Our default backend at launch.

OpenAI Codex CLI

OpenAI's open-source CLI agent, integrated via a thin adapter. Reasoning levels and supervised mode supported.

T3 Code

Theo's open-source GUI for agentic coding. We layer on top of its session model.

OpenCode

The open-source agent harness. Full integration including its provider abstraction.

Adapter model. Each harness implements a small interface: start session, plan, tool call, diff, finish, pause, resume, inject. The teaching kernel hooks the diff event. This isolates harness churn from our product. These harnesses are independent third-party projects; we are not affiliated with or endorsed by their developers.
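A rough sketch of what such an adapter interface could look like. It assumes plan, tool call, diff and finish are events, and start session, pause, resume and inject are commands. All type and method names here are hypothetical, and `FakeHarness` is an in-memory stand-in rather than any real harness.

```typescript
// Hypothetical sketch of the adapter model; names and signatures are
// assumptions, not a real harness API.
type HarnessEvent =
  | { type: "plan"; steps: string[] }
  | { type: "tool_call"; tool: string }
  | { type: "diff"; path: string; patch: string }
  | { type: "finish" };

interface HarnessAdapter {
  startSession(repo: string): void;
  pause(): void;
  resume(): void;
  inject(message: string): void;
  onEvent(listener: (ev: HarnessEvent) => void): void;
}

// In-memory stand-in used to exercise the hook.
class FakeHarness implements HarnessAdapter {
  paused = false;
  injected: string[] = [];
  private listeners: Array<(ev: HarnessEvent) => void> = [];

  startSession(_repo: string): void {}
  pause(): void { this.paused = true; }
  resume(): void { this.paused = false; }
  inject(message: string): void { this.injected.push(message); }
  onEvent(listener: (ev: HarnessEvent) => void): void { this.listeners.push(listener); }
  emit(ev: HarnessEvent): void { this.listeners.forEach((l) => l(ev)); }
}

// The teaching kernel only listens for diff events; everything else passes through.
function attachTeachingKernel(
  harness: HarnessAdapter,
  shouldGate: (path: string, patch: string) => boolean,
): void {
  harness.onEvent((ev) => {
    if (ev.type === "diff" && shouldGate(ev.path, ev.patch)) {
      harness.pause();
      harness.inject(`Explain the change to ${ev.path} before it is applied.`);
    }
  });
}
```

Because the kernel depends only on the interface, swapping one harness for another would mean writing a new adapter, not touching the teaching logic.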

Get started

See it on your own repo.

Private beta is in flight. Public beta opens in Q3 2026, and general availability lands by early Q4 2026. Drop us your email and we'll write back personally.

Email team@aiworklab.com