Product Pricing About Careers Blog Talk to us
The product

One workspace. Three modes. A skill graph that's honest with you.

aiworklab wraps the open-source agent harness you already use and overlays a teaching layer that nobody else builds. Here's how every piece works — and how they fit together.

01 · the modes

Pick the relationship you want with your agent. Per task.

The agent is the same — Claude Code, Codex, T3 Code, OpenCode. Your relationship with it changes. Default is Copilot; the system also auto-suggests modes based on the novelty of the files you're touching.

01 / Autopilot

The agent works.
You ship.

Standard agent loop: plans, executes, edits, runs commands. No interruptions. Concept exposure is silently logged to your skill graph for later spaced retrieval.

Default for · production work · infra chores · deadlines · already-mastered concepts
02 / Copilot

One beat of friction at the right moment.

Same speed as Autopilot, with one 15-second comprehension check before applying any non-trivial diff containing concepts you haven't yet demonstrated. Pass and merge.

Default for · real work in unfamiliar code · the bulk of your day
03 / Coach

You write.
It questions.

The agent withholds. Reviews your code, points to bugs, asks Socratic questions. Refuses to fix things for you. Heaviest skill-graph updates, deepest learning per minute.

Default for · onboarding · weekly fly-solo · deload weeks · interview prep
02 · the skill graph

A graph of the concepts you've actually touched.

Not LeetCode topics — a per-user, per-repo graph of programming concepts that have appeared in your real code. Each node has a state machine that's honest with you.

01

Encountered

The concept appeared in code you accepted but didn't engage with. Logged silently from Autopilot mode.

02

Explained

You read or were given an inline explanation card when the concept came up.

03

Demonstrated

You passed a comprehension check on it, or you wrote it yourself in Coach mode.

04

Mastered

You demonstrated it across multiple spaced retrievals over 30+ days. Mastered concepts never trigger checks.

repo · backend-api 12 concepts mastered
seen explained demonstrated mastered
03 · explain-to-merge

The single most important thing aiworklab does.

For agent-authored diffs above a configurable threshold (LOC, files touched, or novelty against your skill graph), the merge button is gated. You write 2–3 sentences explaining what the diff does and why. An LLM judges the explanation against the diff.

PASS

Verdict pass

Concept advances toward "demonstrated." Merge proceeds. Total cost: ~30 seconds.

SOFT

Soft fail

You see what the judge thought you missed. You can revise, or escalate to Coach mode for that hunk.

SKIP

Force-merge

You can always skip. We log it. Your weekly retention report shows the trade-offs honestly — no nags, no shame.

db/migrations.ts · +14 -2 concept · postgres advisory locks
async function runMigration(name: string) {
- await db.query(MIGRATION[name]);
+ const lockId = hashKey(name);
+ await db.query('SELECT pg_advisory_lock($1)', [lockId]);
+ try {
+ await db.query(MIGRATION[name]);
+ } finally {
+ await db.query('SELECT pg_advisory_unlock($1)', [lockId]);
+ }
}
Explain to merge · what does the advisory lock buy us here?
It guarantees only one app instance runs a given migration at a time, even during a rolling deploy. The lock is per-migration-name, so different migrations can still run in parallel. The try/finally is critical — if the migration throws, we still release the lock instead of holding it forever.
verdict pass · concept pg-advisory-locks → demonstrated
04 · spaced retrieval

Review prompts pulled from your own past code.

FSRS-based scheduler picks 3–7 concepts due for retrieval each day. Prompts are extracted from code you committed weeks ago — not synthetic exercises. The result: review feels like reviewing your own work.

01

Free spaced repetition scheduler (FSRS)

Modern, open algorithm. Same one Anki and Mochi have moved to. Replaces SM-2.

02

Code-grounded prompts

"Two weeks ago you wrote this query — without looking, why did you choose a window function over a self-join?"

03

Fly-solo sessions

Optional weekly 30–60 min window where the agent is read-only. Solo throughput is your headline metric, tracked over time.

today · 4 concepts due FSRS
async cancel · interval 14d

Two weeks ago you wrote a request handler that aborts on disconnect. Without looking — what's the difference between the cancellation token and the abort signal you used?

prepared stmts · interval 7d

In user_repo.ts you prepared statements once at module load. What's the failure mode if the connection drops?

CRDT merge · interval 3d

Recall: in a Yjs document, what happens when two clients edit the same line offline and reconnect?

05 · for engineering leaders

The artefact that closes the budget conversation.

An anonymised, aggregated dashboard of skill coverage across your engineering organisation. The metric is concept‐level — not source code. Available on Team and Enterprise tiers.

142
concepts mastered across the org
+18%
solo throughput, quarter over quarter
7
bus-factor warnings (concepts held by ≤ 2)
94%
7-day FSRS retention, team average
cohort comparison

Engineers hired in 2026 vs 2024

live
2024 hires · concept retention at 30d81%
2026 hires · concept retention at 30d52%

The gap is the cohort effect of unmetered AI assistance. aiworklab closes it without sacrificing throughput.

06 · integrations

Standing on the shoulders of giants.

We don't rebuild the agent loop. We integrate with the open-source harness you already trust, contribute upstream, and put our work in the layer above it.

Claude

Claude Code SDK

Anthropic's agent loop, integrated via the official SDK. Our default backend at launch.

Codex

OpenAI Codex CLI

Codex's CLI agent, integrated via thin adapter. Reasoning levels and supervised mode supported.

T3 Code

T3 Code

Theo's open-source GUI for agentic coding. We layer on top of its session model.

OpenCode

OpenCode

The open-source agent harness. Full integration including its provider abstraction.

Adapter model. Each harness implements a small interface — start session, plan, tool call, diff, finish, pause, resume, inject. The teaching kernel hooks the diff event. This isolates harness churn from our product.
Get started

See it on your own repo.

Private alpha is in flight. Public beta opens Q4 2026. Drop your email and we'll write back personally.