The agent works.
You ship.
Standard agent loop: plans, executes, edits, runs commands. No interruptions. Concept exposure is silently logged to your skill graph for later spaced retrieval.
aiworklab wraps the open-source agent harness you already use and overlays a teaching layer that nobody else builds. Here's how every piece works — and how they fit together.
The agent is the same — Claude Code, Codex, T3 Code, OpenCode. Your relationship with it changes. Default is Copilot; the system also auto-suggests modes based on the novelty of the files you're touching.
Standard agent loop: plans, executes, edits, runs commands. No interruptions. Concept exposure is silently logged to your skill graph for later spaced retrieval.
Same speed as Autopilot, with one 15-second comprehension check before applying any non-trivial diff containing concepts you haven't yet demonstrated. Pass and merge.
The agent withholds. Reviews your code, points to bugs, asks Socratic questions. Refuses to fix things for you. Heaviest skill-graph updates, deepest learning per minute.
Not LeetCode topics — a per-user, per-repo graph of programming concepts that have appeared in your real code. Each node has a state machine that's honest with you.
The concept appeared in code you accepted but didn't engage with. Logged silently from Autopilot mode.
You read or were given an inline explanation card when the concept came up.
You passed a comprehension check on it, or you wrote it yourself in Coach mode.
You demonstrated it across multiple spaced retrievals over 30+ days. Mastered concepts never trigger checks.
For agent-authored diffs above a configurable threshold (LOC, files touched, or novelty against your skill graph), the merge button is gated. You write 2–3 sentences explaining what the diff does and why. An LLM judges the explanation against the diff.
Concept advances toward "demonstrated." Merge proceeds. Total cost: ~30 seconds.
You see what the judge thought you missed. You can revise, or escalate to Coach mode for that hunk.
You can always skip. We log it. Your weekly retention report shows the trade-offs honestly — no nags, no shame.
FSRS-based scheduler picks 3–7 concepts due for retrieval each day. Prompts are extracted from code you committed weeks ago — not synthetic exercises. The result: review feels like reviewing your own work.
Modern, open algorithm. Same one Anki and Mochi have moved to. Replaces SM-2.
"Two weeks ago you wrote this query — without looking, why did you choose a window function over a self-join?"
Optional weekly 30–60 min window where the agent is read-only. Solo throughput is your headline metric, tracked over time.
Two weeks ago you wrote a request handler that aborts on disconnect. Without looking — what's the difference between the cancellation token and the abort signal you used?
In user_repo.ts you prepared statements once at module load. What's the failure mode if the connection drops?
Recall: in a Yjs document, what happens when two clients edit the same line offline and reconnect?
An anonymised, aggregated dashboard of skill coverage across your engineering organisation. The metric is concept‐level — not source code. Available on Team and Enterprise tiers.
The gap is the cohort effect of unmetered AI assistance. aiworklab closes it without sacrificing throughput.
We don't rebuild the agent loop. We integrate with the open-source harness you already trust, contribute upstream, and put our work in the layer above it.
Anthropic's agent loop, integrated via the official SDK. Our default backend at launch.
Codex's CLI agent, integrated via thin adapter. Reasoning levels and supervised mode supported.
Theo's open-source GUI for agentic coding. We layer on top of its session model.
The open-source agent harness. Full integration including its provider abstraction.
Private alpha is in flight. Public beta opens Q4 2026. Drop your email and we'll write back personally.