From Teaching People the Tool to Building the Tool

An organization that has taught thousands of educators how to use AI knows something rare: where the judgment actually is. The next move is to stop shipping guidance about the tool and start shipping a tool that exercises that judgment with them.

Eli Wood headshot

Eli Wood

June 24, 2026 4 min read
A glowing speech bubble crystallizing into a reusable toolkit object that many hands reach for

The ceiling on guidance

There is a particular kind of organization that has gotten very good at teaching people how to use AI. They've run the workshops, written the frameworks, sat with thousands of practitioners and watched where the confusion is. They know, in a way almost no one else does, exactly where a professional's judgment makes or breaks an AI-assisted task. And they've hit a ceiling — because guidance, however good, asks the practitioner to carry all of it into the moment of work and apply it correctly under time pressure, alone.

That's the limit of content as a product. A great course changes what someone knows; it does not change what happens at 7 p.m. when they're tired and the blank box is staring back. The gap between "I learned the right way to do this" and "I did it the right way this time" is enormous, and no amount of additional curriculum closes it. The frustrating part is that the organization sitting on all that pedagogical insight is usually the one best positioned to close it — and the one least likely to, because shipping content and shipping software feel like different businesses.

The curriculum is a product spec in disguise

The useful frame is to separate the rules engine from the judgment engine — and to notice that good guidance already draws that line. Every piece of solid AI-literacy training is, underneath, a map of where the human's judgment is essential and where the work is mechanical. "Always check the model's sources." "Never let it write the part where you're accountable." "Use it for the first draft, not the final call." Those aren't tips. They're a specification for a tool that does the mechanical part and structures the judgment part. The curriculum is a product spec that hasn't been compiled yet.

Turning guidance into a product means building the thing that embodies that judgment instead of describing it. Not a chatbot that does the practitioner's job, but a tool that does the repetitive setup, then walks the human through the exact decision points the training spent a workshop teaching — surfacing the sources to check, flagging the place where they're accountable, holding them at the call only they should make. The human stays in the loop precisely where being wrong is costly, which is the same place the curriculum already told them to slow down. You start where the judgment is expensive and repetitive — the decision the training covers because every practitioner gets it wrong the same way — and you encode the right move into the workflow.

And because the whole credibility of an AI-literacy organization rests on getting this right, the tool has to be explainable end to end: it shows why it flagged what it flagged, cites what it used, and never hides a judgment call behind a confident answer. That's not a feature; it's the brand. A guidance organization that ships an inscrutable tool contradicts everything it taught. Explainability is how the product earns the same trust the curriculum did — by exercising judgment with the practitioner instead of for them.

A two-day starting point

The trap is to treat the product as a years-long pivot away from the content business. The fix is to take the single most-taught judgment in your curriculum — the one decision every workshop returns to because it's where people reliably go wrong — and build one thin tool that does the mechanical setup and then walks a real practitioner through that decision the way your best instructor would, citing its reasoning and stopping at the human's call.

In two days you can put that in front of someone who took the course and learn the thing content can't tell you: whether the guidance, compiled into a workflow, actually changes what they do at the moment of work. You'll discover which parts of your curriculum are genuinely productizable and which must stay human — and you'll have proof that the leap from teaching the tool to building the tool is one narrow workflow, not a new company. Get it right once and the rest of the curriculum is your roadmap.

Black Flag Design builds applied-AI products that turn hard-won judgment into software practitioners trust. If you've taught people how to use AI and you're ready to build the tool that embodies it, spend two days with us — we call it a Foundation Sprint.

About the author

Eli Wood headshot
Eli Wood

CEO, Black Flag Design

Eli Wood leads Black Flag Design, a creative technology company focused on shipping ambitious digital products, AI systems, and design-forward software with a direct point of view on how technology changes work.

Related stories

More from the journal

Pen-and-ink sketch of a small clockwork robot working at a tool-covered workbench late at night while a human sleeps peacefully on a couch in the background, a wall clock reading 2:00 above
ai April 24, 2026 13 min read

The Agent Stays Up Late, Not Me

Every senior engineer knows the right way to set up a codebase. None of them do it. Here’s the four-stage framework we use — The Ratchet — to take a vibe-coded project all the way to a thing you’d trust in production, and the punchline about why this only just became worth doing.

Most teams have always known they should be running tests, type-checking, security audits, accessibility checks, dead-code analysis, prose linting, and a coverage floor. Most teams run two of those. Here’s why that math has finally inverted, and the four-stage framework we use to ratchet a vibe-coded project to a hardened one.

Keith Pattison

Keith Pattison

Founder, Black Flag Design

Read
Black Flag Journal
claude code April 20, 2026 5 min read

What a Year of Claude Code Trails Tells You About Your Team

Claude Code leaves evidence — sessions, commits, PRs, review notes. Read it like a logbook and you'll find what devs actually need to know before they go deeper.

After a year of shipping with Claude Code across real client work, the signal isn't in any single session — it's in the trails. Here's what those trails told us about where Claude Code shines, where it drifts, and the habits devs should build before they lean in harder.

Eli Wood headshot

Eli Wood

CEO, Black Flag Design

Read
Black Flag Journal
playbook April 20, 2026 6 min read

The Black Flag Playbook: Six Principles for Shipping with AI

Battle-tested principles for teams building real software with AI-generated code. Human judgment, tight scope, and weekly evidence — the disciplines that keep AI-built systems reliable.

The six rules we use to ship production software with AI. Small scope, weekly demos, human-led oversight, and continuous improvement — drawn from six months of real client engagements.

Keith Pattison

Keith Pattison

Founder, Black Flag Design

Read