When the crowd is the underwriter: applied AI for outcome-funding markets

When a crowd funds an outcome that hasn't happened yet, every dollar is a bet on a stranger's future. There's no collateral, no repayment schedule, no balance sheet to underwrite against. The thing being funded is a promise, and the only asset backing it is trust. That makes trust, fraud, and incentive design not features of the platform but the platform itself.

The problem: you're underwriting promises, not assets

Traditional lending has centuries of machinery for pricing risk against something real. Outcome-funding markets have none of that. A crowd is asked to fund a person, a goal, a campaign — and to believe the money will produce what was advertised. The failure modes are specific and they compound.

Fabricated stories attract real money. Funds raised for one purpose drift to another. A small number of bad actors poison the well for everyone, because trust in these markets is shared, not individual: one viral fraud story makes every honest campaign look suspect. And the incentives are quietly misaligned from day one — the people writing the most compelling pitch are not always the people most likely to deliver the outcome, and the platform that takes a cut of every dollar raised has to resist the pull toward volume over legitimacy.

The naive instinct is to throw a model at it: train a classifier on past fraud, score every campaign, auto-approve the clean ones, auto-reject the rest. This fails in a predictable way. Fraud in trust markets is adversarial and non-stationary — the patterns that defined last quarter's bad actors are exactly the patterns this quarter's bad actors have learned to avoid. A model that quietly rejects honest campaigns is not a safety feature; it's an existential threat to a marketplace whose supply is goodwill.

The insight: separate the rules from the judgment

The move that works is to stop treating this as one problem. There are two engines, and conflating them is what gets platforms in trouble.

The rules engine handles the deterministic, auditable checks: identity verification thresholds, payout velocity limits, duplicate-account detection, jurisdictional constraints, hard caps. These should be boring, explicit, and logged. They don't need AI, and you don't want them to be probabilistic — when you decline a payout, you want to point at the exact rule, not a confidence score.

The judgment engine handles the ambiguous calls: does this story hang together, is this funding pattern organic or coordinated, does this campaign's narrative match the funder behavior around it. This is where applied AI earns its keep — surfacing the weak signals a human reviewer would never have time to assemble across thousands of campaigns.

The principle that keeps this honest: human-in-the-loop wherever being wrong is costly. Auto-approving a legitimate campaign is cheap to get wrong — you can catch it downstream. Freezing an honest person's funds, or letting a fraud reach payout, is expensive and sometimes irreversible. So the model's job is not to decide; it's to triage. It ranks, explains, and routes the genuinely ambiguous cases to a human, and it does so with enough of its reasoning attached that the reviewer can act in seconds instead of starting cold.

That explainability is also how you earn trust with the crowd. A platform that can say why a campaign was flagged — and show its work — builds more durable confidence than one that hides behind a black box. In trust markets, the explanation is part of the product.

The path: start where judgment is expensive and repetitive

Don't boil the ocean. The highest-leverage place to apply AI is the work that is simultaneously expensive (a skilled human has to do it), repetitive (it happens hundreds of times a day), and judgment-heavy (rules alone can't settle it). In outcome-funding markets that's almost always campaign review and anomaly triage.

Here's a concrete two-day start. Day one: pull your last few months of reviewed campaigns and split them cleanly into two buckets — decisions a written rule already made (or could have), and decisions that required a human to read, weigh, and judge. The first bucket is your rules engine; codify it and get it out of human hands. The second bucket is the only place AI belongs. Quantify it: how many judgment calls per day, how long each takes, what a wrong call costs in either direction.

Day two: build the thinnest possible judgment assist for the single most expensive review type. Not an auto-decider — a triage layer that scores ambiguity, attaches its reasoning, and routes the hard cases to a person with the context pre-assembled. Measure one thing: did the human reach a confident decision faster, with the costly errors still caught? If yes, you have a foundation. If no, you've spent two days instead of two quarters learning it.

The pattern generalizes, but the discipline is what matters: rules where you can be certain, judgment where you can't, humans on the calls that hurt to get wrong, and explanations attached to everything. Trust isn't a feature you bolt on at the end. In crowd-funded markets, it's the only thing you're really selling.

Black Flag Design builds applied-AI products for markets that run on trust. If this is your world, spend two days with us — we call it a Foundation Sprint.

About the author

Keith Pattison

Founder, Black Flag Design

Keith leads Black Flag Design, a studio that ships production-ready software with AI-assisted development. He writes about the disciplines — small scope, weekly evidence, and human oversight — that keep AI-built systems reliable in the real world.

When the crowd is the underwriter: applied AI for funding outcomes you can't see yet

The problem: you're underwriting promises, not assets

The insight: separate the rules from the judgment

The path: start where judgment is expensive and repetitive

More from the journal

The Agent Stays Up Late, Not Me

What a Year of Claude Code Trails Tells You About Your Team

The Black Flag Playbook: Six Principles for Shipping with AI