A question keeps surfacing in every AI thread here: if AI can generate the implementation, where does human judgment actually live?
I’ve been building a project, Grace Commons, that takes a hard position on it.
Today the authoritative description of how a system behaves is the code. That’s fine until the people accountable for the behavior — clinicians, compliance officers, regulators, designers, execs — can’t read it. The source of truth becomes an artifact most stakeholders are locked out of. That’s a wall.
Grace Commons inverts it. The canonical artifact is a behavioral specification in structured, human-readable language — permissions, approvals, retention, audit, workflows, business behavior. Code, tests, and audit trails are generated from it, not the other way around. Code becomes a build artifact. If the generated code is clever, the spec has failed.
The interesting part isn’t code generation — plenty of tools do that. It’s where authority sits. If AI writes the implementation, the implementation is no longer where judgment lives; judgment lives in the spec humans review, challenge, and approve. And it’s deterministic in a boring, verifiable way: the spec generates the tests and the code; if the code passes the tests, it ships; if not, the AI iterates. You’re not trusting the model — you’re trusting the spec and the check.
We also assume specs are incomplete. Before anything is built, they go through adversarial pressure-testing meant to surface hidden assumptions, governance gaps, and missing decisions — to drag decisions into the open before they disappear into code.
Where this should interest designers specifically: it’s the natural home for design as more than pixels. Every interface sits at the intersection of three models — the business model, the user’s mental model, and the system’s model — and design’s real job is aligning them. That alignment is a contract for human understanding, and it belongs in the spec right next to the behavioral rules. The visual tool becomes a view of that contract, not the source of truth. Buttons don’t fix conceptual mismatch — the spec is where the mismatch gets found and named.
So the division settles out as:
Humans own intent, judgment, and governance.
AI owns translation and execution.
I’d love this room’s read on two things. Is moving authority from code to specification a real shift — or does the implementation inevitably stay the true source of truth no matter how much structure sits above it? And for the designers: if the interface is a contract for human understanding, does authoring that become the design deliverable, ahead of the layout?
Live demo: https://beacon-clinical.fly.dev/
Pattern library and docs: https://gracecommons.dev/