DocumentOperating Framework

ClassificationInternal

Versionv1.0 · 2026.04

OwnerUmbra Studio

Internal Operating Document

The Lighthouse Sprint Operating Framework.

The complete operational playbook for Umbra Studio's core engagement — from discovery through handoff. This document defines every stage, activity, deliverable, quality gate, and governance component of a Lighthouse Sprint.

Read this end-to-end before leading or contributing to a sprint. Every table corresponds to a template in the Lighthouse Kit.

§01

Philosophy & principles.

Seven hard rules · if a sprint violates any, the activity gets redesigned or removed

Every decision in a Lighthouse Sprint is governed by seven operating constraints. These aren't aspirations — they're hard rules. If a sprint activity violates any of them, the activity gets redesigned or removed.

§ 1.1Hard rule

Workflows, not tools.

We rebuild workflows with intelligence at the core. We don't bolt agents onto legacy processes. If the workflow doesn't change, the engagement doesn't start.

§ 1.2Hard rule

Human above the loop.

Humans supervise and make judgment calls. They don't bottleneck. Oversight lives at the governance layer, not inside the execution path.

§ 1.3Hard rule

Production or nothing.

We ship working systems — not prototypes, not proofs of concept, not strategy decks. If it doesn't run in production, it doesn't count.

§ 1.4Hard rule

Measure everything.

Baseline before. Measure after. Every engagement produces quantified before/after data. Opinions don't prove ROI — numbers do.

§ 1.5Hard rule

Governance is not optional.

Every deployed agent ships with monitoring, alerting, audit trail, escalation paths, human override, and rollback capability. No exceptions.

§ 1.6Hard rule

Patterns compound.

Every engagement extracts reusable components into the Pattern Library. The next client benefits from the last one. Studio gets better with every sprint.

§ 1.7Hard rule

Handoff means independence.

The client's team runs the system after we leave. If they can't operate it without us, the handoff failed. Dependency is a design flaw, not a revenue strategy.

§02

Sprint architecture + Watch.

Fixed-scope · time-boxed · four sequential stages · four gates · plus continuous Watch

The Lighthouse Sprint is a fixed-scope, time-boxed engagement with four sequential stages. Each stage ends with a quality gate — a go/no-go decision that must be passed before the next stage begins. After Stage 4, the engagement extends into Lighthouse Watch — the productized continuous service that maintains the system, updates governance, and ships one improvement per month drawn from the pattern library.

The four Sprint stages + the continuation

Stage	Name	Duration	Core output	Gate
01	Discovery	1–2 weeks	Friction map, baseline metrics, go/no-go	Gate 1 · Fit Decision
02	Redesign	1–2 weeks	Agentic workflow blueprint, governance design	Gate 2 · Design Approval
03	Build & Deploy	3–5 weeks	Production agents, governance wrapper, monitoring	Gate 3 · Production Readiness
04	Handoff	1–2 weeks	Training, documentation, outcome measurement	Gate 4 · Independence Verified
+05	Lighthouse Watch Continuous	12+ months · monthly cadence	Monthly health report, the +1 improvement, governance updates, model swap testing	Renewal at 12 mo

After the Sprint: Lighthouse Watch

The Sprint ends at Gate 4 with a working production system. What happens next is the part most consultancies leave unsolved. Foundation models drop every 8–12 weeks; eval suites drift; governance ages; the pattern library sharpens with every subsequent engagement. Without a structural mechanism to absorb that churn, the system you signed off on this quarter starts to look different from the system you've got next quarter — and the people closest to the system aren't always the ones who notice first.

Lighthouse Watch is that mechanism. It is part of the Lighthouse architecture, not an add-on. Every Sprint contract proposes Watch as the default next step in Week 8; about 70% of clients sign Watch · Standard within 30 days of Sprint closeout. The lead engineer who shipped the Sprint is the lead engineer who runs the Watch — no learning-curve gap.

Watch stream	What it covers	Cadence
A · Foundation maintenance	Model swap testing within 7 BD of any major release; eval suite re-runs; prompt-drift detection; provider deprecation tracking	Monthly + on-release
B · Governance & observability	Governance Runbook revisions tied to incidents and policy; acceptance-rate monitoring (target ≥80%); reviewer-UI feedback loop; quarterly architecture review	Monthly + quarterly
C · The +1 improvement Differentiator	One delivered improvement every month: either a new pattern from the library deployed into the client's system, or a workflow optimization addressing a measured friction. Eval-gated before promotion — if it fails its eval, it doesn't ship and we owe it the next month.	Monthly · eval-gated
D · Monthly health report	3–4 page report on the 1st of every month, signed by the lead engineer. Model status, acceptance trends, evals, incidents, the +1 delivered, library updates, next-month preview.	Monthly · 1st of month

§ Why Watch is part of the architecture, not optional

Without a continuous mechanism, the Lighthouse promise erodes. The framework's eight principles (humans above the loop, eval-first deployment, step-level observability, ship-bias) all describe a moving target — the foundation layer churns, the eval set evolves, the governance scope expands. Watch is the structural answer to the question "how does the system stay Lighthouse-aligned in month 6, month 12, month 24?" The Watch sales sheet, operating manual, and contract template live in lighthouse-kit/.

Timeline variants

Variant	Duration	Stages	Use case
Discovery Only	2 weeks	Stage 1 only	Client wants assessment before committing to full sprint
Standard Sprint	6–10 weeks	All four stages	Default engagement — one workflow, end-to-end
Extended Sprint	10–16 weeks	All four stages	Complex workflows, multi-system integrations, regulated industries
LATAM Sprint	6–10 weeks	All four stages	Latin American market — adjusted scope and pricing for regional fit

§ Constraint

One workflow per sprint. If the client has three workflows to redesign, that's three sprints — not one sprint with a bigger scope. Scope creep kills agentic deployments.

§03

Roles.

Six defined roles · three Studio (Forward Deployed Engineers) · three client · no ambiguity at the gates

Every Lighthouse Sprint has six defined roles — three from Studio, three from the client. Role clarity eliminates ambiguity about who owns decisions, who does the work, and who signs off at each gate.

Studio roles · Forward Deployed Engineers

Role	Responsibility	Gate authority
Sprint Lead	Owns the engagement end-to-end. Runs discovery sessions, presents at gates, manages client relationship. Single point of accountability.	Recommends go/no-go at all gates
Agent Engineer	Builds, tests, and deploys the agentic workflows. Owns the technical implementation from Redesign through Handoff.	Signs off on Gate 3
Governance Architect	Designs and implements the governance wrapper — monitoring, alerting, audit trail, escalation, override, rollback. Ensures compliance.	Signs off on governance components at Gate 3

Client roles

Role	Responsibility	Gate authority
Executive Sponsor	Budget authority and organizational air cover. Removes blockers, ensures team access and time. Must attend every gate.	Final go/no-go at Gate 1 and Gate 2
Workflow Owner	The person who currently runs the workflow being redesigned. Domain expert — source of truth for how things actually work.	Validates accuracy at Gate 1; approves design at Gate 2
Technical Counterpart	Client's IT/engineering contact. Handles access, integrations, security review, and production environment requirements.	Signs off on technical readiness at Gate 3

§ Staffing

In early engagements, one person may hold multiple Studio roles (e.g. Sprint Lead + Governance Architect). The roles still exist as distinct accountability structures even if one person fills them.

§ Forward Deployed Engineering — the shape of the role

Across the three Studio seats, what gets delivered is a single, recognizable archetype: an embedded builder who works inside the workflow, a coach who walks the operators through awareness → first win → integration → full transformation → self-sufficiency, and a pattern librarian who extracts what is reusable for the next engagement. In a small Sprint, one person holds all three. In a larger organization, this would be a permanent in-house team. Studio’s posture is to be that team — productized, time-boxed, fractional — for clients who cannot, or should not, build it themselves.

§04

Stage 1 — Discovery.

1–2 weeks · end output: Discovery Report · Gate 1 · Fit Decision

Discovery is the foundation. We map the current workflow end-to-end, stopwatch every step, quantify bottlenecks, and determine whether this workflow is a fit for agentic redesign. If it isn't, we say so — and the engagement ends cleanly at Gate 1.

Day-by-day activities

Day	Activity	Who	Output
Day 1	Kickoff & workflow walkthrough	Sprint Lead + Workflow Owner	Process narrative, initial workflow map
Day 2	Shadow sessions — observe the workflow in real time	Sprint Lead + Agent Engineer	Annotated observations, pain points logged
Day 3	Stopwatch exercise — time every step and handoff	Sprint Lead	Cycle-time data per step
Day 4	Bottleneck analysis & error-rate review	Sprint Lead + Governance Architect	Bottleneck cost analysis, error taxonomy
Day 5	Stakeholder interviews (3–5 people)	Sprint Lead	Qualitative friction points, tribal knowledge documented
Day 6–8	Data synthesis & baseline metrics compilation	Full Studio team	Baseline Metrics Sheet
Day 9–10	Gate 1 preparation & presentation	Sprint Lead	Discovery Report, Gate 1 recommendation

Discovery tools

ToolD-01

Workflow Audit Template.

Structured spreadsheet for capturing every step, actor, system, input, output, time, and decision point in the current workflow.

ToolD-02

Stopwatch Protocol.

Standardized method for timing workflow steps — setup for parallel steps, waiting vs. active time, and handoff delays.

ToolD-03

Bottleneck Scoring Matrix.

Framework for ranking bottlenecks by frequency, cost, and amenability to agentic automation. Produces a prioritized target list.

ToolD-04

Stakeholder Interview Guide.

Semi-structured interview protocol. Captures what the process map misses: workarounds, tribal knowledge, undocumented dependencies.

ToolD-05

Fit Assessment Rubric.

Scoring model for determining if the workflow is suitable for agentic redesign. Evaluates repeatability, data availability, decision complexity, and integration feasibility.

Deliverables

Workflow Audit — Complete map of the current workflow with time, cost, and error data per step.
Baseline Metrics Sheet — Quantified snapshot: cycle time, throughput, error rate, cost-per-unit, human hours per cycle.
Discovery Report — Narrative summary with findings, bottleneck ranking, and initial redesign hypotheses.
Gate 1 Recommendation — Go, no-go, or conditional-go with specific conditions to resolve.

Gate 1 · Fit Decision

Executive Sponsor and Workflow Owner review the Discovery Report and Baseline Metrics.

The Sprint Lead presents the go/no-go recommendation.

Go — The workflow is a fit for agentic redesign. Proceed to Stage 2.
Conditional Go — Proceed, but specific conditions must be met (e.g. data access, system access, organizational commitment).
No-Go — The client receives the Discovery Report and Baseline Metrics as standalone deliverables. Engagement ends cleanly — no hard feelings, no sunk cost pressure.

§05

Stage 2 — Redesign.

1–2 weeks · end output: Redesign Blueprint · Gate 2 · Design Approval

Redesign is where we architect the new workflow. Not optimization of the old one — a fundamentally new workflow where agents carry the operational load and humans supervise from above the loop.

Agent / human allocation framework

Every task in the workflow is classified into one of four categories:

Category	Description	Examples
Fully Autonomous	Agent handles end-to-end. No human in the path. Governance layer monitors.	Data extraction, formatting, routing, scheduling, notifications
Agent-Initiated, Human-Approved	Agent does the work, queues it for human review before execution.	Content generation, financial calculations, customer communications
Human-Led, Agent-Assisted	Human makes the decision; agent provides data, drafts, or recommendations.	Strategy decisions, exception handling, complex negotiations
Human Only	Requires human judgment, creativity, or relationship. No agent role.	Executive decisions, sensitive conversations, creative direction

Deliverables

Redesign Blueprint — The new workflow map: every task, its allocation category, data flows, integration points, and governance touchpoints.
Agent Spec Sheet — Per-agent specification: inputs, outputs, triggers, tools, guardrails, escalation conditions, success metrics.
Governance Design — Monitoring, alerting, and escalation architecture for the redesigned workflow.
Risk Register — Identified risks with probability, impact, and mitigation strategies.

Gate 2 · Design Approval

Executive Sponsor, Workflow Owner, and Technical Counterpart review the Redesign Blueprint and Agent Spec Sheets.

Approved — The design is accepted. Proceed to Build.
Approved with Changes — Specific modifications required — documented, time-boxed, resolved before Build begins.
Rejected — Fundamental design disagreement. Return to Redesign for a new approach, or end engagement. Client keeps all Discovery and Redesign deliverables.

§06

Stage 3 — Build & Deploy.

3–5 weeks · weekly cadence · Friday demos · Gate 3 · Production Readiness

Build is the longest stage. We construct the agents, wire integrations, deploy the governance wrapper, and push to production. Weekly sprint cadence with demos every Friday.

Weekly sprint cadence

Day	Activity	Output
Monday	Sprint planning — scope the week's build targets against the Blueprint	Weekly build plan
Tue–Thu	Build, test, integrate — Agent Engineer and Governance Architect execute	Working increments
Fri AM	Internal review — team tests against acceptance criteria	Test results, bug log
Fri PM	Client demo — show working functionality against baseline metrics	Demo recording, feedback log

5-week build sequence (standard sprint)

Week	Focus	Milestone
Wk 1	Foundation — core agent scaffolding, data pipeline, integration auth	First agent runs end-to-end in staging
Wk 2	Core agents — build primary autonomous and human-approved agents	All primary agents functional in staging
Wk 3	Governance layer — monitoring, alerting, audit trail, escalation paths	Governance wrapper operational
Wk 4	Integration & hardening — production systems, edge cases, load testing	System passes integration tests
Wk 5	Production deploy — supervised operation with Studio monitoring	System live in production with supervision

Deliverables

Production Agents — Deployed, tested, monitored agents running in the client's production environment.
Governance Wrapper — Full monitoring, alerting, audit, escalation, override, and rollback infrastructure.
Integration Layer — All connections to client systems, APIs, and data sources.
Monitoring Dashboard — Real-time visibility into agent performance, error rates, escalation activity.
Test Suite — Automated tests covering agent behavior, edge cases, and governance triggers.

Gate 3 · Production Readiness

Agent Engineer, Governance Architect, and Technical Counterpart jointly review.

All agents pass acceptance criteria against the Redesign Blueprint.
Governance wrapper is operational — monitoring, alerting, escalation, override, rollback all tested.
Integration tests pass — all connections to client systems verified.
Performance meets baseline targets — measurable improvement over Discovery baseline.
Security review complete — Technical Counterpart signs off.

§07

Stage 4 — Handoff.

1–2 weeks · end output: Outcome Report · Gate 4 · Independence Verified

Handoff is where Studio proves the system works and proves the client can run it independently. If either condition fails, Handoff isn't complete.

Handoff activities

Activity	Who	Duration	Output
Operator training sessions	Sprint Lead + Workflow Owner's team	2–3 sessions	Trained operators who can manage day-to-day operations
Supervised operation period	Studio monitors, client operates	3–5 days	Client team runs the system with Studio as safety net
Outcome measurement	Sprint Lead	1–2 days	Before/after metrics comparison against baseline
Pattern extraction	Agent Engineer + Governance Architect	1–2 days	Reusable agent components documented for Pattern Library
Documentation handoff	Full Studio team	1 day	Operations manual, governance runbook, escalation guide
Watch transition If signed	Sprint Lead + Studio Principal	Final day	Lighthouse Watch starts the day after handoff — same lead engineer, no gap. Slack channel transitions, monthly cadence kicks off.

Deliverables

Operations Manual — Step-by-step guide for day-to-day operation of the agentic workflow.
Governance Runbook — How to respond to alerts, escalations, and incidents. Includes override and rollback procedures.
Outcome Report — Quantified before/after comparison: cycle time, throughput, error rate, cost-per-unit, human hours saved.
Pattern Library Contributions — Reusable components extracted and cataloged for future use.
30-Day Support Window — Post-handoff email/Slack support for questions and issues (included in fee). Default next step: transition into Lighthouse Watch — the productized monthly continuation.
Lighthouse Watch proposal — Sent in Sprint Week 8 with sales sheet attached. Default offer is Watch · Standard ($24k/mo, 12-month term) starting the day after handoff. About 70% of Sprint clients sign within 30 days. View sales sheet →

Gate 4 · Independence Verified

The final gate verifies the client can run the system without Studio involvement.

Supervised operation completed — client team operated for 3–5 days with no critical issues.
Operators trained and certified — handle normal operations, escalations, and basic troubleshooting.
Outcome metrics documented — measurable improvement over baseline, signed off by Executive Sponsor.
All documentation delivered — operations manual, governance runbook, escalation guide reviewed.
Pattern contributions cataloged — reusable components extracted and documented for Studio's Pattern Library.

§08

The governance wrapper.

Six components · all required · all production-grade · no lightweight option

The Governance Wrapper is not an add-on — it's a core component shipped with every Lighthouse Sprint. Six components, all required, all production-grade.

Component 01Required

Monitoring.

Real-time visibility into agent activity: invocations, latency, success/failure rates, resource consumption. Feeds the Stage 3 dashboard.

Component 02Required

Alerting.

Configurable alerts for anomalies: error rate spikes, latency degradation, unexpected behavior, throughput drops. Routed via escalation path.

Component 03Required

Audit Trail.

Every agent decision logged: inputs, outputs, reasoning, timestamps, data accessed. Immutable. Queryable. Required for compliance-sensitive industries.

Component 04Required

Escalation Paths.

Defined routes from agent to human for decisions that exceed the agent's authority or confidence. Includes time-based auto-escalation.

Component 05Required

Human Override.

Any agent action can be overridden by an authorized human at any time. Overrides logged in the audit trail. Designed for human supremacy, not agent autonomy.

Component 06Required

Rollback.

Every deployment includes rollback capability to the previous known-good state. Triggered automatically (via thresholds) or manually (via override).

§ Non-negotiable

All six components are required for every engagement. There is no “lightweight” governance option. If a client's workflow doesn't warrant this level of infrastructure, the workflow probably doesn't warrant agentic redesign.

§ How the wrapper stays current

The Governance Wrapper at Sprint handoff is v1. Every incident, policy change, and regulatory ask after handoff is a candidate runbook update. Lighthouse Watch is how those updates ship — Stream B (Governance & observability) covers Runbook revisions, acceptance-rate monitoring against the ≥80% target, and the reviewer-UI feedback loop. Without Watch, the wrapper at month 12 is the same wrapper that shipped at month 0 — which is the failure mode this section exists to prevent.

§09

The pattern library.

Studio's compounding asset · 22 patterns from Umbra Group properties · 18 production-tested

The Pattern Library is Studio's compounding asset. Every engagement extracts reusable agent components — patterns that can be adapted and deployed in future sprints. The library currently holds 18 patterns across four Umbra Group properties: 14 production-tested from Indietheka (each with a traceable incident receipt), plus 4 from Radiohead Community, Transit, and Orbit at various stages of development.

Catalog format

Field	Description
Pattern ID	Unique identifier (USP-XXX format)
Name	Descriptive name
Category	Content Ops, Quality, Infrastructure, Strategy, Distribution, Monitoring, etc.
Source	Engagement or property where the pattern was first extracted
Description	What the pattern does and when to use it
Incident Receipt	The specific production incident (with post ID and date) that created the rule. Proves the pattern is battle-tested, not theoretical
Inputs / Outputs	Data the pattern consumes and produces
Dependencies	Systems, APIs, or other patterns required
Governance	Monitoring, alerting, and escalation needs
Adaptation Notes	How to modify the pattern for different client contexts

Indietheka patterns (USP-001–014) · all production-tested

14 patterns extracted from the first Lighthouse Sprint. The original 4 were designed during the sprint; the remaining 10 emerged from production incidents and were codified as reusable rules. Full detail with incident receipts in §12.8.

ID	Name	Category	Status
USP-001	RSS-to-CMS Ingestion Pipeline 3-layer content enrichment cascade, automated SEO self-check with auto-fix, deduplication lock via spreadsheet column.	Content Ops	Production
USP-002	AI-Assisted Editorial with Voice Preservation Prohibited-word enforcement, verb-tense rules, heading guidelines, 300–420 word hard limit. Maintains brand voice at scale.	Content Ops	Production
USP-003	Multi-Channel Distribution Pipeline 6-template image generation with A/B rotation, CDN-compatible hosting, one-post-per-run pacing for natural feed spacing.	Content Ops	Production
USP-004	Queue-Driven Pipeline with Approval Gates Spreadsheet-as-state-machine with API-backed writes. Thursday→Friday discovery-to-review cycle. Shared client libraries for queue state.	Content Ops	Production
USP-005	Pre-Publish Verification Protocol 9 check groups covering every editorial accuracy dimension. Silent on success, blocks on failure. Each rule traceable to a specific incident.	Quality	Production
USP-006	Cover Art Verification Pipeline Fetches artwork via structured identifiers, never freeform search. Visual verification before rendering. Born from wrong-cover incident.	Quality	Production
USP-007	Chunked Upload System Transfers large HTML articles to CMS by encoding, splitting into chunks, reassembling browser-side, and uploading with integrity checks.	Infrastructure	Production
USP-008	Two-Step SEO Metadata Update Workaround for SEO plugins that don’t refresh metadata in a single API call. Requires a second call with only the metadata payload.	Infrastructure	Production
USP-009	API Endpoint for Spreadsheet Writes Uses Google Sheets as a database via API endpoint instead of browser automation. Solves cell-drift and undo-cascade bugs.	Infrastructure	Production
USP-010	Featured Image ≠ Carousel Cover Never reuse vertical social-media carousel art as horizontal website featured image. Generate separate assets per platform.	Content	Production
USP-011	Temporal Claims Verification Verify “first in N years” and “comeback” claims against publication date. Special rules for multi-weekend events.	Quality	Production
USP-012	Multi-Template Carousel System Two contrasting visual templates that alternate in the feed. Prevents visual monotony in high-volume social publishing.	Content	Production
USP-013	Editorial Feedback Loop Weekly automation that analyzes performance, writes a content plan, and appends to cumulative strategic memory. Closed learning loop.	Strategy	Production
USP-014	Page Builder Safety Never update page-builder pages via standard CMS API — use the builder’s own API or custom endpoints to preserve layout metadata.	Infrastructure	Production

Cross-property patterns (USP-015–018)

Patterns from other Umbra Group properties at various stages of development. These will be enriched with incident receipts as each property’s agents mature in production.

ID	Name	Category	Source	Status
USP-015	Community Content Scheduler Content-calendar management across platforms. Caption generation, hashtag optimization, best-time-to-post scheduling.	Distribution	Radiohead Community	Production
USP-016	Automated Press Kit Generator Generates complete electronic press kits from artist metadata, press coverage, and media assets. Formats for email, web, and download.	Content	Transit	Development
USP-017	Media Outlet Matching & Outreach Matches artist profiles to relevant outlets and journalists based on genre, coverage history, and audience overlap.	Distribution	Transit	Development
USP-018	Fan Engagement Signal Monitor Monitors community engagement signals and surfaces actionable insights. Anomaly detection for viral moments.	Monitoring	Orbit	Design

Substrate & orchestration patterns (USP-019–022) · Indietheka, post-Sprint

Four patterns extracted from migrating Indietheka's 16-agent fleet to durable infrastructure (Q1 2026) and building Atlas — the studio-level orchestrator — on top (Q2 2026). All four are in production at Indietheka and packaged as @umbra/* for cross-property reuse.

ID	Name	Category	Source	Status
USP-019	Durable Workflow Publisher Idempotent, retryable, multi-step publisher with saga-style recovery. Migrated three publishing agents off Cowork sessions to a single shared runtime.	Infrastructure	Indietheka	Production
USP-020	Multi-Channel Saga Fanout Distributes a single payload across N independent channels with per-channel saga and partial-success gate. Used by Distribution and Weekly Recap.	Infrastructure	Indietheka	Production
USP-021	Scheduled Audit with HITL Site-wide scheduled audit that emits dormant findings; activated only when a human approves them in the operational dashboard. Keystone for Comprehensive Site Audit.	Quality	Indietheka	Production
USP-022	Cron Discover-Rank-Publish Periodic discovery → ranking → publish pipeline against a third-party API. Early-abort budgeting, error classification, env-driven page sizing. Spotify v1.	Content Ops	Indietheka	Production

How patterns flow back to existing clients: Lighthouse Watch

A pattern library that only ships with new Sprints is a static asset. Watch is what makes it compounding. Every Watch month, the lead engineer reviews the most recent library additions and proposes one as the next month's +1 improvement — deployed into the client's existing system, eval-gated before promotion. After 12 months on Watch, a client's system has accumulated 12 deployed pattern updates that they didn't commission individually.

The reverse channel matters too: every +1 delivered surfaces a candidate pattern back into the library. Three matching +1's across clients = the pattern moves from candidate to canonical. Watch is therefore both a distribution channel (library → client) and a research instrument (client signal → library) — the mechanism that keeps engagement #N priced below engagement #1 because the pattern shelf is deeper.

§ Why Watch is structural, not commercial

The library-as-asset thesis demands a continuous channel. Without Watch, every client's system is frozen at the version of the library shipped on day one. The library improves; the client's system doesn't. That gap is the failure mode the framework exists to prevent — and Watch is the structural fix. Operating manual: lighthouse-watch-operating-manual.html.

§10

Engagement & pricing.

Fixed-fee · outcome-tied · US and LATAM tracks

Studio engagements are fixed-fee and outcome-tied. The client knows the cost upfront. If the measurable outcome doesn't beat the baseline, the pricing structure reflects that.

Sprint tiers

Tier	Duration	Scope	US pricing	LATAM pricing
Discovery Only	2 weeks	Stage 1 only — assessment and baseline	$15–30K	$2–5K
Standard Sprint	6–10 weeks	All four stages — one workflow end-to-end	$75–200K	$8–25K
Extended Sprint	10–16 weeks	Complex workflows, regulated industries	Scoped per engagement	Scoped per engagement

Lighthouse Watch tiers · the continuous service

After the Sprint ships, the system extends into Lighthouse Watch: monthly cadence, three tiers, 12-month minimum on Standard and Pro. Default offer at Sprint closeout is Watch · Standard. About 70% of Sprint clients sign within 30 days. Sales sheet · contract template.

Watch tier	Term	Includes	US pricing	LATAM pricing
Watch · Essentials	6-month minimum	Foundation maintenance + governance + monthly health report — no +1 improvement	$14K / mo	$2–3K / mo
Watch · Standard Default	12-month minimum	Essentials + the +1 improvement (eval-gated, monthly) + quarterly architecture review	$24K / mo	$3–5K / mo
Watch · Pro	12-month minimum	Standard + named-engineer commitment + 4 hrs/mo advisory + 4-hr SLA on P0 + 8-hr emergency bank/yr	$38K / mo	$5–7K / mo
Sprint + Watch bundle	Sprint + 12 mo Watch · non-cancellable first 6 mo	Standard Sprint at the entry $75K band + 12 months of Watch · Standard, single PO. Saves $66K vs. separate purchase.	$290K flat	$32–38K flat

Pricing philosophy

01Fixed fee

Not hourly.

The client pays for the outcome, not for time. Studio absorbs efficiency risk.

02Tied to outcome

Outcome-tied component.

A portion of the fee is tied to beating the baseline metrics established in Discovery. Structure TBD per engagement.

03Off-ramp

Discovery as off-ramp.

Discovery Only exists so clients can assess fit without committing to the full sprint. Standalone value.

04Regional fit

LATAM reflects market reality.

Adjusted for purchasing power, not discounted as a favor. The engagement quality is identical.

Beyond Sprint and Watch

Additional Sprints — Each new workflow is a new sprint. Returning clients benefit from accumulated pattern library and institutional knowledge. Multi-system Watch arrangements multiply at 1.7× per additional system.
Pattern Licensing — Clients who want to build internally without a Sprint can license patterns from the library for self-directed implementation. Terms TBD; typically used by clients whose data foundation is broken (a disqualifier for the Sprint itself).
Beacon Workshop — Two-day intensive ($18K flat) for leadership teams who want to apply the Lighthouse framework to their own workflows before deciding on a Sprint. About 40% of Workshop clients move to a Sprint within 90 days.

§11

Templates & artifacts.

Standardized deliverables · consistency across engagements · reduce Sprint Lead prep time

Every Lighthouse Sprint produces standardized deliverables using these templates. Templates ensure consistency across engagements and reduce the Sprint Lead's prep time.

Template index

#	Template	Stage	Priority	Status
01	Workflow Audit	Discovery	P0 · Critical	Ready
02	Baseline Metrics Sheet	Discovery	P0 · Critical	Ready
03	Stakeholder Interview Guide	Discovery	P1 · High	To Build
04	Fit Assessment Rubric	Discovery	P1 · High	To Build
05	Discovery Report	Discovery	P1 · High	To Build
06	Redesign Blueprint	Redesign	P0 · Critical	Ready
07	Agent Spec Sheet	Redesign	P0 · Critical	Ready
08	Risk Register	Redesign	P1 · High	To Build
09	Weekly Build Plan	Build	P2 · Medium	To Build
10	Operations Manual	Handoff	P0 · Critical	Ready
11	Governance Runbook	Handoff	P0 · Critical	Ready
12	Outcome Report	Handoff	P1 · High	To Build

§ Build priority

P0 templates (Workflow Audit, Baseline Metrics, Redesign Blueprint, Agent Spec Sheet, Operations Manual, Governance Runbook) must be built before the first client engagement. P1 templates should follow immediately. P2 templates can be developed during the first sprint.

§12

Worked Example — Indietheka.

First Lighthouse Sprint · digital media · Sprint Q4 2025 · 6 months in production · 26 durable workflows running 24/7 · 8 patterns externalized · Atlas orchestrating

The Indietheka editorial pipeline was the first Lighthouse Sprint — applied internally to Umbra Group's own newsroom in Q4 2025. This section shows how the framework's templates were applied to a real engagement — a concrete reference for future sprints. Industry: digital media. Workflow: editorial content pipeline. Duration: 8 weeks. Sprint outcome: 7× throughput, 90% fewer human hours, 16 agents, 14 reusable patterns. Six-month update: the fleet has moved off Cowork sessions onto durable infrastructure — 26 workflows running 24/7, a Slack feedback loop for post-publish corrections, an operational dashboard, Atlas running as a studio-level orchestrator above the fleet, and eight patterns externalized as @umbra/* packages. Zero missed publish days across the cutover. It is also the engagement profile the major foundation labs began publicly hiring for in Mexico City in May 2026 — Studio has been running it since Q4 2025.

§ Context

Indietheka is a Spanish-language indie music publication and Studio's LATAM Forward Deployed Engineering flagship. One editor, manual end-to-end pipeline — sourcing, writing, formatting, publishing, distributing, and playlist curation. The entire workflow was human-bottlenecked at every step, but only 15% of the work required genuine editorial judgment.

12.1 · Workflow Audit (Discovery, Week 1–2)

The Workflow Audit captured 12 discrete steps in the editorial pipeline. Key findings from the stopwatch exercise:

#	Activity	Actor	Active time	Bottleneck	Agent candidate
01	Browse 8+ music outlets for stories	Editor	45 min/day	5	Yes
02	Identify coverage gaps vs. competitors	Editor	30 min/day	4	Yes
03	Research album/artist for review	Editor	60 min	3	Partial
04	Listen to album	Editor	45 min	1	No
05	Write review in Spanish	Editor	90 min	2	Partial
06	Source and crop featured image	Editor	15 min	4	Yes
07	Format and publish to WordPress	Editor	20 min	5	Yes
08	Reformat for Facebook	Editor	10 min	5	Yes
09	Reformat for Instagram	Editor	12 min	5	Yes
10	Post to Lnk.Bio + other channels	Editor	8 min	5	Yes
11	Curate weekly Spotify playlists	Editor	4 hrs/week	4	Partial
12	Sync playlists to Spotify	Editor	30 min/week	5	Yes

§ Key finding

85% of editorial time was spent on mechanizable tasks. Only steps 4 (listening) and 5 (writing voice/angle decisions) required genuine editorial judgment.

12.2 · Baseline Metrics (Discovery, Week 2)

Metric	Baseline value	Unit	Measurement method
End-to-End Cycle Time (per article)	210	minutes	Stopwatch log avg, 1 representative week
Weekly Throughput	2–3	articles/week	WordPress publish log, 4-week average
Error Rate	~8%	%	Formatting errors, missed platforms, broken images
Human Hours per Cycle	3.5	hours/article	Stopwatch active time end-to-end
Cost per Article	~$105	$/article	Estimated at $30/hr editorial rate
Platform Distribution	1–2	platforms/publish	Manual — some platforms frequently skipped
Playlist Curation Time	4.5	hours/week	Stopwatch — selection + sync combined
FTEs Involved	1	person	Solo operator

12.3 · Redesign Blueprint (Redesign, Week 3–4)

The initial redesign classified all 12 workflow steps into four allocation categories. As the system matured in production, four additional agents emerged — expanding the fleet from 6 to 16 and pushing automation coverage across the full editorial lifecycle: RSS → discovery → writing → SEO → publishing → distribution → analytics → calendar planning.

Original 6 agents/automations — designed in Week 3–4 and deployed by Week 7:

ID	Agent	Classification	What it does
AGT-01	Continuous Source Monitor	Autonomous	Watches a defined set of external sources on a fixed cadence, deduplicates against persistent history, and queues only new material for downstream handling. In Indietheka: 15 music-press RSS feeds polled every 40 minutes; new stories written to a shared Google Sheet via an API endpoint
AGT-02	Durable Production Publisher	Autonomous	Enriches, rewrites, publishes, and self-checks production content end-to-end via a multi-layer cascade, with per-field auto-fix on failure. The durable layer that ships work to production without manual handoff. In Indietheka: hourly publishing to WordPress with a 3-layer cascade (source scraping → Spotify embed lookup → AI-assisted rewrite); 4 mandatory SEO fields self-checked and auto-fixed
AGT-03	Long-Form Editorial Pipeline	Human-approved	Picks the next approved long-form subject from a queue, locks the row via an API endpoint to prevent re-processing, runs the full research → write → fetch supporting media → publish flow, writes results back. The queue-gated heavy editorial agent. In Indietheka: weekly album reviews from a curated queue, Friday 10am, with cover art fetched via Spotify/MusicBrainz
AGT-04	Multi-Channel Distribution	Autonomous	Generates branded assets from rotating templates and fans out a single payload across N independent channels simultaneously, with per-channel formatting and saga-style failure recovery. One asset per run for natural feed pacing. In Indietheka: 6 rotating image templates posted to Instagram, Facebook, and Lnk.Bio, hourly
AUT-05	Recurring Catalog Sync + Digest	Human-approved	Syncs a curated catalog to an external platform on a recurring cadence, then auto-generates a companion digest asset that surfaces the top entries for social distribution. In Indietheka: weekly Spotify playlist sync via browser automation (API rate-limits prevent direct calls) followed by a Top-10 Instagram carousel
AUT-06	Periodic Recap Generator	Autonomous	Pulls the most relevant items from the period’s output and assembles a visual recap — cover slide, top items, call-to-action — for social distribution. Fires only above a volume threshold. In Indietheka: a 7-slide Instagram carousel each Sunday 10am, contingent on at least 3 published posts that week

10 agents added in production — each born from gaps discovered after the original 6 went live:

ID	Agent	Classification	What it does
AGT-07	Forward-Window Discovery	Autonomous	Scans upstream sources for inbound candidates inside a forward time window, ranks them, writes the top N to a review queue. The discovery half of a queue-gated pipeline. In Indietheka: ~964 followed artists on Spotify plus 5 indie-press outlets scanned each Thursday for the 7-day forward window; 10 ranked candidates per run; pairs 1:1 with Long-Form Editorial Pipeline
AGT-08	Data-Driven Strategy Proposals	Autonomous	Generates ranked content ideas oriented by what’s actually performing — analytics-informed editorial planning, not gut feel. Each proposal carries layout templates, targets, and an outline. In Indietheka: 2 daily editorial list ideas with carousel layouts, SEO targets, and article outlines, oriented by the weekly performance dashboard’s analytics insights
AGT-09	Page-Level Quality Auditor	Autonomous	Audits production pages field-by-field, auto-fixes what is deterministic, flags what needs editorial judgment. The non-judgmental layer of quality. In Indietheka: 5–10 pages/day with title tags, meta descriptions, and internal links auto-corrected; ambiguous cases queued for the editor
AGT-10	Weekly Planning Loop	Autonomous	Closes the operating period, analyzes performance, programs the next period’s plan, and appends to a cumulative memory file the rest of the system reads from. The feedback loop that makes the system learn. In Indietheka: editorial calendar programmed every Sunday 8pm with draft IDs and SEO targets, appended to a strategic memory file that never gets overwritten; outputs feed Data-Driven Strategy Proposals and Daily Competitive Brief the following week
AGT-11	Daily Competitive Brief	Autonomous	Scans a watch-list of competitor sources daily, cross-references against the operator’s own coverage to find gaps, and writes the highest-value opportunities directly to production as drafts — no approval gate. In Indietheka: 8 indie-music outlets (Pitchfork, Stereogum, NME, Brooklyn Vegan, Consequence, Paste, Line of Best Fit, Rolling Stone) scanned each morning at 9am; gap-coverage drafts written to WordPress
AGT-12	Structured-Metadata Enrichment	Autonomous	Classifies content by type, generates the corresponding structured-data payload, and PATCHes it into the production system’s metadata layer. Powers AI-search visibility and rich-result eligibility. In Indietheka: posts classified into Article / Review / NewsArticle / MusicEvent and enriched with JSON-LD every Monday; 119 posts on the day-1 deployment with zero failures; archive backfill ongoing
AGT-13	Eval Runner	Autonomous	Gates every production publish against a deterministic rules suite — required fields, structural integrity, taxonomy, schema validity. Logs every run; the log is the source of truth the next quality layer reads from. Fires reactively on publish hooks. In Indietheka: every WordPress publish validated against `evals/rules.json` for Yoast compliance, featured image, internal-link minimums, schema validity, slug structure, and tag taxonomy; runs logged to `eval_log.jsonl`
AGT-14	Quality Monitor	Autonomous	Audits the entire agent ecosystem itself. Cross-checks runtime against spec, detects error patterns not yet codified, ranks findings by impact, and proposes new rules and memories — never writes, only proposes. The meta-quality layer that keeps the rest of the system honest. In Indietheka: weekly Sunday-night scan (Sundays 7pm, before the performance dashboard runs) across session transcripts of the last 14 days, eval log, distribution log, MEMORY.md, and the agent repo; outputs a Quality Report
AGT-15	Weekly Performance Dashboard	Autonomous	Pulls multi-source analytics for the closing period, generates a multi-component dashboard with cross-source anomaly detection, and executes a bounded set of autonomous metadata fixes per run. In Indietheka: GA4 + Search Console pulled each Sunday 6pm; 11-component dashboard (bounce-by-page, keyword opportunity matrix, GSC↔GA4 anomalies, position 1–5 zero-click, action plan, autonomous SEO fixes, 7-day cohort retention, internal site search, path analysis, schema CTR comparison); HTML report written and rows appended to historical Sheets; 4–6 metadata PATCHes per run executed without further approval
AGT-16	Comprehensive Site Audit	Autonomous	Site-wide multi-dimension audit on a fixed cadence. Produces a scored Health Report with an explicit owner per finding — scorer, never fixer. Routes work to the right executor agent or to the human. In Indietheka: monthly run on the 1st at 9am plus on-demand; 8 weighted dimensions (Technical, Content/E-E-A-T, On-Page/SXO, Schema, Performance/CWV, AI/GEO, Sitemap, Images); findings routed to AGT-08 (titles/metadescs), AGT-11 (schema), or manual workstreams

§ Governance Model — Spreadsheet-Based Approval Gates

The original four classification categories (Fully Autonomous / Agent-Initiated Human-Approved / Human-Led Agent-Assisted / Human Only) are correct in spirit but proved too coarse. The production system uses column-based approval gates in Google Sheets backed by an API endpoint as the state machine:

Publishing gate — Editor marks a row as approved → agent processes and publishes → marks the row as complete (preventing re-processing).
Review queue gate — Forward-Window Discovery writes proposed albums → editor approves individual picks → Friday pipeline picks them up automatically.
Data-Driven Strategy Proposals gate — proposals generated daily; editor selects one → single binary question (“publish now or schedule?”) → full pipeline executes without further interruption.

This “spreadsheet-as-state-machine with API-backed writes” pattern replaced an earlier approach using browser UI automation that caused cell-drift and undo-cascade bugs. It is a reusable pattern (see USP-009 in §12.8).

§ Allocation

16 agents/automations covering the full editorial lifecycle. 8 fully autonomous, 2 human-approved. The original 12 workflow steps are now augmented by 4 new capabilities (discovery, proposals, technical SEO, feedback loop) that didn’t exist in the manual process. Automation rate: 92%+ of total editorial throughput — human involvement reduced to album listening, voice/angle decisions, and single-column approval gates.

12.4 · Agent Inventory (Production)

Sixteen agents/automations in production, evolved iteratively from the original 6. Each agent’s design was informed by incidents from previous agents — Long-Form Editorial Pipeline’ endpoint-only writes came from Multi-Channel Distribution’s cell-drift bugs; the Pre-Publish Verification Protocol grew organically from 0 to 9 check groups as errors were caught and codified.

ID	Name	Classification	Schedule	Integration points
AGT-01	Continuous Source Monitor	Autonomous	Every 40 min	15 RSS feeds, persistent deduplication log, API endpoint writes to Google Sheets
AGT-02	Durable Production Publisher	Autonomous	Every hour	Google Sheets (approval gate), OpenAI API (rewrite), Spotify (embed lookup), WordPress, Yoast SEO (4-field self-check with auto-fix)
AGT-03	Long-Form Editorial Pipeline	Human-approved	Friday 10am	Review queue spreadsheet (approval gate), API endpoint for queue state, Claude API (review generation), Album of the Year (cover art), WordPress, Yoast SEO (two-step metadata update)
AGT-04	Multi-Channel Distribution	Autonomous	Every hour (1 post/run)	WordPress, image generator (6 templates, A/B rotation), Meta Graph API (Instagram + Facebook), Lnk.Bio, external image hosting, persistent distribution log
AUT-05	Recurring Catalog Sync + Digest	Human-approved	Sunday	Spotify Web Player (browser automation — not API), Meta Graph API (Top 10 carousel), carousel image generator, Lnk.Bio
AUT-06	Periodic Recap Generator	Autonomous	Sunday 10am	WordPress, recap image generator (canvas rendering), Meta Graph API, Lnk.Bio, external image hosting
AGT-07	Forward-Window Discovery	Autonomous	Thursday 10am	Spotify followed-artists + album data, 5 indie-press feeds, review queue spreadsheet (API endpoint), weighted scoring algorithm
AGT-08	Data-Driven Strategy Proposals	Autonomous	Daily 3pm	Google Analytics + Search Console insights, WordPress (existing content check), SEO keyword research
AGT-09	Page-Level Quality Auditor	Autonomous	Daily 10am	WordPress, Yoast SEO, Google Search Console data
AGT-10	Weekly Planning Loop	Autonomous	Sunday 8pm	Google Analytics, Google Search Console, WordPress (post counts), persistent weekly plan + cumulative strategic memory files. Feeds insights into Data-Driven Strategy Proposals and Page-Level Quality Auditor the following week
AGT-11	Daily Competitive Brief	Autonomous	Daily 9am	8 competitor RSS/sitemap feeds, Search Console (own-coverage check), WordPress (draft creation), no approval gate
AGT-12	Structured-Metadata Enrichment	Autonomous	Monday weekly	WordPress REST (PATCH meta), JSON-LD generator, content classifier (Article / Review / NewsArticle / MusicEvent). 119 posts enriched day one with zero failures
AGT-13	Eval Runner	Autonomous	Reactive, on every publish	WordPress REST (last 50 posts), `evals/rules.json`, `eval_log.jsonl` (rolling 30-day window). Failures gate the publish; passes append to log for downstream agents
AGT-14	Quality Monitor	Autonomous	Sunday 7pm	Cowork session transcripts (read-only, 14d window), `eval_log.jsonl`, distribution log, MEMORY.md + linked feedback files, agent repo, WP REST. Outputs `quality-reports/YYYY-MM-DD.md`
AGT-15	Weekly Performance Dashboard	Autonomous	Sunday 6pm	GA4 (property 488842515), Search Console export, WordPress REST (Bloque-A PATCHes 4–6 posts/run), historical Google Sheets (GA4 + GSC tabs), HTML output
AGT-16	Comprehensive Site Audit	Autonomous	Monthly · 1st 9am	Yoast sitemap index, WordPress REST, PageSpeed Insights API, Search Console (manual export → service-account when wired), previous month’s audit (delta diff). Writes `seo-audits/YYYY-MM-DD/audit-report.md` with raw evidence preserved

Supporting production infrastructure — not agents themselves, but shared systems the agents depend on:

QualityLive

Pre-Publish Verification Protocol.

9 check groups covering every dimension of editorial accuracy. Fires on any action where the AI authors content. Covers: lineup/date verification, album data, cover art, artist photos vs album covers, news structure, verb tenses, tags/categories, link integrity, and festival-day validation. Silent on success, blocks on failure.

ContentLive

6 Cornerstone Article Templates.

Guía Interactiva, Editorial Retrospectiva, Ranked List, Artist Monograph, Cornerstone Mega-Guide, Must-See Grid. Each has its own accent color, background style, and component library.

ContentLive

2 Instagram Carousel Templates.

Two deliberately contrasting visual styles — one for album-focused lists (black backgrounds, album art, pink accents) and one for artist-focused features (full-bleed desaturated concert photos). They alternate in the feed for visual diversity.

InfrastructureLive

Chunked Upload System.

Transfers large HTML articles from the sandboxed environment to WordPress by encoding them, splitting into manageable chunks, and reassembling them browser-side before uploading. Solves the constraint that the sandbox can’t reach the WordPress API directly.

InfrastructureLive

GitHub Actions Migration.

Moving agents from local scheduled tasks to cloud-based GitHub Actions. Continuous Source Monitor is live (runs every 40 min in the cloud). Remaining agents pending migration. Architecture: one folder per agent, one workflow per agent, state files committed back to the repository.

InfrastructureLive

Cover Art Verification Pipeline.

Fetches album artwork using structured identifiers from Spotify or MusicBrainz — never freeform search. Every image is visually verified before rendering to prevent wrong-album art from reaching production. Born from a real incident where the wrong cover was published.

12.5 · Build Sequence (Iterative, Incident-Driven)

The original plan called for a linear 3-week deployment (AGT-01/02 in Week 5, AGT-03/04 in Week 6, AGT-05/06 in Week 7). In practice, the system evolved iteratively — each agent’s design was shaped by production incidents from its predecessors. The ten additional agents (AGT-07 through AGT-16) were not in the initial scope; they emerged from gaps revealed only after the core pipeline was running live.

§ Evolution pattern

Multi-Channel Distribution’s cell-drift bugs (Chrome UI writes) → Apps Script endpoint pattern (USP-009) → Long-Form Editorial Pipeline adopted endpoint-only writes from day one. Post 13065’s wrong cover art → Cover Art Verification Pipeline (USP-006). Each incident left a codified rule in the Pre-Publish Verification Protocol, growing it from 0 to 9 check groups over 2 weeks. The build sequence was not a plan — it was a trail left by incidents.

12.6 · Measured Outcomes (Production, Week 8+)

Baseline vs. current production comparison. Initial metrics were captured at Week 8 handoff (6 agents); current figures reflect the mature 16-agent system with SEO, discovery, proposals, and the weekly feedback loop operational.

Metric	Baseline	Post-sprint (Wk 8)	Current (16 agents)	Delta
Weekly Throughput	2–3 articles	12–15 articles	15–20 articles	+7×
Human Hours per Article	3.5 hrs	~1 hr	~20 min	−90%
Platform Distribution	1–2 platforms	6 platforms	6 platforms	+4 platforms
Error Rate	~8%	<2%	<1%	−88%
Agent/Automation Count	0	6	16	+16 agents
Agent Invocations	0	160+/week	1,000+/week	—
SEO Coverage	0 pages/day	—	5–10 pages/day audited	New capability
Album Discovery	Manual browsing	—	10 ranked candidates/week	New capability
Content Strategy	Ad hoc	—	2 data-driven proposals/day	New capability
Editorial Feedback Loop	None	—	Weekly closed-loop (GA4 + GSC → calendar)	New capability
Playlist Curation Time	4.5 hrs/week	30 min/week	30 min/week (review)	−89%
Quality Checks	0	—	9 check groups, every publish	New capability
Reusable Patterns	0	4	14	+14 documented
FTEs Required	1 (overloaded)	1 (with capacity)	1 (strategic only)	Same headcount, 7× output

+7×

Weekly throughput

−90%

Human hours per article

−88%

Error rate

Agents in production

Reusable patterns

−89%

Playlist curation time

12.7 · Governance in Production

MonitoringLive

Agent Health Dashboard.

Tracks success rate, latency, error count, queue depth for all 16 agents. RSS poller checks every 15 min. Alert if any agent fails 3+ consecutive invocations.

Human CheckpointsLive

Editorial Approval Queue.

AGT-03, AGT-04, and AGT-06 queue outputs for editor review. Average approval turnaround: <2 hours during business hours.

OverrideLive

One-click Kill Switch.

Any agent can be paused or stopped instantly from the governance dashboard. In-flight items are held in queue. Override authority: the editor.

RollbackLive

Draft Reversion.

WordPress posts revert to draft. Distribution posts deletable per-platform. Spotify playlists maintain version history. All agent actions logged.

12.8 · Reusable Patterns Extracted

14 patterns extracted from the Indietheka sprint — the original 4 updated with production detail, plus 10 new patterns each born from a real incident with a traceable receipt. Documentation standard for each: inputs/outputs, dependencies, governance needs, incident receipt, and adaptation notes for other client contexts.

Updated originals (USP-001–004)

ID	Pattern	Category	Production enrichment
USP-001	RSS-to-CMS Ingestion Pipeline	Content Ops	3-layer content enrichment cascade (source scraping → music embed lookup → AI-assisted rewrite), automated self-check of 4 SEO fields with auto-fix, and a spreadsheet column lock to prevent duplicate processing
USP-002	AI-Assisted Editorial with Voice Preservation	Content Ops	Prohibited-word list enforcement, verb-tense rules table (5 situations), heading rules (lead with the key fact, not a generic label), 300–420 word hard limit per article
USP-003	Multi-Channel Distribution Pipeline	Content Ops	6-template image generation with A/B rotation per post, external image hosting (selected specifically because Meta’s image fetcher is blocked by certain CDNs), one-post-per-run pacing for natural feed spacing
USP-004	Queue-Driven Pipeline with Column-Based Approval Gates	Content Ops	Renamed from “Data-Driven Curation with Human Approval Gate”. Thursday→Friday discovery-to-review cycle, spreadsheet as state machine backed by an API endpoint, with shared client libraries for reading and writing queue state

New patterns (USP-005–014) — each traceable to a production incident:

ID	Pattern	Category	Description	Incident receipt
USP-005	Pre-Publish Verification Protocol	Quality	9 check groups covering every editorial accuracy dimension. Silent on success, blocks on failure. Fires on any AI-authored content. Each rule is traceable to a specific published article and date.	Grew from 0 to 9 checks over 2 weeks of production errors
USP-006	Cover Art Verification Pipeline	Quality	Fetches album artwork using structured identifiers from Spotify or MusicBrainz — never freeform search. Every image is visually verified before rendering to prevent wrong-album art from reaching production.	Wrong cover published — AM artwork served instead of the correct album (2026-04-13)
USP-007	Chunked Upload System	Infrastructure	Transfers large HTML articles from the sandboxed environment to WordPress by encoding them, splitting into manageable chunks (~6,700 characters each), reassembling browser-side, and uploading. Verifies running totals after each chunk to catch corruption.	Cornerstone article pipeline — articles too large for single-request upload
USP-008	Two-Step SEO Metadata Update	Infrastructure	When updating article content and SEO metadata in a single WordPress API call, the SEO plugin doesn’t refresh certain fields. Requires a second API call with only the metadata to force the refresh.	Backlog cleanup of 85 posts with stale SEO descriptions (2026-04-18)
USP-009	API Endpoint for Spreadsheet Writes (No UI Automation)	Infrastructure	Uses Google Sheets as a database via an API endpoint instead of browser UI automation. Standard HTTP libraries handle the endpoint’s redirect correctly; some command-line tools do not. Rule: never use browser automation to write spreadsheet cells.	Cell-drift and undo-cascade bugs retired after migrating Long-Form Editorial Pipeline to the API endpoint
USP-010	Featured Image ≠ Carousel Cover	Content	Never reuse a vertical Instagram carousel cover (1080×1350) as a WordPress featured image. Always generate a separate 1600×900 landscape asset for the website.	Coachella W1 article used the wrong image ratio (2026-04-14)
USP-011	Temporal Claims Verification	Quality	Claims like “first time in N years”, “comeback”, or “after N years” must be verified against the publication date, not the drafting date. For multi-weekend festivals: never claim a “first” or “comeback” in Weekend 2 if the artist already played Weekend 1.	The xx incorrectly labeled as a comeback in Coachella W2 carousel (2026-04-16)
USP-012	Multi-Template Carousel System	Content	Two deliberately contrasting Instagram carousel templates — one album-focused (black backgrounds, album art, pink accents) and one artist-focused (full-bleed desaturated concert photos, high energy). Alternate in the feed for visual diversity.	Feed looked monotonous; solved by introducing the second template style (2026-04-16)
USP-013	Weekly Planning Loop with Cumulative Memory	Strategy	Sunday evening automation closes the editorial week: analyzes traffic and search performance, identifies which content types and topics worked, writes a weekly plan (with draft IDs and SEO targets) and appends to a cumulative strategic memory file. These outputs feed into Data-Driven Strategy Proposals and Daily Competitive Brief the following week, creating a closed learning loop.	First run 2026-04-20; insights already influenced the following week’s calendar
USP-014	Page Builder Safety	Infrastructure	Never update page-builder pages (Elementor, etc.) via the standard WordPress REST API — it silently strips theme-specific layout metadata, breaking the visual design. Use custom AJAX endpoints or the page builder’s own API instead.	Homepage layout broken after a standard API update (2026-04-18)

12.9 · Studio-Level Orchestration (Atlas) · Post-Sprint extension

Atlas was added in Q2 2026 — six months after Sprint closeout — when the editor needed a single touch-point above the fleet rather than per-agent dashboards. Atlas reads the entire migrated fleet, classifies editorial feedback in Slack, commits corrections back to the repo, dispatches non-deterministic work to Cowork sessions, and runs a morning brief against everything below it. Not part of the original Sprint scope — an extension built using the same patterns the Sprint extracted, which is the proof the library compounds.

Daily briefLive

Morning Brief.

Synthesizes the previous 24h across all 12 agents into a single Slack post at 7am CDMX. Highlights, anomalies, lock-not-acquired events, queue depths.

Feedback loopLive

Feedback → Commit.

Editor reacts to a Slack message; a classifier proposes the correction in-thread; ✅ commits the diff to the agent's repo via GitHub API. No redeploy needed for prompt-level fixes.

DispatchLive

Cowork Escalation.

When a request lands that requires non-deterministic judgment — a redesign, a rewrite, a one-off investigation — Atlas opens a Cowork delegation with full context. The human resumes inside a Claude session, not a triage queue.

WatchersLive

Continuous Sweep.

Every 30 min Atlas sweeps the fleet for stuck queues, broken integrations, drift between spec and runtime. Findings post to a gates dashboard; the editor approves or dismisses each one.

§ Why Atlas counts as proof, not extra credit

Atlas was built using the Sprint's own patterns. Durable publisher, scheduled audit with HITL, the Slack and GitHub integrations — all four had production receipts before Atlas existed. Atlas is the test that the library is rich enough to host a new system without inventing new infrastructure. Same pattern, second-order use.

§ Why this matters

Every table above corresponds to a template in the Lighthouse Kit. The Workflow Audit, Baseline Metrics, Redesign Blueprint, Agent Spec Sheets, Operations Manual, and Governance Runbook were all filled in during this engagement — this section shows what the completed versions look like. Use Indietheka as the reference when filling in templates for the next client.

Read it. Apply it. Or hire the studio to ship against it.

This framework is Umbra Studio's internal operating document. If you want to run a Lighthouse Sprint on one of your own workflows, talk to us.

Request a Sprint → Media Kit Umbra Group