The Lighthouse Sprint Operating Framework.
The complete operational playbook for Umbra Studio's core engagement — from discovery through handoff. This document defines every stage, activity, deliverable, quality gate, and governance component of a Lighthouse Sprint.
Read this end-to-end before leading or contributing to a sprint. Every table corresponds to a template in the Lighthouse Kit.
Philosophy & principles.
Every decision in a Lighthouse Sprint is governed by seven operating constraints. These aren't aspirations — they're hard rules. If a sprint activity violates any of them, the activity gets redesigned or removed.
Workflows, not tools.
We rebuild workflows with intelligence at the core. We don't bolt agents onto legacy processes. If the workflow doesn't change, the engagement doesn't start.
Human above the loop.
Humans supervise and make judgment calls. They don't bottleneck. Oversight lives at the governance layer, not inside the execution path.
Production or nothing.
We ship working systems — not prototypes, not proofs of concept, not strategy decks. If it doesn't run in production, it doesn't count.
Measure everything.
Baseline before. Measure after. Every engagement produces quantified before/after data. Opinions don't prove ROI — numbers do.
Governance is not optional.
Every deployed agent ships with monitoring, alerting, audit trail, escalation paths, human override, and rollback capability. No exceptions.
Patterns compound.
Every engagement extracts reusable components into the Pattern Library. The next client benefits from the last one. Studio gets better with every sprint.
Handoff means independence.
The client's team runs the system after we leave. If they can't operate it without us, the handoff failed. Dependency is a design flaw, not a revenue strategy.
Sprint architecture + Watch.
The Lighthouse Sprint is a fixed-scope, time-boxed engagement with four sequential stages. Each stage ends with a quality gate — a go/no-go decision that must be passed before the next stage begins. After Stage 4, the engagement extends into Lighthouse Watch — the productized continuous service that maintains the system, updates governance, and ships one improvement per month drawn from the pattern library.
The four Sprint stages + the continuation
| Stage | Name | Duration | Core output | Gate |
|---|---|---|---|---|
| 01 | Discovery | 1–2 weeks | Friction map, baseline metrics, go/no-go | Gate 1 · Fit Decision |
| 02 | Redesign | 1–2 weeks | Agentic workflow blueprint, governance design | Gate 2 · Design Approval |
| 03 | Build & Deploy | 3–5 weeks | Production agents, governance wrapper, monitoring | Gate 3 · Production Readiness |
| 04 | Handoff | 1–2 weeks | Training, documentation, outcome measurement | Gate 4 · Independence Verified |
| +05 | Lighthouse Watch Continuous | 12+ months · monthly cadence | Monthly health report, the +1 improvement, governance updates, model swap testing | Renewal at 12 mo |
After the Sprint: Lighthouse Watch
The Sprint ends at Gate 4 with a working production system. What happens next is the part most consultancies leave unsolved. Foundation models drop every 8–12 weeks; eval suites drift; governance ages; the pattern library sharpens with every subsequent engagement. Without a structural mechanism to absorb that churn, the system you signed off on this quarter starts to look different from the system you've got next quarter — and the people closest to the system aren't always the ones who notice first.
Lighthouse Watch is that mechanism. It is part of the Lighthouse architecture, not an add-on. Every Sprint contract proposes Watch as the default next step in Week 8; about 70% of clients sign Watch · Standard within 30 days of Sprint closeout. The lead engineer who shipped the Sprint is the lead engineer who runs the Watch — no learning-curve gap.
| Watch stream | What it covers | Cadence |
|---|---|---|
| A · Foundation maintenance | Model swap testing within 7 BD of any major release; eval suite re-runs; prompt-drift detection; provider deprecation tracking | Monthly + on-release |
| B · Governance & observability | Governance Runbook revisions tied to incidents and policy; acceptance-rate monitoring (target ≥80%); reviewer-UI feedback loop; quarterly architecture review | Monthly + quarterly |
| C · The +1 improvement Differentiator | One delivered improvement every month: either a new pattern from the library deployed into the client's system, or a workflow optimization addressing a measured friction. Eval-gated before promotion — if it fails its eval, it doesn't ship and we owe it the next month. | Monthly · eval-gated |
| D · Monthly health report | 3–4 page report on the 1st of every month, signed by the lead engineer. Model status, acceptance trends, evals, incidents, the +1 delivered, library updates, next-month preview. | Monthly · 1st of month |
Timeline variants
| Variant | Duration | Stages | Use case |
|---|---|---|---|
| Discovery Only | 2 weeks | Stage 1 only | Client wants assessment before committing to full sprint |
| Standard Sprint | 6–10 weeks | All four stages | Default engagement — one workflow, end-to-end |
| Extended Sprint | 10–16 weeks | All four stages | Complex workflows, multi-system integrations, regulated industries |
| LATAM Sprint | 6–10 weeks | All four stages | Latin American market — adjusted scope and pricing for regional fit |
Roles.
Every Lighthouse Sprint has six defined roles — three from Studio, three from the client. Role clarity eliminates ambiguity about who owns decisions, who does the work, and who signs off at each gate.
Studio roles · Forward Deployed Engineers
| Role | Responsibility | Gate authority |
|---|---|---|
| Sprint Lead | Owns the engagement end-to-end. Runs discovery sessions, presents at gates, manages client relationship. Single point of accountability. | Recommends go/no-go at all gates |
| Agent Engineer | Builds, tests, and deploys the agentic workflows. Owns the technical implementation from Redesign through Handoff. | Signs off on Gate 3 |
| Governance Architect | Designs and implements the governance wrapper — monitoring, alerting, audit trail, escalation, override, rollback. Ensures compliance. | Signs off on governance components at Gate 3 |
Client roles
| Role | Responsibility | Gate authority |
|---|---|---|
| Executive Sponsor | Budget authority and organizational air cover. Removes blockers, ensures team access and time. Must attend every gate. | Final go/no-go at Gate 1 and Gate 2 |
| Workflow Owner | The person who currently runs the workflow being redesigned. Domain expert — source of truth for how things actually work. | Validates accuracy at Gate 1; approves design at Gate 2 |
| Technical Counterpart | Client's IT/engineering contact. Handles access, integrations, security review, and production environment requirements. | Signs off on technical readiness at Gate 3 |
Stage 1 — Discovery.
Discovery is the foundation. We map the current workflow end-to-end, stopwatch every step, quantify bottlenecks, and determine whether this workflow is a fit for agentic redesign. If it isn't, we say so — and the engagement ends cleanly at Gate 1.
Day-by-day activities
| Day | Activity | Who | Output |
|---|---|---|---|
| Day 1 | Kickoff & workflow walkthrough | Sprint Lead + Workflow Owner | Process narrative, initial workflow map |
| Day 2 | Shadow sessions — observe the workflow in real time | Sprint Lead + Agent Engineer | Annotated observations, pain points logged |
| Day 3 | Stopwatch exercise — time every step and handoff | Sprint Lead | Cycle-time data per step |
| Day 4 | Bottleneck analysis & error-rate review | Sprint Lead + Governance Architect | Bottleneck cost analysis, error taxonomy |
| Day 5 | Stakeholder interviews (3–5 people) | Sprint Lead | Qualitative friction points, tribal knowledge documented |
| Day 6–8 | Data synthesis & baseline metrics compilation | Full Studio team | Baseline Metrics Sheet |
| Day 9–10 | Gate 1 preparation & presentation | Sprint Lead | Discovery Report, Gate 1 recommendation |
Discovery tools
Workflow Audit Template.
Structured spreadsheet for capturing every step, actor, system, input, output, time, and decision point in the current workflow.
Stopwatch Protocol.
Standardized method for timing workflow steps — setup for parallel steps, waiting vs. active time, and handoff delays.
Bottleneck Scoring Matrix.
Framework for ranking bottlenecks by frequency, cost, and amenability to agentic automation. Produces a prioritized target list.
Stakeholder Interview Guide.
Semi-structured interview protocol. Captures what the process map misses: workarounds, tribal knowledge, undocumented dependencies.
Fit Assessment Rubric.
Scoring model for determining if the workflow is suitable for agentic redesign. Evaluates repeatability, data availability, decision complexity, and integration feasibility.
Deliverables
- Workflow Audit — Complete map of the current workflow with time, cost, and error data per step.
- Baseline Metrics Sheet — Quantified snapshot: cycle time, throughput, error rate, cost-per-unit, human hours per cycle.
- Discovery Report — Narrative summary with findings, bottleneck ranking, and initial redesign hypotheses.
- Gate 1 Recommendation — Go, no-go, or conditional-go with specific conditions to resolve.
Executive Sponsor and Workflow Owner review the Discovery Report and Baseline Metrics.
The Sprint Lead presents the go/no-go recommendation.
- Go — The workflow is a fit for agentic redesign. Proceed to Stage 2.
- Conditional Go — Proceed, but specific conditions must be met (e.g. data access, system access, organizational commitment).
- No-Go — The client receives the Discovery Report and Baseline Metrics as standalone deliverables. Engagement ends cleanly — no hard feelings, no sunk cost pressure.
Stage 2 — Redesign.
Redesign is where we architect the new workflow. Not optimization of the old one — a fundamentally new workflow where agents carry the operational load and humans supervise from above the loop.
Agent / human allocation framework
Every task in the workflow is classified into one of four categories:
| Category | Description | Examples |
|---|---|---|
| Fully Autonomous | Agent handles end-to-end. No human in the path. Governance layer monitors. | Data extraction, formatting, routing, scheduling, notifications |
| Agent-Initiated, Human-Approved | Agent does the work, queues it for human review before execution. | Content generation, financial calculations, customer communications |
| Human-Led, Agent-Assisted | Human makes the decision; agent provides data, drafts, or recommendations. | Strategy decisions, exception handling, complex negotiations |
| Human Only | Requires human judgment, creativity, or relationship. No agent role. | Executive decisions, sensitive conversations, creative direction |
Deliverables
- Redesign Blueprint — The new workflow map: every task, its allocation category, data flows, integration points, and governance touchpoints.
- Agent Spec Sheet — Per-agent specification: inputs, outputs, triggers, tools, guardrails, escalation conditions, success metrics.
- Governance Design — Monitoring, alerting, and escalation architecture for the redesigned workflow.
- Risk Register — Identified risks with probability, impact, and mitigation strategies.
Executive Sponsor, Workflow Owner, and Technical Counterpart review the Redesign Blueprint and Agent Spec Sheets.
- Approved — The design is accepted. Proceed to Build.
- Approved with Changes — Specific modifications required — documented, time-boxed, resolved before Build begins.
- Rejected — Fundamental design disagreement. Return to Redesign for a new approach, or end engagement. Client keeps all Discovery and Redesign deliverables.
Stage 3 — Build & Deploy.
Build is the longest stage. We construct the agents, wire integrations, deploy the governance wrapper, and push to production. Weekly sprint cadence with demos every Friday.
Weekly sprint cadence
| Day | Activity | Output |
|---|---|---|
| Monday | Sprint planning — scope the week's build targets against the Blueprint | Weekly build plan |
| Tue–Thu | Build, test, integrate — Agent Engineer and Governance Architect execute | Working increments |
| Fri AM | Internal review — team tests against acceptance criteria | Test results, bug log |
| Fri PM | Client demo — show working functionality against baseline metrics | Demo recording, feedback log |
5-week build sequence (standard sprint)
| Week | Focus | Milestone |
|---|---|---|
| Wk 1 | Foundation — core agent scaffolding, data pipeline, integration auth | First agent runs end-to-end in staging |
| Wk 2 | Core agents — build primary autonomous and human-approved agents | All primary agents functional in staging |
| Wk 3 | Governance layer — monitoring, alerting, audit trail, escalation paths | Governance wrapper operational |
| Wk 4 | Integration & hardening — production systems, edge cases, load testing | System passes integration tests |
| Wk 5 | Production deploy — supervised operation with Studio monitoring | System live in production with supervision |
Deliverables
- Production Agents — Deployed, tested, monitored agents running in the client's production environment.
- Governance Wrapper — Full monitoring, alerting, audit, escalation, override, and rollback infrastructure.
- Integration Layer — All connections to client systems, APIs, and data sources.
- Monitoring Dashboard — Real-time visibility into agent performance, error rates, escalation activity.
- Test Suite — Automated tests covering agent behavior, edge cases, and governance triggers.
Agent Engineer, Governance Architect, and Technical Counterpart jointly review.
- All agents pass acceptance criteria against the Redesign Blueprint.
- Governance wrapper is operational — monitoring, alerting, escalation, override, rollback all tested.
- Integration tests pass — all connections to client systems verified.
- Performance meets baseline targets — measurable improvement over Discovery baseline.
- Security review complete — Technical Counterpart signs off.
Stage 4 — Handoff.
Handoff is where Studio proves the system works and proves the client can run it independently. If either condition fails, Handoff isn't complete.
Handoff activities
| Activity | Who | Duration | Output |
|---|---|---|---|
| Operator training sessions | Sprint Lead + Workflow Owner's team | 2–3 sessions | Trained operators who can manage day-to-day operations |
| Supervised operation period | Studio monitors, client operates | 3–5 days | Client team runs the system with Studio as safety net |
| Outcome measurement | Sprint Lead | 1–2 days | Before/after metrics comparison against baseline |
| Pattern extraction | Agent Engineer + Governance Architect | 1–2 days | Reusable agent components documented for Pattern Library |
| Documentation handoff | Full Studio team | 1 day | Operations manual, governance runbook, escalation guide |
| Watch transition If signed | Sprint Lead + Studio Principal | Final day | Lighthouse Watch starts the day after handoff — same lead engineer, no gap. Slack channel transitions, monthly cadence kicks off. |
Deliverables
- Operations Manual — Step-by-step guide for day-to-day operation of the agentic workflow.
- Governance Runbook — How to respond to alerts, escalations, and incidents. Includes override and rollback procedures.
- Outcome Report — Quantified before/after comparison: cycle time, throughput, error rate, cost-per-unit, human hours saved.
- Pattern Library Contributions — Reusable components extracted and cataloged for future use.
- 30-Day Support Window — Post-handoff email/Slack support for questions and issues (included in fee). Default next step: transition into Lighthouse Watch — the productized monthly continuation.
- Lighthouse Watch proposal — Sent in Sprint Week 8 with sales sheet attached. Default offer is Watch · Standard ($24k/mo, 12-month term) starting the day after handoff. About 70% of Sprint clients sign within 30 days. View sales sheet →
The final gate verifies the client can run the system without Studio involvement.
- Supervised operation completed — client team operated for 3–5 days with no critical issues.
- Operators trained and certified — handle normal operations, escalations, and basic troubleshooting.
- Outcome metrics documented — measurable improvement over baseline, signed off by Executive Sponsor.
- All documentation delivered — operations manual, governance runbook, escalation guide reviewed.
- Pattern contributions cataloged — reusable components extracted and documented for Studio's Pattern Library.
The governance wrapper.
The Governance Wrapper is not an add-on — it's a core component shipped with every Lighthouse Sprint. Six components, all required, all production-grade.
Monitoring.
Real-time visibility into agent activity: invocations, latency, success/failure rates, resource consumption. Feeds the Stage 3 dashboard.
Alerting.
Configurable alerts for anomalies: error rate spikes, latency degradation, unexpected behavior, throughput drops. Routed via escalation path.
Audit Trail.
Every agent decision logged: inputs, outputs, reasoning, timestamps, data accessed. Immutable. Queryable. Required for compliance-sensitive industries.
Escalation Paths.
Defined routes from agent to human for decisions that exceed the agent's authority or confidence. Includes time-based auto-escalation.
Human Override.
Any agent action can be overridden by an authorized human at any time. Overrides logged in the audit trail. Designed for human supremacy, not agent autonomy.
Rollback.
Every deployment includes rollback capability to the previous known-good state. Triggered automatically (via thresholds) or manually (via override).
The pattern library.
The Pattern Library is Studio's compounding asset. Every engagement extracts reusable agent components — patterns that can be adapted and deployed in future sprints. The library currently holds 18 patterns across four Umbra Group properties: 14 production-tested from Indietheka (each with a traceable incident receipt), plus 4 from Radiohead Community, Transit, and Orbit at various stages of development.
Catalog format
| Field | Description |
|---|---|
| Pattern ID | Unique identifier (USP-XXX format) |
| Name | Descriptive name |
| Category | Content Ops, Quality, Infrastructure, Strategy, Distribution, Monitoring, etc. |
| Source | Engagement or property where the pattern was first extracted |
| Description | What the pattern does and when to use it |
| Incident Receipt | The specific production incident (with post ID and date) that created the rule. Proves the pattern is battle-tested, not theoretical |
| Inputs / Outputs | Data the pattern consumes and produces |
| Dependencies | Systems, APIs, or other patterns required |
| Governance | Monitoring, alerting, and escalation needs |
| Adaptation Notes | How to modify the pattern for different client contexts |
Indietheka patterns (USP-001–014) · all production-tested
14 patterns extracted from the first Lighthouse Sprint. The original 4 were designed during the sprint; the remaining 10 emerged from production incidents and were codified as reusable rules. Full detail with incident receipts in §12.8.
| ID | Name | Category | Status |
|---|---|---|---|
| USP-001 | RSS-to-CMS Ingestion Pipeline 3-layer content enrichment cascade, automated SEO self-check with auto-fix, deduplication lock via spreadsheet column. | Content Ops | Production |
| USP-002 | AI-Assisted Editorial with Voice Preservation Prohibited-word enforcement, verb-tense rules, heading guidelines, 300–420 word hard limit. Maintains brand voice at scale. | Content Ops | Production |
| USP-003 | Multi-Channel Distribution Pipeline 6-template image generation with A/B rotation, CDN-compatible hosting, one-post-per-run pacing for natural feed spacing. | Content Ops | Production |
| USP-004 | Queue-Driven Pipeline with Approval Gates Spreadsheet-as-state-machine with API-backed writes. Thursday→Friday discovery-to-review cycle. Shared client libraries for queue state. | Content Ops | Production |
| USP-005 | Pre-Publish Verification Protocol 9 check groups covering every editorial accuracy dimension. Silent on success, blocks on failure. Each rule traceable to a specific incident. | Quality | Production |
| USP-006 | Cover Art Verification Pipeline Fetches artwork via structured identifiers, never freeform search. Visual verification before rendering. Born from wrong-cover incident. | Quality | Production |
| USP-007 | Chunked Upload System Transfers large HTML articles to CMS by encoding, splitting into chunks, reassembling browser-side, and uploading with integrity checks. | Infrastructure | Production |
| USP-008 | Two-Step SEO Metadata Update Workaround for SEO plugins that don’t refresh metadata in a single API call. Requires a second call with only the metadata payload. | Infrastructure | Production |
| USP-009 | API Endpoint for Spreadsheet Writes Uses Google Sheets as a database via API endpoint instead of browser automation. Solves cell-drift and undo-cascade bugs. | Infrastructure | Production |
| USP-010 | Featured Image ≠ Carousel Cover Never reuse vertical social-media carousel art as horizontal website featured image. Generate separate assets per platform. | Content | Production |
| USP-011 | Temporal Claims Verification Verify “first in N years” and “comeback” claims against publication date. Special rules for multi-weekend events. | Quality | Production |
| USP-012 | Multi-Template Carousel System Two contrasting visual templates that alternate in the feed. Prevents visual monotony in high-volume social publishing. | Content | Production |
| USP-013 | Editorial Feedback Loop Weekly automation that analyzes performance, writes a content plan, and appends to cumulative strategic memory. Closed learning loop. | Strategy | Production |
| USP-014 | Page Builder Safety Never update page-builder pages via standard CMS API — use the builder’s own API or custom endpoints to preserve layout metadata. | Infrastructure | Production |
Cross-property patterns (USP-015–018)
Patterns from other Umbra Group properties at various stages of development. These will be enriched with incident receipts as each property’s agents mature in production.
| ID | Name | Category | Source | Status |
|---|---|---|---|---|
| USP-015 | Community Content Scheduler Content-calendar management across platforms. Caption generation, hashtag optimization, best-time-to-post scheduling. | Distribution | Radiohead Community | Production |
| USP-016 | Automated Press Kit Generator Generates complete electronic press kits from artist metadata, press coverage, and media assets. Formats for email, web, and download. | Content | Transit | Development |
| USP-017 | Media Outlet Matching & Outreach Matches artist profiles to relevant outlets and journalists based on genre, coverage history, and audience overlap. | Distribution | Transit | Development |
| USP-018 | Fan Engagement Signal Monitor Monitors community engagement signals and surfaces actionable insights. Anomaly detection for viral moments. | Monitoring | Orbit | Design |
Substrate & orchestration patterns (USP-019–022) · Indietheka, post-Sprint
Four patterns extracted from migrating Indietheka's 16-agent fleet to durable infrastructure (Q1 2026) and building Atlas — the studio-level orchestrator — on top (Q2 2026). All four are in production at Indietheka and packaged as @umbra/* for cross-property reuse.
| ID | Name | Category | Source | Status |
|---|---|---|---|---|
| USP-019 | Durable Workflow Publisher Idempotent, retryable, multi-step publisher with saga-style recovery. Migrated three publishing agents off Cowork sessions to a single shared runtime. | Infrastructure | Indietheka | Production |
| USP-020 | Multi-Channel Saga Fanout Distributes a single payload across N independent channels with per-channel saga and partial-success gate. Used by Distribution and Weekly Recap. | Infrastructure | Indietheka | Production |
| USP-021 | Scheduled Audit with HITL Site-wide scheduled audit that emits dormant findings; activated only when a human approves them in the operational dashboard. Keystone for Comprehensive Site Audit. | Quality | Indietheka | Production |
| USP-022 | Cron Discover-Rank-Publish Periodic discovery → ranking → publish pipeline against a third-party API. Early-abort budgeting, error classification, env-driven page sizing. Spotify v1. | Content Ops | Indietheka | Production |
How patterns flow back to existing clients: Lighthouse Watch
A pattern library that only ships with new Sprints is a static asset. Watch is what makes it compounding. Every Watch month, the lead engineer reviews the most recent library additions and proposes one as the next month's +1 improvement — deployed into the client's existing system, eval-gated before promotion. After 12 months on Watch, a client's system has accumulated 12 deployed pattern updates that they didn't commission individually.
The reverse channel matters too: every +1 delivered surfaces a candidate pattern back into the library. Three matching +1's across clients = the pattern moves from candidate to canonical. Watch is therefore both a distribution channel (library → client) and a research instrument (client signal → library) — the mechanism that keeps engagement #N priced below engagement #1 because the pattern shelf is deeper.
Engagement & pricing.
Studio engagements are fixed-fee and outcome-tied. The client knows the cost upfront. If the measurable outcome doesn't beat the baseline, the pricing structure reflects that.
Sprint tiers
| Tier | Duration | Scope | US pricing | LATAM pricing |
|---|---|---|---|---|
| Discovery Only | 2 weeks | Stage 1 only — assessment and baseline | $15–30K | $2–5K |
| Standard Sprint | 6–10 weeks | All four stages — one workflow end-to-end | $75–200K | $8–25K |
| Extended Sprint | 10–16 weeks | Complex workflows, regulated industries | Scoped per engagement | Scoped per engagement |
Lighthouse Watch tiers · the continuous service
After the Sprint ships, the system extends into Lighthouse Watch: monthly cadence, three tiers, 12-month minimum on Standard and Pro. Default offer at Sprint closeout is Watch · Standard. About 70% of Sprint clients sign within 30 days. Sales sheet · contract template.
| Watch tier | Term | Includes | US pricing | LATAM pricing |
|---|---|---|---|---|
| Watch · Essentials | 6-month minimum | Foundation maintenance + governance + monthly health report — no +1 improvement | $14K / mo | $2–3K / mo |
| Watch · Standard Default | 12-month minimum | Essentials + the +1 improvement (eval-gated, monthly) + quarterly architecture review | $24K / mo | $3–5K / mo |
| Watch · Pro | 12-month minimum | Standard + named-engineer commitment + 4 hrs/mo advisory + 4-hr SLA on P0 + 8-hr emergency bank/yr | $38K / mo | $5–7K / mo |
| Sprint + Watch bundle | Sprint + 12 mo Watch · non-cancellable first 6 mo | Standard Sprint at the entry $75K band + 12 months of Watch · Standard, single PO. Saves $66K vs. separate purchase. | $290K flat | $32–38K flat |
Pricing philosophy
Not hourly.
The client pays for the outcome, not for time. Studio absorbs efficiency risk.
Outcome-tied component.
A portion of the fee is tied to beating the baseline metrics established in Discovery. Structure TBD per engagement.
Discovery as off-ramp.
Discovery Only exists so clients can assess fit without committing to the full sprint. Standalone value.
LATAM reflects market reality.
Adjusted for purchasing power, not discounted as a favor. The engagement quality is identical.
Beyond Sprint and Watch
- Additional Sprints — Each new workflow is a new sprint. Returning clients benefit from accumulated pattern library and institutional knowledge. Multi-system Watch arrangements multiply at 1.7× per additional system.
- Pattern Licensing — Clients who want to build internally without a Sprint can license patterns from the library for self-directed implementation. Terms TBD; typically used by clients whose data foundation is broken (a disqualifier for the Sprint itself).
- Beacon Workshop — Two-day intensive ($18K flat) for leadership teams who want to apply the Lighthouse framework to their own workflows before deciding on a Sprint. About 40% of Workshop clients move to a Sprint within 90 days.
Templates & artifacts.
Every Lighthouse Sprint produces standardized deliverables using these templates. Templates ensure consistency across engagements and reduce the Sprint Lead's prep time.
Template index
| # | Template | Stage | Priority | Status |
|---|---|---|---|---|
| 01 | Workflow Audit | Discovery | P0 · Critical | Ready |
| 02 | Baseline Metrics Sheet | Discovery | P0 · Critical | Ready |
| 03 | Stakeholder Interview Guide | Discovery | P1 · High | To Build |
| 04 | Fit Assessment Rubric | Discovery | P1 · High | To Build |
| 05 | Discovery Report | Discovery | P1 · High | To Build |
| 06 | Redesign Blueprint | Redesign | P0 · Critical | Ready |
| 07 | Agent Spec Sheet | Redesign | P0 · Critical | Ready |
| 08 | Risk Register | Redesign | P1 · High | To Build |
| 09 | Weekly Build Plan | Build | P2 · Medium | To Build |
| 10 | Operations Manual | Handoff | P0 · Critical | Ready |
| 11 | Governance Runbook | Handoff | P0 · Critical | Ready |
| 12 | Outcome Report | Handoff | P1 · High | To Build |
Worked Example — Indietheka.
The Indietheka editorial pipeline was the first Lighthouse Sprint — applied internally to Umbra Group's own newsroom in Q4 2025. This section shows how the framework's templates were applied to a real engagement — a concrete reference for future sprints. Industry: digital media. Workflow: editorial content pipeline. Duration: 8 weeks. Sprint outcome: 7× throughput, 90% fewer human hours, 16 agents, 14 reusable patterns. Six-month update: the fleet has moved off Cowork sessions onto durable infrastructure — 26 workflows running 24/7, a Slack feedback loop for post-publish corrections, an operational dashboard, Atlas running as a studio-level orchestrator above the fleet, and eight patterns externalized as @umbra/* packages. Zero missed publish days across the cutover. It is also the engagement profile the major foundation labs began publicly hiring for in Mexico City in May 2026 — Studio has been running it since Q4 2025.
12.1 · Workflow Audit (Discovery, Week 1–2)
The Workflow Audit captured 12 discrete steps in the editorial pipeline. Key findings from the stopwatch exercise:
| # | Activity | Actor | Active time | Bottleneck | Agent candidate |
|---|---|---|---|---|---|
| 01 | Browse 8+ music outlets for stories | Editor | 45 min/day | 5 | Yes |
| 02 | Identify coverage gaps vs. competitors | Editor | 30 min/day | 4 | Yes |
| 03 | Research album/artist for review | Editor | 60 min | 3 | Partial |
| 04 | Listen to album | Editor | 45 min | 1 | No |
| 05 | Write review in Spanish | Editor | 90 min | 2 | Partial |
| 06 | Source and crop featured image | Editor | 15 min | 4 | Yes |
| 07 | Format and publish to WordPress | Editor | 20 min | 5 | Yes |
| 08 | Reformat for Facebook | Editor | 10 min | 5 | Yes |
| 09 | Reformat for Instagram | Editor | 12 min | 5 | Yes |
| 10 | Post to Lnk.Bio + other channels | Editor | 8 min | 5 | Yes |
| 11 | Curate weekly Spotify playlists | Editor | 4 hrs/week | 4 | Partial |
| 12 | Sync playlists to Spotify | Editor | 30 min/week | 5 | Yes |
12.2 · Baseline Metrics (Discovery, Week 2)
| Metric | Baseline value | Unit | Measurement method |
|---|---|---|---|
| End-to-End Cycle Time (per article) | 210 | minutes | Stopwatch log avg, 1 representative week |
| Weekly Throughput | 2–3 | articles/week | WordPress publish log, 4-week average |
| Error Rate | ~8% | % | Formatting errors, missed platforms, broken images |
| Human Hours per Cycle | 3.5 | hours/article | Stopwatch active time end-to-end |
| Cost per Article | ~$105 | $/article | Estimated at $30/hr editorial rate |
| Platform Distribution | 1–2 | platforms/publish | Manual — some platforms frequently skipped |
| Playlist Curation Time | 4.5 | hours/week | Stopwatch — selection + sync combined |
| FTEs Involved | 1 | person | Solo operator |
12.3 · Redesign Blueprint (Redesign, Week 3–4)
The initial redesign classified all 12 workflow steps into four allocation categories. As the system matured in production, four additional agents emerged — expanding the fleet from 6 to 16 and pushing automation coverage across the full editorial lifecycle: RSS → discovery → writing → SEO → publishing → distribution → analytics → calendar planning.
Original 6 agents/automations — designed in Week 3–4 and deployed by Week 7:
| ID | Agent | Classification | What it does |
|---|---|---|---|
| AGT-01 | Continuous Source Monitor | Autonomous | Watches a defined set of external sources on a fixed cadence, deduplicates against persistent history, and queues only new material for downstream handling. In Indietheka: 15 music-press RSS feeds polled every 40 minutes; new stories written to a shared Google Sheet via an API endpoint |
| AGT-02 | Durable Production Publisher | Autonomous | Enriches, rewrites, publishes, and self-checks production content end-to-end via a multi-layer cascade, with per-field auto-fix on failure. The durable layer that ships work to production without manual handoff. In Indietheka: hourly publishing to WordPress with a 3-layer cascade (source scraping → Spotify embed lookup → AI-assisted rewrite); 4 mandatory SEO fields self-checked and auto-fixed |
| AGT-03 | Long-Form Editorial Pipeline | Human-approved | Picks the next approved long-form subject from a queue, locks the row via an API endpoint to prevent re-processing, runs the full research → write → fetch supporting media → publish flow, writes results back. The queue-gated heavy editorial agent. In Indietheka: weekly album reviews from a curated queue, Friday 10am, with cover art fetched via Spotify/MusicBrainz |
| AGT-04 | Multi-Channel Distribution | Autonomous | Generates branded assets from rotating templates and fans out a single payload across N independent channels simultaneously, with per-channel formatting and saga-style failure recovery. One asset per run for natural feed pacing. In Indietheka: 6 rotating image templates posted to Instagram, Facebook, and Lnk.Bio, hourly |
| AUT-05 | Recurring Catalog Sync + Digest | Human-approved | Syncs a curated catalog to an external platform on a recurring cadence, then auto-generates a companion digest asset that surfaces the top entries for social distribution. In Indietheka: weekly Spotify playlist sync via browser automation (API rate-limits prevent direct calls) followed by a Top-10 Instagram carousel |
| AUT-06 | Periodic Recap Generator | Autonomous | Pulls the most relevant items from the period’s output and assembles a visual recap — cover slide, top items, call-to-action — for social distribution. Fires only above a volume threshold. In Indietheka: a 7-slide Instagram carousel each Sunday 10am, contingent on at least 3 published posts that week |
10 agents added in production — each born from gaps discovered after the original 6 went live:
| ID | Agent | Classification | What it does |
|---|---|---|---|
| AGT-07 | Forward-Window Discovery | Autonomous | Scans upstream sources for inbound candidates inside a forward time window, ranks them, writes the top N to a review queue. The discovery half of a queue-gated pipeline. In Indietheka: ~964 followed artists on Spotify plus 5 indie-press outlets scanned each Thursday for the 7-day forward window; 10 ranked candidates per run; pairs 1:1 with Long-Form Editorial Pipeline |
| AGT-08 | Data-Driven Strategy Proposals | Autonomous | Generates ranked content ideas oriented by what’s actually performing — analytics-informed editorial planning, not gut feel. Each proposal carries layout templates, targets, and an outline. In Indietheka: 2 daily editorial list ideas with carousel layouts, SEO targets, and article outlines, oriented by the weekly performance dashboard’s analytics insights |
| AGT-09 | Page-Level Quality Auditor | Autonomous | Audits production pages field-by-field, auto-fixes what is deterministic, flags what needs editorial judgment. The non-judgmental layer of quality. In Indietheka: 5–10 pages/day with title tags, meta descriptions, and internal links auto-corrected; ambiguous cases queued for the editor |
| AGT-10 | Weekly Planning Loop | Autonomous | Closes the operating period, analyzes performance, programs the next period’s plan, and appends to a cumulative memory file the rest of the system reads from. The feedback loop that makes the system learn. In Indietheka: editorial calendar programmed every Sunday 8pm with draft IDs and SEO targets, appended to a strategic memory file that never gets overwritten; outputs feed Data-Driven Strategy Proposals and Daily Competitive Brief the following week |
| AGT-11 | Daily Competitive Brief | Autonomous | Scans a watch-list of competitor sources daily, cross-references against the operator’s own coverage to find gaps, and writes the highest-value opportunities directly to production as drafts — no approval gate. In Indietheka: 8 indie-music outlets (Pitchfork, Stereogum, NME, Brooklyn Vegan, Consequence, Paste, Line of Best Fit, Rolling Stone) scanned each morning at 9am; gap-coverage drafts written to WordPress |
| AGT-12 | Structured-Metadata Enrichment | Autonomous | Classifies content by type, generates the corresponding structured-data payload, and PATCHes it into the production system’s metadata layer. Powers AI-search visibility and rich-result eligibility. In Indietheka: posts classified into Article / Review / NewsArticle / MusicEvent and enriched with JSON-LD every Monday; 119 posts on the day-1 deployment with zero failures; archive backfill ongoing |
| AGT-13 | Eval Runner | Autonomous | Gates every production publish against a deterministic rules suite — required fields, structural integrity, taxonomy, schema validity. Logs every run; the log is the source of truth the next quality layer reads from. Fires reactively on publish hooks. In Indietheka: every WordPress publish validated against evals/rules.json for Yoast compliance, featured image, internal-link minimums, schema validity, slug structure, and tag taxonomy; runs logged to eval_log.jsonl |
| AGT-14 | Quality Monitor | Autonomous | Audits the entire agent ecosystem itself. Cross-checks runtime against spec, detects error patterns not yet codified, ranks findings by impact, and proposes new rules and memories — never writes, only proposes. The meta-quality layer that keeps the rest of the system honest. In Indietheka: weekly Sunday-night scan (Sundays 7pm, before the performance dashboard runs) across session transcripts of the last 14 days, eval log, distribution log, MEMORY.md, and the agent repo; outputs a Quality Report |
| AGT-15 | Weekly Performance Dashboard | Autonomous | Pulls multi-source analytics for the closing period, generates a multi-component dashboard with cross-source anomaly detection, and executes a bounded set of autonomous metadata fixes per run. In Indietheka: GA4 + Search Console pulled each Sunday 6pm; 11-component dashboard (bounce-by-page, keyword opportunity matrix, GSC↔GA4 anomalies, position 1–5 zero-click, action plan, autonomous SEO fixes, 7-day cohort retention, internal site search, path analysis, schema CTR comparison); HTML report written and rows appended to historical Sheets; 4–6 metadata PATCHes per run executed without further approval |
| AGT-16 | Comprehensive Site Audit | Autonomous | Site-wide multi-dimension audit on a fixed cadence. Produces a scored Health Report with an explicit owner per finding — scorer, never fixer. Routes work to the right executor agent or to the human. In Indietheka: monthly run on the 1st at 9am plus on-demand; 8 weighted dimensions (Technical, Content/E-E-A-T, On-Page/SXO, Schema, Performance/CWV, AI/GEO, Sitemap, Images); findings routed to AGT-08 (titles/metadescs), AGT-11 (schema), or manual workstreams |
Publishing gate — Editor marks a row as approved → agent processes and publishes → marks the row as complete (preventing re-processing).
Review queue gate — Forward-Window Discovery writes proposed albums → editor approves individual picks → Friday pipeline picks them up automatically.
Data-Driven Strategy Proposals gate — proposals generated daily; editor selects one → single binary question (“publish now or schedule?”) → full pipeline executes without further interruption.
This “spreadsheet-as-state-machine with API-backed writes” pattern replaced an earlier approach using browser UI automation that caused cell-drift and undo-cascade bugs. It is a reusable pattern (see USP-009 in §12.8).
12.4 · Agent Inventory (Production)
Sixteen agents/automations in production, evolved iteratively from the original 6. Each agent’s design was informed by incidents from previous agents — Long-Form Editorial Pipeline’ endpoint-only writes came from Multi-Channel Distribution’s cell-drift bugs; the Pre-Publish Verification Protocol grew organically from 0 to 9 check groups as errors were caught and codified.
| ID | Name | Classification | Schedule | Integration points |
|---|---|---|---|---|
| AGT-01 | Continuous Source Monitor | Autonomous | Every 40 min | 15 RSS feeds, persistent deduplication log, API endpoint writes to Google Sheets |
| AGT-02 | Durable Production Publisher | Autonomous | Every hour | Google Sheets (approval gate), OpenAI API (rewrite), Spotify (embed lookup), WordPress, Yoast SEO (4-field self-check with auto-fix) |
| AGT-03 | Long-Form Editorial Pipeline | Human-approved | Friday 10am | Review queue spreadsheet (approval gate), API endpoint for queue state, Claude API (review generation), Album of the Year (cover art), WordPress, Yoast SEO (two-step metadata update) |
| AGT-04 | Multi-Channel Distribution | Autonomous | Every hour (1 post/run) | WordPress, image generator (6 templates, A/B rotation), Meta Graph API (Instagram + Facebook), Lnk.Bio, external image hosting, persistent distribution log |
| AUT-05 | Recurring Catalog Sync + Digest | Human-approved | Sunday | Spotify Web Player (browser automation — not API), Meta Graph API (Top 10 carousel), carousel image generator, Lnk.Bio |
| AUT-06 | Periodic Recap Generator | Autonomous | Sunday 10am | WordPress, recap image generator (canvas rendering), Meta Graph API, Lnk.Bio, external image hosting |
| AGT-07 | Forward-Window Discovery | Autonomous | Thursday 10am | Spotify followed-artists + album data, 5 indie-press feeds, review queue spreadsheet (API endpoint), weighted scoring algorithm |
| AGT-08 | Data-Driven Strategy Proposals | Autonomous | Daily 3pm | Google Analytics + Search Console insights, WordPress (existing content check), SEO keyword research |
| AGT-09 | Page-Level Quality Auditor | Autonomous | Daily 10am | WordPress, Yoast SEO, Google Search Console data |
| AGT-10 | Weekly Planning Loop | Autonomous | Sunday 8pm | Google Analytics, Google Search Console, WordPress (post counts), persistent weekly plan + cumulative strategic memory files. Feeds insights into Data-Driven Strategy Proposals and Page-Level Quality Auditor the following week |
| AGT-11 | Daily Competitive Brief | Autonomous | Daily 9am | 8 competitor RSS/sitemap feeds, Search Console (own-coverage check), WordPress (draft creation), no approval gate |
| AGT-12 | Structured-Metadata Enrichment | Autonomous | Monday weekly | WordPress REST (PATCH meta), JSON-LD generator, content classifier (Article / Review / NewsArticle / MusicEvent). 119 posts enriched day one with zero failures |
| AGT-13 | Eval Runner | Autonomous | Reactive, on every publish | WordPress REST (last 50 posts), evals/rules.json, eval_log.jsonl (rolling 30-day window). Failures gate the publish; passes append to log for downstream agents |
| AGT-14 | Quality Monitor | Autonomous | Sunday 7pm | Cowork session transcripts (read-only, 14d window), eval_log.jsonl, distribution log, MEMORY.md + linked feedback files, agent repo, WP REST. Outputs quality-reports/YYYY-MM-DD.md |
| AGT-15 | Weekly Performance Dashboard | Autonomous | Sunday 6pm | GA4 (property 488842515), Search Console export, WordPress REST (Bloque-A PATCHes 4–6 posts/run), historical Google Sheets (GA4 + GSC tabs), HTML output |
| AGT-16 | Comprehensive Site Audit | Autonomous | Monthly · 1st 9am | Yoast sitemap index, WordPress REST, PageSpeed Insights API, Search Console (manual export → service-account when wired), previous month’s audit (delta diff). Writes seo-audits/YYYY-MM-DD/audit-report.md with raw evidence preserved |
Supporting production infrastructure — not agents themselves, but shared systems the agents depend on:
Pre-Publish Verification Protocol.
9 check groups covering every dimension of editorial accuracy. Fires on any action where the AI authors content. Covers: lineup/date verification, album data, cover art, artist photos vs album covers, news structure, verb tenses, tags/categories, link integrity, and festival-day validation. Silent on success, blocks on failure.
6 Cornerstone Article Templates.
Guía Interactiva, Editorial Retrospectiva, Ranked List, Artist Monograph, Cornerstone Mega-Guide, Must-See Grid. Each has its own accent color, background style, and component library.
2 Instagram Carousel Templates.
Two deliberately contrasting visual styles — one for album-focused lists (black backgrounds, album art, pink accents) and one for artist-focused features (full-bleed desaturated concert photos). They alternate in the feed for visual diversity.
Chunked Upload System.
Transfers large HTML articles from the sandboxed environment to WordPress by encoding them, splitting into manageable chunks, and reassembling them browser-side before uploading. Solves the constraint that the sandbox can’t reach the WordPress API directly.
GitHub Actions Migration.
Moving agents from local scheduled tasks to cloud-based GitHub Actions. Continuous Source Monitor is live (runs every 40 min in the cloud). Remaining agents pending migration. Architecture: one folder per agent, one workflow per agent, state files committed back to the repository.
Cover Art Verification Pipeline.
Fetches album artwork using structured identifiers from Spotify or MusicBrainz — never freeform search. Every image is visually verified before rendering to prevent wrong-album art from reaching production. Born from a real incident where the wrong cover was published.
12.5 · Build Sequence (Iterative, Incident-Driven)
The original plan called for a linear 3-week deployment (AGT-01/02 in Week 5, AGT-03/04 in Week 6, AGT-05/06 in Week 7). In practice, the system evolved iteratively — each agent’s design was shaped by production incidents from its predecessors. The ten additional agents (AGT-07 through AGT-16) were not in the initial scope; they emerged from gaps revealed only after the core pipeline was running live.
12.6 · Measured Outcomes (Production, Week 8+)
Baseline vs. current production comparison. Initial metrics were captured at Week 8 handoff (6 agents); current figures reflect the mature 16-agent system with SEO, discovery, proposals, and the weekly feedback loop operational.
| Metric | Baseline | Post-sprint (Wk 8) | Current (16 agents) | Delta |
|---|---|---|---|---|
| Weekly Throughput | 2–3 articles | 12–15 articles | 15–20 articles | +7× |
| Human Hours per Article | 3.5 hrs | ~1 hr | ~20 min | −90% |
| Platform Distribution | 1–2 platforms | 6 platforms | 6 platforms | +4 platforms |
| Error Rate | ~8% | <2% | <1% | −88% |
| Agent/Automation Count | 0 | 6 | 16 | +16 agents |
| Agent Invocations | 0 | 160+/week | 1,000+/week | — |
| SEO Coverage | 0 pages/day | — | 5–10 pages/day audited | New capability |
| Album Discovery | Manual browsing | — | 10 ranked candidates/week | New capability |
| Content Strategy | Ad hoc | — | 2 data-driven proposals/day | New capability |
| Editorial Feedback Loop | None | — | Weekly closed-loop (GA4 + GSC → calendar) | New capability |
| Playlist Curation Time | 4.5 hrs/week | 30 min/week | 30 min/week (review) | −89% |
| Quality Checks | 0 | — | 9 check groups, every publish | New capability |
| Reusable Patterns | 0 | 4 | 14 | +14 documented |
| FTEs Required | 1 (overloaded) | 1 (with capacity) | 1 (strategic only) | Same headcount, 7× output |
12.7 · Governance in Production
Agent Health Dashboard.
Tracks success rate, latency, error count, queue depth for all 16 agents. RSS poller checks every 15 min. Alert if any agent fails 3+ consecutive invocations.
Editorial Approval Queue.
AGT-03, AGT-04, and AGT-06 queue outputs for editor review. Average approval turnaround: <2 hours during business hours.
One-click Kill Switch.
Any agent can be paused or stopped instantly from the governance dashboard. In-flight items are held in queue. Override authority: the editor.
Draft Reversion.
WordPress posts revert to draft. Distribution posts deletable per-platform. Spotify playlists maintain version history. All agent actions logged.
12.8 · Reusable Patterns Extracted
14 patterns extracted from the Indietheka sprint — the original 4 updated with production detail, plus 10 new patterns each born from a real incident with a traceable receipt. Documentation standard for each: inputs/outputs, dependencies, governance needs, incident receipt, and adaptation notes for other client contexts.
Updated originals (USP-001–004)
| ID | Pattern | Category | Production enrichment |
|---|---|---|---|
| USP-001 | RSS-to-CMS Ingestion Pipeline | Content Ops | 3-layer content enrichment cascade (source scraping → music embed lookup → AI-assisted rewrite), automated self-check of 4 SEO fields with auto-fix, and a spreadsheet column lock to prevent duplicate processing |
| USP-002 | AI-Assisted Editorial with Voice Preservation | Content Ops | Prohibited-word list enforcement, verb-tense rules table (5 situations), heading rules (lead with the key fact, not a generic label), 300–420 word hard limit per article |
| USP-003 | Multi-Channel Distribution Pipeline | Content Ops | 6-template image generation with A/B rotation per post, external image hosting (selected specifically because Meta’s image fetcher is blocked by certain CDNs), one-post-per-run pacing for natural feed spacing |
| USP-004 | Queue-Driven Pipeline with Column-Based Approval Gates | Content Ops | Renamed from “Data-Driven Curation with Human Approval Gate”. Thursday→Friday discovery-to-review cycle, spreadsheet as state machine backed by an API endpoint, with shared client libraries for reading and writing queue state |
New patterns (USP-005–014) — each traceable to a production incident:
| ID | Pattern | Category | Description | Incident receipt |
|---|---|---|---|---|
| USP-005 | Pre-Publish Verification Protocol | Quality | 9 check groups covering every editorial accuracy dimension. Silent on success, blocks on failure. Fires on any AI-authored content. Each rule is traceable to a specific published article and date. | Grew from 0 to 9 checks over 2 weeks of production errors |
| USP-006 | Cover Art Verification Pipeline | Quality | Fetches album artwork using structured identifiers from Spotify or MusicBrainz — never freeform search. Every image is visually verified before rendering to prevent wrong-album art from reaching production. | Wrong cover published — AM artwork served instead of the correct album (2026-04-13) |
| USP-007 | Chunked Upload System | Infrastructure | Transfers large HTML articles from the sandboxed environment to WordPress by encoding them, splitting into manageable chunks (~6,700 characters each), reassembling browser-side, and uploading. Verifies running totals after each chunk to catch corruption. | Cornerstone article pipeline — articles too large for single-request upload |
| USP-008 | Two-Step SEO Metadata Update | Infrastructure | When updating article content and SEO metadata in a single WordPress API call, the SEO plugin doesn’t refresh certain fields. Requires a second API call with only the metadata to force the refresh. | Backlog cleanup of 85 posts with stale SEO descriptions (2026-04-18) |
| USP-009 | API Endpoint for Spreadsheet Writes (No UI Automation) | Infrastructure | Uses Google Sheets as a database via an API endpoint instead of browser UI automation. Standard HTTP libraries handle the endpoint’s redirect correctly; some command-line tools do not. Rule: never use browser automation to write spreadsheet cells. | Cell-drift and undo-cascade bugs retired after migrating Long-Form Editorial Pipeline to the API endpoint |
| USP-010 | Featured Image ≠ Carousel Cover | Content | Never reuse a vertical Instagram carousel cover (1080×1350) as a WordPress featured image. Always generate a separate 1600×900 landscape asset for the website. | Coachella W1 article used the wrong image ratio (2026-04-14) |
| USP-011 | Temporal Claims Verification | Quality | Claims like “first time in N years”, “comeback”, or “after N years” must be verified against the publication date, not the drafting date. For multi-weekend festivals: never claim a “first” or “comeback” in Weekend 2 if the artist already played Weekend 1. | The xx incorrectly labeled as a comeback in Coachella W2 carousel (2026-04-16) |
| USP-012 | Multi-Template Carousel System | Content | Two deliberately contrasting Instagram carousel templates — one album-focused (black backgrounds, album art, pink accents) and one artist-focused (full-bleed desaturated concert photos, high energy). Alternate in the feed for visual diversity. | Feed looked monotonous; solved by introducing the second template style (2026-04-16) |
| USP-013 | Weekly Planning Loop with Cumulative Memory | Strategy | Sunday evening automation closes the editorial week: analyzes traffic and search performance, identifies which content types and topics worked, writes a weekly plan (with draft IDs and SEO targets) and appends to a cumulative strategic memory file. These outputs feed into Data-Driven Strategy Proposals and Daily Competitive Brief the following week, creating a closed learning loop. | First run 2026-04-20; insights already influenced the following week’s calendar |
| USP-014 | Page Builder Safety | Infrastructure | Never update page-builder pages (Elementor, etc.) via the standard WordPress REST API — it silently strips theme-specific layout metadata, breaking the visual design. Use custom AJAX endpoints or the page builder’s own API instead. | Homepage layout broken after a standard API update (2026-04-18) |
12.9 · Studio-Level Orchestration (Atlas) · Post-Sprint extension
Atlas was added in Q2 2026 — six months after Sprint closeout — when the editor needed a single touch-point above the fleet rather than per-agent dashboards. Atlas reads the entire migrated fleet, classifies editorial feedback in Slack, commits corrections back to the repo, dispatches non-deterministic work to Cowork sessions, and runs a morning brief against everything below it. Not part of the original Sprint scope — an extension built using the same patterns the Sprint extracted, which is the proof the library compounds.
Morning Brief.
Synthesizes the previous 24h across all 12 agents into a single Slack post at 7am CDMX. Highlights, anomalies, lock-not-acquired events, queue depths.
Feedback → Commit.
Editor reacts to a Slack message; a classifier proposes the correction in-thread; ✅ commits the diff to the agent's repo via GitHub API. No redeploy needed for prompt-level fixes.
Cowork Escalation.
When a request lands that requires non-deterministic judgment — a redesign, a rewrite, a one-off investigation — Atlas opens a Cowork delegation with full context. The human resumes inside a Claude session, not a triage queue.
Continuous Sweep.
Every 30 min Atlas sweeps the fleet for stuck queues, broken integrations, drift between spec and runtime. Findings post to a gates dashboard; the editor approves or dismisses each one.
Read it. Apply it. Or hire the studio to ship against it.
This framework is Umbra Studio's internal operating document. If you want to run a Lighthouse Sprint on one of your own workflows, talk to us.