Governance Framework

Fleet Health

Real-time fleet status: provider circuit breakers, message bus stats, and agent heartbeats. Data refreshes every 10s from fleet_health.json.

Providers Healthy: —
Bus Messages: —
Oldest Unprocessed: —
Agents Healthy: —

Decision Breakdown

RA Classifications (all sessions): —

Fleet Status

Fleet Status shows the circuit breaker state for each LLM provider: CLOSED = healthy, HALF_OPEN = recovering, OPEN = failing. Data comes from fleet_health.json.

● Anthropic

No data

● Google

No data

● OpenAI

No data
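
The state-to-label mapping in the Fleet Status description can be sketched in Python as follows. This is only an illustration: the `providers` key and the per-provider `state` field are hypothetical, since the actual schema of fleet_health.json is not shown on this page.

```python
import json

# Labels taken from the Fleet Status description on this page.
STATE_LABELS = {
    "CLOSED": "healthy",
    "HALF_OPEN": "recovering",
    "OPEN": "failing",
}


def summarize_breakers(raw_json: str) -> dict:
    """Map each provider's breaker state to a label.

    A missing or unrecognized state falls back to "No data".
    """
    data = json.loads(raw_json)
    providers = data.get("providers", {})  # hypothetical key, not confirmed
    return {
        name: STATE_LABELS.get(info.get("state"), "No data")
        for name, info in providers.items()
    }


def count_healthy(summary: dict) -> int:
    """Count providers whose breaker is CLOSED (label "healthy")."""
    return sum(1 for label in summary.values() if label == "healthy")
```

Under these assumptions, `count_healthy` would feed a tile like Providers Healthy, and a provider with no recorded state renders as "No data".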

🔴 Live Agent State

Agent State reflects the last write to agent_state.jsonl. Agents write only on state transitions, so a stale timestamp means no state change, not inactivity. For a full activity view including governance decisions, RA classifications, and task runs, see the Activity tab → Activity Stream (BL-088). A BLOCKED or WAITING state that persists for hours requires human input via Telegram.

No state data yet.

Legend: active · idle · done · on-call · waiting for Telegram · BLOCKED (escalation pending)
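
The rule that a stale timestamp only matters for BLOCKED or WAITING agents can be sketched as below. The record fields `state` and `ts` and the one-hour limit are assumptions made for illustration; the page only says "for hours" and does not show the layout of agent_state.jsonl.

```python
import json
from datetime import datetime, timedelta, timezone
from typing import Optional

# States that wait on a human, per the description above.
NEEDS_HUMAN = {"BLOCKED", "WAITING"}


def latest_state(jsonl_text: str) -> Optional[dict]:
    """Return the last non-empty JSON record, i.e. the most recent state write."""
    lines = [ln for ln in jsonl_text.splitlines() if ln.strip()]
    return json.loads(lines[-1]) if lines else None


def needs_human_input(
    record: dict,
    now: datetime,
    limit: timedelta = timedelta(hours=1),  # assumed threshold
) -> bool:
    """True only when the agent has sat in BLOCKED or WAITING longer than `limit`.

    In any other state, an old timestamp just means no transition has occurred.
    """
    if record["state"].upper() not in NEEDS_HUMAN:
        return False
    written = datetime.fromisoformat(record["ts"])  # hypothetical ISO 8601 field
    return now - written > limit
```

Both timestamps need to be timezone-aware (or both naive) for the subtraction to work.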

🏃 Active Sprint

Active Sprint Scope (BL-244): live view of the current sprint items and their execution stage. Updates every 10s from sprint_state.json. Stages: READY → IN_PROGRESS → COMPLETE. This is the source of truth for what the Builder Agent is doing during headless autonomous execution.

Loading sprint data…
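
The READY → IN_PROGRESS → COMPLETE progression can be modeled as a forward-only state machine. The sketch below assumes each sprint item is a dict with a `stage` field; that shape is hypothetical and not taken from sprint_state.json.

```python
from enum import Enum


class Stage(Enum):
    READY = "READY"
    IN_PROGRESS = "IN_PROGRESS"
    COMPLETE = "COMPLETE"


# Forward-only order, as listed in the Active Sprint description.
_ORDER = [Stage.READY, Stage.IN_PROGRESS, Stage.COMPLETE]


def advance(stage: Stage) -> Stage:
    """Move an item to its next stage; COMPLETE is terminal."""
    idx = _ORDER.index(stage)
    if idx == len(_ORDER) - 1:
        raise ValueError("COMPLETE is terminal; no further stage")
    return _ORDER[idx + 1]


def stage_counts(items: list) -> dict:
    """Count items per stage; an unknown stage string raises ValueError."""
    counts = {s.value: 0 for s in _ORDER}
    for item in items:
        counts[Stage(item["stage"]).value] += 1  # "stage" is a hypothetical field
    return counts
```

Rejecting unknown stage strings, rather than ignoring them, keeps a malformed state file from silently under-reporting sprint progress.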

Enterprise Workflow — Agent Interaction Map

Live status from agent_state.jsonl

📋

How to Assess "What's Next"

Milestone structure · WSJF scoring · sprint flow · human checkpoints · human-gate resolution

Open Guide ↗

⚠️ Pending Escalation

No escalation pending.

🚦 Human Gates

No human gates pending.

🎯 Strategic Themes

Loading strategic themes...

📊 Executive Agent Alignment

Loading EA status...

📏 Context Budget Utilization

Loading context budget data...

Agent Registry — 9+1 Roster (R-4) BL-095

🔨 Builder Agent

📄 docs/personas/builder_agent.md

Lead — executes tasks, coordinates swarm

Claude Code Desktop

Tier: 1 — Three Lines of Defense
Skills: /submit_build · /risk_assess · /validate_paths · /ingest · /ingest_commit
River access: Read/Write (pending_review, approved)

Default: claude-sonnet-4-6

Complex: claude-opus-4-6

Cannot write to: audit_logs · context_corpus · policy_memory

🛡️ Risk Assessor

📄 docs/personas/risk_assessment_agent.md

Safety — independent risk classification peer

Claude Code Agent Teams

Tier: Swarm Peer (independent context window)
Scope: Evaluate and classify only — never execute
Classifies: LOW · MEDIUM · HIGH · CRITICAL · RED TEAM
Classifications: —

Current: claude-sonnet-4-6

High-stakes: claude-opus-4-6

Self-authorizes LOW > 0.85 · Escalates CRITICAL to Governance
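
The routing rule on this card can be sketched as a small function. Two points are assumptions rather than facts from the page: that 0.85 is a threshold on a confidence score, and that every case the card does not mention falls into a placeholder `needs_review` route.

```python
from dataclasses import dataclass

# Levels listed on the Risk Assessor card.
LEVELS = ("LOW", "MEDIUM", "HIGH", "CRITICAL", "RED TEAM")
SELF_AUTHORIZE_THRESHOLD = 0.85  # from the card; assumed to be a confidence score


@dataclass(frozen=True)
class Classification:
    level: str
    confidence: float  # hypothetical field, assumed to lie in 0.0..1.0


def route(c: Classification) -> str:
    """Return the route for a classification under the assumptions above."""
    if c.level not in LEVELS:
        raise ValueError(f"unknown risk level: {c.level!r}")
    if c.level == "LOW" and c.confidence > SELF_AUTHORIZE_THRESHOLD:
        return "self_authorize"
    if c.level == "CRITICAL":
        return "escalate_to_governance"
    # Placeholder: the card does not say how the remaining cases are handled.
    return "needs_review"
```

The comparison is strict, matching the "> 0.85" wording, so a LOW classification at exactly 0.85 does not self-authorize.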

⚖️ Governance Agent

📄 docs/personas/governance_agent.md

Safety — autonomous approval authority

Gemini CLI (WSL)

Tier: 2 — Governance Layer
Model: —
Decisions: —
River access: Read/Write (approved, escalated, resolved)

Current: gemini-2.5-flash

Target: gemini-3.1-pro (BL-083 pending)

Alt: gpt-4o if the provider is switched

Writes policy_memory · Communicates via River only

👤 Human (Chris)

Safety — ultimate authority for CRITICAL actions

Telegram · Claude Code Chat · Antigravity · This Dashboard

Tier: 3 — Human Override
Telegram: approve · reject · revoke · confirm
Resolutions: —
CRITICAL guard: 60s confirmation window

Only invoked when Governance escalates — no routine involvement

ACTIVE — PLA

📊 Planning Agent

📄 docs/personas/planning_agent.md

Planning — WSJF scoring, sprint proposals, backlog health, postmortems, velocity tracking

Claude Code Scheduled Task (daily/planning cycle)

Tier: 2 — Autonomous Planning
Schedule: Daily briefing · backlog hygiene · sprint proposals · velocity tracking
Outputs: agent_state.jsonl · planning_dashboard.json · sprint proposals

Current: claude-opus-4-6

Read: BACKLOG.md · sprint_state.json · Write: agent_state.jsonl · planning_dashboard.json

ACTIVE — R-4

🧭 Executive Agent

📄 docs/personas/executive_agent.md

Strategy — strategic alignment, cross-agent coordination, escalation routing

Claude Code Agent Teams

Tier: 1 — Executive Layer
Scope: Strategic theme ownership · strategic coherence review · escalation triage
Outputs: ea_alignment.json · strategic_themes.json · escalation routing decisions

Current: claude-opus-4-6

Writes .artifacts/ea_* · escalation routing — no direct River writes

ACTIVE — R-4

📋 Planning Agent

📄 docs/personas/planning_agent.md

Planning — sprint scoping, WSJF prioritization, backlog triage

Claude Code Agent Teams

Tier: 1 — Planning Layer
Scope: Sprint proposal · WSJF scoring · scope lock enforcement (hook-based, F-002)
Outputs: sprint_state.json · sprint_proposals/ · milestone_estimates.json

Current: claude-sonnet-4-6

Writes sprint_state.json · backlog_scores.json — read-only on audit_logs

ACTIVE — BL-084

🎯 Consultant Agent

📄 docs/personas/consultant_agent.md

Expert advisor — audit, complex analysis, design sessions

Claude Code Agent Teams

Scope: Mandatory for audits and major design sessions · McKinsey/Bain/BCG methodology

Activity (BL-099): — today · — total engagements

Model: claude-opus-4-6

ca_agent.py · writes .artifacts/consultant_report_*.json · weekly ingest scheduled

ACTIVE — BL-063

🔎 Audit Agent

📄 docs/personas/audit_agent.md

Compliance — protocol adherence, constraint violation detection

Claude Code Agent Teams

Scope: Reads audit_logs · flags RA bypass attempts, false autonomy claims · reports to Human via Telegram

Planned: gemini-2.5-flash or claude-sonnet-4-6

READ ONLY — observes, never modifies

PLANNED — BL-064

📡 Update Agent

Research — daily AI model releases, tools, security advisories

Scheduled Task (TBD)

Scope: Monitors Claude/Gemini/OpenAI releases · ingests via /ingest workflow · runs overnight daily

Planned: web-browsing capable model (TBD)

No direct file writes — ingest pipeline only

ACTIVE — BL-079

👔 Persona Agent

📄 docs/personas/persona_agent.md

Agent HR — performance reviews, persona governance, model evaluation

Claude Code Scheduled Task

Tier: CHRO — governs all agent personas

Schedule: Daily roadmap scan + dashboard sync · Staggered reviews Mon–Thu · Weekly report Fri · Model eval Sun

Outputs: persona_reviews/ · persona_reports/ · persona_dashboard.json · agent_performance_registry.json

Scope: Sole write authority over docs/personas/ · Performance KPIs · Value-per-cost benchmarking · Model selection advisory (joint with CA)

Current: claude-opus-4-6

Writes docs/personas/ · .artifacts/persona_* · reads all agent data sources

ACTIVE — BL-207 / BL-256

🔧 Repair Agent

📄 docs/personas/repair_agent.md

Rapid-response SRE — autonomous diagnosis, playbook repair, quality scoring, preventive proposals

Claude Code Scheduled (30min) + On-demand

Tier: Operational — system health, not feature delivery

Schedule: 30-min diagnostic scan · SMA heartbeat dispatch · Telegram repair-scan · Any agent request

Outputs: repair_log.jsonl · repair_patterns.json · repair_quality_scores.json · repair_proposals/

Scope: OCAV control loop (Observe-Compare-Act-Verify) · 5 root-cause cluster playbooks · Durability scoring · 3-strike recurrence rule · Preventive proposal generation

Primary KPI: System velocity (>= 95% autonomous operation) · Secondary: Blocker repair count (quality > quantity)

Current: claude-sonnet-4-6 · Fallback: claude-haiku-4-5

Autonomous LOW playbook repairs · RA + Governance for MEDIUM+ · Never writes to .river/ or docs/personas/

ACTIVE — SMA

🩺 Stability Monitor Agent

📄 docs/personas/stability_monitor_agent.md

Stability guardian — detects failures, diagnoses root causes, coordinates recovery

WinSW Service (2-min reconciliation loop)

Tier: Operational — autonomous for stability-domain remediations, governed otherwise

Scope: 10 stability signals · composite stability scoring · known-safe remediation · REP delegation · state-transition Telegram alerting

Model: stability_monitor.primary (model_config.json)

Stability-domain remediation autonomous · escalates out-of-domain to RA + Governance

ACTIVE — SSA

⚓ Stability Steward Agent

📄 docs/personas/stability_steward_agent.md

Operational steward — drives the autonomous sprint lifecycle to clean close

Headless steward (standing operator authorization)

Tier: Operational — keeps the autonomous build system afloat and converging on clean sprints

Scope: Recover orphaned sprints · sprint approvals under standing authorization · clean-streak accounting · intervention ledger

Model: stability_steward.primary (model_config.json)

Executes recovery + sprint approvals under standing operator authorization · sends SSA-titled FYI Telegram on use

ACTIVE — PSA (advisory)

🧭 Process Steward Agent

📄 docs/personas/process_steward_agent.md

Process steward — advisory reviews of process health and adherence

PER-approved advisory use

Tier: Advisory only — non-gating, non-authoritative for runtime control

Scope: Process adherence review · advisory recommendations · no sprint or execution gating

Model: process_steward.primary (model_config.json)

Advisory only — produces recommendations, never gates sprints or execution

PLANNED — BL-134 · M4 Sprint 17+

⚡ Efficiency Expert Agent

Optimization — identifies redundant work, token waste, and workflow bottlenecks

Claude Code Agent Teams (TBD)

Scope: Research phase — BL-134 covers exploration. Will analyze sprint logs, token usage, and task durations to surface optimization opportunities. Mandatory CA involvement before design gate.

Planned model: TBD — pending CA model selection advisory

Observer only — outputs recommendations, no writes

PLANNED — M5 Fleet Scale

🌐 A2A Protocol Research

Infrastructure — Agent-to-Agent direct communication (replaces River file exchange)

Multi-environment (TBD)

Scope: M5 milestone item. When A2A support becomes available in Claude Code + Antigravity, replaces River trigger mechanism with lower-latency direct agent signaling. Currently in research; no sprint assigned.

Planned model: TBD — depends on A2A protocol adoption

Architecture upgrade — not yet an independent agent

Legend: Active (solid border) · In Development (dashed) · Planned (dotted, greyed)

🔄 AGENT LIFECYCLE STATE MACHINE (Managed by Persona Agent)

📡 Activity Stream

Activity Stream merges all agent events in chronological order: Governance decisions, Risk Assessor classifications, Builder Agent state changes, and scheduled task runs. This is a composite view, not limited to agent_state.jsonl writes. Genuine idle means no events in any source for more than 30 min (BL-088). Events are color-coded by agent type (BL-129).

Legend: ⚖️ Governance · 🛡️ Risk Assessor · 🔨 Builder / Agent State · ⚙️ Tasks / System · 🔧 Repair Agent · 🚨 HIGH Escalation

Last activity: — —

Time | Source | Event

Loading activity...
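
The composite view and the 30-minute idle rule described for the Activity Stream can be sketched as a merge of time-sorted event lists. Representing an event as a `(timestamp, source, text)` tuple is an assumption for illustration; the dashboard's real event records are not shown here.

```python
import heapq
from datetime import datetime, timedelta

IDLE_AFTER = timedelta(minutes=30)  # "no events in any source for >30 min"


def merge_streams(*sources):
    """Merge per-source event lists into one chronological list.

    Each source must already be sorted by timestamp (element 0 of each tuple).
    """
    return list(heapq.merge(*sources, key=lambda event: event[0]))


def is_idle(events, now: datetime) -> bool:
    """Genuine idle: the newest event across all sources is older than 30 minutes."""
    if not events:
        return True
    newest = max(event[0] for event in events)
    return now - newest > IDLE_AFTER
```

Because idleness is judged on the merged stream, a quiet agent_state.jsonl alone does not count as idle while governance or task events keep arriving.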

🛡️ Risk Assessor — Classifications

Recent Classifications

⚖️ Governance — Decision History

Recent Decisions

Claude — Model Rate Limits

Builder Default: claude-sonnet-4-6 (all standard tasks)
Builder Complex: claude-opus-4-6 (high-stakes / complex tasks)
Reset Window: 5 hrs (rolling usage cycle)
Ceiling Visibility: BL-086 (live API integration backlogged)

Claude Code limits are subscription-based and not exposed via API. To check current headroom, see platform.claude.com/usage. Opus invocations count roughly 5× as much against the budget as Sonnet invocations, so reserve Opus for tasks that genuinely require it. Live API integration is backlogged as BL-086.
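
The approximate 5× weighting can be turned into a rough budget estimate. The sketch below treats one Sonnet invocation as one budget unit, which is purely an illustrative convention; the real subscription accounting is not exposed, and the 5× figure is itself approximate.

```python
OPUS_WEIGHT = 5.0    # "~5x" from the note above; approximate
SONNET_WEIGHT = 1.0  # illustrative unit: one Sonnet invocation = 1


def budget_units(sonnet_calls: int, opus_calls: int) -> float:
    """Estimate the relative budget consumed within one rolling window."""
    return sonnet_calls * SONNET_WEIGHT + opus_calls * OPUS_WEIGHT


def opus_share(sonnet_calls: int, opus_calls: int) -> float:
    """Fraction of the estimated budget attributable to Opus invocations."""
    total = budget_units(sonnet_calls, opus_calls)
    return 0.0 if total == 0 else (opus_calls * OPUS_WEIGHT) / total
```

For example, 40 Sonnet calls plus 4 Opus calls come to 60 units under this weighting, a third of which is Opus even though Opus made up under a tenth of the invocations.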

Gemini Governance — Usage & Quota (BL-086)

Governance model: — | Billing: PAYG (BL-083 ✅)

Today's governance calls —

Reference: ~50 calls/day = light sprint · ~200 = heavy sprint · PAYG so no hard ceiling

Rate limit hit rate (recent 50 calls) —

PAYG: rate limits are per-minute (RPM), not a daily quota. Transient hits resolve automatically.
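
The "rate limit hit rate (recent 50 calls)" figure is a simple windowed ratio. The sketch below assumes each call record is a dict with a boolean `rate_limited` flag; that field name is hypothetical, since the usage log format is not shown on this page.

```python
def rate_limit_hit_rate(calls: list, window: int = 50) -> float:
    """Share of the most recent `window` calls that hit a rate limit.

    `calls` is ordered oldest to newest. Returns 0.0 when there are no calls.
    """
    recent = calls[-window:]
    if not recent:
        return 0.0
    hits = sum(1 for call in recent if call.get("rate_limited"))  # hypothetical flag
    return hits / len(recent)
```

Because limits are per-minute, a brief burst can raise this ratio without indicating a sustained problem.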

Calls Today: — (governance evals)
Avg Latency: — (per review, recent 50)
Max Latency: — (worst observed)
Rate Limit Hits: — (quota exceeded)

Recent Calls

No usage data yet.

🎯 Consultant Agent — Activity & Usage

Live CA activity and usage, read directly from the canonical .artifacts/ca_metrics.json, which update_ca_metrics() in scripts/ca_agent.py writes on every CA engagement. There is no parallel state: rolling counts come from the artifact's summary block, and the per-engagement list comes from its entries. BL-099.

Source: canonical .artifacts/ca_metrics.json | Last engagement: —

Total Invocations: — (all CA engagements)
Advisory Engagements: — (last: —)
Ingest Cycles: — (last: —)
Engagements Today: — (since 00:00 UTC)

Recent Engagements

No CA activity yet.

Model Inventory — Last Audit

Daily probe of every model assignment in model_config.json. All roles should show OK. FAIL means the model is unreachable; check quota, availability, or auth. Run python scripts/model_audit.py to refresh.

No audit data for today. Run: python scripts/model_audit.py

🔌 Provider Health & Fallback Chains

Loading provider health...

⚡ Circuit Breaker States

Loading circuit breaker states...
