Pantheon vs Co-Scientist — fleet redesign journey

Live record of fleet-redesign milestones. Each entry is a versioned step with deliverable, status and references. New steps append here as we ship them — this frame grows.

2026-05-29 · §1–§8 of this pageshipped

Co-Scientist gap analysis · v1

Read Google DeepMind's Co-Scientist (Nature 2026-05-19); mapped its seven specialised agents onto our six ministers; identified five structural gaps (no Generation phase, no Proximity, no Tournament, no Evolution, no Meta-review) + two operational gaps (HEPH-as-decider rigidity, IDENTITY emptiness) + six shared strengths. First proposal of a 9-step inception protocol.

page: analysis.goldnetgroup.com.au §1–§8

2026-05-29 · §2 live-flow diagramshipped

Live transaction-flow diagram added

SVG of the actual wiring: Minister · OpenClaw → nginx /llm/ → inference_proxy :8899 (auth · auto-recall · tier-force · dispatch) → claude-relay :8896 tier cascade → upstream LLMs. Plus the orthogonal critic side-channel via PC chatgpt-bridge :4242 over WireGuard. Live ports verified via ss -lntp.

page: analysis.goldnetgroup.com.au §2

2026-05-29 · v1 draftshipped

Static SOULs + dynamic project-contexts — first draft

First proposal: SOUL stays role + discipline; add per-project context.md, silent pre-injection, mtime-based staleness, "one project per turn", channel-only routing. Sent for adversarial critique.

file: outputs/static-soul-dynamic-projects-draft.md

2026-05-29 · gpt-5.5 via PC bridgeshipped

Critique pass #1 — 50 findings, no "Approach sound"

Routed through PC chatgpt-bridge per canon:rule-critic-route-is-chatgpt-pc-bridge-only-2026-05-25. Hardest hits: silent pre-injection treated as truth → overconfident posts; mtime is operational not epistemic; "one project per turn" breaks real orchestration; lane reminders inside context = shadow SOUL. Eight hard requirements set.

2026-05-29 · v2 revisedshipped

SOUL + context layer v2 — 8 requirements adopted

Visible preamble · project-detection states · hard stop on stale/missing/conflicting · immutable per-task snapshot · explicit source hierarchy + conflict protocol · project-scoped recall · strict YAML schema with provenance + expiry · gateway-level enforcement (not minister markdown). Dropped lane reminders, success heuristics, domain experts from context (shadow-SOUL risk).

file: outputs/static-soul-dynamic-projects-revised.md

2026-05-30 · canon stamped + SPINE createdshipped

SOUL+context v2 bound to canon + SPINE

Architectural reference card committed; SPINE seeded with 11 nodes across identity / context-layer / inception-protocol / tournament-infra / meta-review stages.

canon: arch-static-soul-dynamic-project-context-v2-2026-05-30 (sha 1ef86a4, 6619 B) · SPINE: /home/ubuntu/api/spines/fleet-redesign-2026-05-30.json

2026-05-30 · integrated plan v1shipped

Synthesis — four layers + 9-step inception protocol interlocked

Combined the v2 context layer with the Co-Scientist 9-step inception (Generate → Cluster → Reflect → Tournament → Evolve → Expert wrap-up → HEPH dispatch → Ratify + meta-review). Explicit reads/writes per step per layer.

file: outputs/fleet-integrated-plan-v1.md

2026-05-30 · gpt-5.5 via PC bridgeshipped

Critique pass #2 — 35 findings, "directionally strong but not yet integration-safe"

Hardest hits: precedence ambiguity, ATHENA role concentration (player + referee + court of appeal), gpt-5-only critic centralisation, ceremonial hashes (decorative without server-side verification), task-snapshot rigidity over sprint duration, "one project per turn" still breaks. Six load-bearing fixes required.

2026-05-30 · integrated plan v2shipped

Integrated plan v2 — six load-bearing fixes locked

(1) Explicit 8-level precedence lattice (must_not beats persona). (2) Mechanically verified preamble at every endpoint (server-side hash check, not text trust). (3) Mid-sprint rebase protocol (drift score ≥ 3 → worker requests → HEPH approves → ATHENA ratifies). (4) Critic pluralism — two model families per Gate A. (5) ATHENA-as-minister separated from ATHENA-as-protocol-function. (6) Per-domain scoring rubrics (3–5 dimensions, weighted-mean rank). Plus 22 smaller fixes. SPINE bumped to v0.2.0.

canon: arch-fleet-integrated-plan-v2-2026-05-30 (sha 144e4ce, 7440 B) · SPINE: 24 nodes / 19 blocking gaps · project_v2: fleet-redesign-2026-05-30 (eca099fa) · file: outputs/fleet-integrated-plan-v2.md

2026-05-30 → 2026-05-31 · Weeks 0–2 of 4-week planshipped

Weeks 0–2 LANDED — identity, schemas, gateway, calibration

Five Sabour-authored IDENTITY.md persona seeds. Theo SOUL personalised (θεωρός / 🔭). Fleet PM authority scoped build-phase only. Precedence-lattice canon + supersession protocol + channel-routing map + context.yaml schema locked. Tournament-rubric + meta-review schemas + 20+ structured monitoring event types. Minister-edge gateway live at 127.0.0.1:8911 — capability tokens + forward proxy + read-side provenance. Calibration suite scaled 17 → 24 → 32 / 32 PASS HARD gate. OpenClaw proxy.enabled flipped all 5 ministers + gaios_exec HTTPS_PROXY.

canon: fleet-layer-precedence-2026-05-30 · fleet-canon-supersession-rule-2026-05-30 · fleet-channel-routing-map-2026-05-30 · arch-gateway-minister-edge-egress-2026-05-30 · capability_gateway.py @ 127.0.0.1:8911

2026-05-31 · Phase 5 ENFORCE prep, Step 2 LANDEDshipped

Gateway preamble auto-construct + MONITOR orchestration

A.1 canon arch-gateway-preamble-auto-construct-2026-05-31 stamped + 3 critic-driven amendments (R5 reload→restart + sha16; R1 conjunctive→header-authoritative; R3 CONNECT leak fix; R9 added). R5 admin auth on __gateway/mode LANDED (bearer + sha16-redaction + 3-state audit). A.2a helpers + A.2b orchestration block in handle_client + 7 new event types + correlation fields + 9 tests — 32 / 32 calibration PASS. 24h MONITOR soak began.

canon: arch-gateway-preamble-auto-construct-2026-05-31 (commit trail 2561d254 → 831e52c5 → 52f05611 → 1b50654a). File: capability_gateway.py, gateway_caches.py, gateway_auto_construct.py.

2026-05-31 · 22:00 AEST → 2026-06-01 · 00:30 AESTshipped

Marathon — trading-platform recovery + SOUL hardening + ARGUS rollback

Sabour flagged two trading-platform incidents (ministers reading only when @-mentioned; HEPH DM'd Apollo bot user-id instead of group). Diagnosis + critic-vet + 5-SOUL primary-addressee patch + canonical FLEET-ROSTER artifact + canon stamps (rule-named-address-hardened-2026-05-31 + rule-no-config-churn-during-coordination-tests-2026-05-30). ARGUS made unauthorised edits before STOP → rollback playbook executed via temp canon:argus-soul-hardening-2026-05-31. Apollo isolated-polling lane-stall recovered (evidence-preserving quarantine + restart). PCS Heartbeat v2 spec critic-vetted GO; pcs_heartbeat.py + state file + systemd units + calibration test #34 SHIPPED; DRY_RUN soak began.

canon: rule-named-address-hardened-2026-05-31 · fact-openclaw-isolated-polling-lane-stall-2026-05-31 · arch-pcs-heartbeat-v2-2026-05-31. Sessions: 58fbae1c → c85024a6.

2026-06-01 · early morning · trading-platform incidentshipped

Apollo doctrine wave — 3 canons + 5 SOUL updates + controlled HEPH restart

Apollo declared PnL dashboard DONE ✅ HTTP 200 at 23:26; live site was 502 (Streamlit crash-loop 20× due to IndentationError in his code). HEPH dispatched fix to PROM; ATHENA self-appointed QA with the canonical HTTP 200 ≠ ratified gate; HEPH ratified. Critic-vetted GO-WITH-CHANGES on 6 items. 3 new canons stamped: rule-runtime-dependency-coordination-2026-06-01 (dep coordination before DONE), rule-rendered-surface-acceptance-2026-06-01 (HTTP 200 ≠ ratified; promotes provisional #126; ATHENA authored), fact-openclaw-sendmessage-fallback-defect-2026-06-01 (canned-fallback delivery defect). 5 SOULs updated: HEPH/ATH/APO/PROM/THEO got rules 7–8 (dep-manifest + rendered-verification + dep-evidence + dispatch routing). Controlled HEPH restart verified runtime SOUL pickup (rules 7+8 visible inside container).

canon: rule-runtime-dependency-coordination-2026-06-01 · rule-rendered-surface-acceptance-2026-06-01 · fact-openclaw-sendmessage-fallback-defect-2026-06-01. SOUL backups: /opt/gaios/backups/task-apollo-incident-soul-edits-20260601T010907Z/. Session: d398b97d.

2026-06-01 · 00:06 UTC · PCS Heartbeat v2 LIVE flipshipped

PCS Heartbeat v2 — DRY_RUN flipped after 16.2h soak (critic GO-WITH-CHANGES)

3 pre-flip blockers executed: (a) canon:fact-pcs-bootstrap-red-artifacts-2026-06-01 stamped (4 historical RED events preserved as pre-correction bootstrap artifacts), (b) DRY_RUN=true restart smoke test (state file held dedupe across restart), (c) phase_map re-confirmed (5/5 ministers in watch/inception → no RED possible). PCS_DRY_RUN=false applied; verification tick confirmed dry_run=0 in payload, severities held, zero pager emissions. Arch canon status → ACTIVE (LIVE). HEPH idle-detection productionized — closes the "minister idle, nothing pushing" gap from 2026-05-31.

canon: arch-pcs-heartbeat-v2-2026-05-31 (LIVE) · fact-pcs-bootstrap-red-artifacts-2026-06-01. Backups: /opt/gaios/backups/task-pcs-dryrun-flip-20260601T000525Z/. Session: c85024a6.

2026-06-01 · morning · fleet hygiene + canon tombstonesshipped

47-file fleet shadow purge + ARGUS canon tombstoned + 5-minister SOUL refresh

Fleet-wide sweep surfaced 47 files violating canon:rule-no-shadow-files-in-agent-state-2026-05-28 (across HEPH/ATH/APO/PROM workspaces; THEO clean, post-rule). Per-minister .tgz archives + hard-delete in workspace. canon:argus-soul-hardening-2026-05-31 tombstoned (FULFILLED — 26h after stamp; ARGUS held the hardening stably). Plain-name recognition fix applied to all 5 SOULs (HEPH/HEPHAESTUS/ATHENA/APOLLO/PROMETHEUS/PROM/THEO/ARGUS now trigger primary-addressee). Tool-body discipline added to 5 SOULs (rule 6: no planning narration in message body). All 5 ministers restarted to pick up new SOUL doctrine in runtime.

canon: argus-soul-hardening-2026-05-31 (FULFILLED). Backups: /opt/gaios/backups/task-fleet-shadow-purge-20260601T001436Z/ · /opt/gaios/backups/task-soul-name-recognition-20260531T235442Z/.

2026-06-01 · noon · PCS v3 Week 3 unblockshipped

T1 cache_unready FIXED · T5 spine versions CLOSED · R9 mechanism PIVOTED (port-per-minister)

T1 (CRITICAL FIX): MinistersCache._load() was reading flat dict; MINISTERS.json had been wrapped in _doc/_updated/ministers/... envelope on 2026-05-30 when grotto-suite shipped → loader hit AttributeError: 'str' object has no attribute 'get' on the doc string. 24h MONITOR soak had been contaminated (cache never loaded; would have 503'd every send under ENFORCE). One-line fix: data.get('ministers', data) before iterate. Cache_unready stopped firing post-restart. T5: 3 spines got version: "1.0.0" + _SPINE_FIELD_MAP extended → SpineStateCache now reads all 4. T3 (R9 pivot): live test on HEPH revealed OpenClaw v2026.5.26 schema rejects proxy.headers (only enabled/proxyUrl/tls/loopbackMode allowed). HEPH rolled back clean. Critic-vetted Option E: port-per-minister at gateway (8911=theo, 8912=HEPH, 8913=ATH, 8914=APO, 8915=PROM), new confidence value port_authoritative, fail-closed mismatch. Canon amendment stamped.

canon: amend-r9-port-per-minister-2026-06-01. Backups: task-pcs-T1-cache-loader-fix-20260601T012315Z/ · task-pcs-T3-r9-20260601T012800Z/ · task-pcs-T5-spine-version-20260601T013217Z/. Session: 3ffbd288.

2026-06-01 · early afternoon · R9 port-per-minister IMPLEMENTATIONshipped

R9 Steps 1–10 LANDED — port-authoritative identity LIVE

Per canon:amend-r9-port-per-minister-2026-06-01: gateway opens 5 listeners on 127.0.0.1 (8911=theo, 8912=hephaestus, 8913=athena, 8914=apollo, 8915=prometheus). Each minister's openclaw.json proxyUrl points to its dedicated port → identity_inferred attributes via port_authoritative confidence. Step 6 mismatch detection: new event identity_port_header_mismatch fires when caller header conflicts with port (verified live via adversarial probe). Step 10: 3 new calibration tests (port_authoritative attribution / mismatch wiring / 5-rule cache validation). Modernised T16+T17 (post-SOUL-trim markers). Suite 37/37 PASS — HARD GATE PASS. Bug caught mid-flight: Step 5 PYEOF write-miss → silent TypeError → only egress_violation emitted; detected via fleet_events probe.

canon: amend-r9-port-per-minister-2026-06-01 (progress section appended) · backups: task-r9-step{1..10}-*. Sessions 4424a862 → b00a570c.

2026-06-01 · 02:28:39Z · Phase 5 ENFORCE FLIPPEDshipped

ENFORCE LIVE — closes task #117

MODE=monitor → MODE=enforce via R5-authed POST to __gateway/mode. admin_mode_change emitted (bearer:93e4a8bf595cff4a, client_ip=127.0.0.1, success=true). Gate semantics (Option A-prime, gpt-5.5 critic GO-WITH-CHANGES with 5 blockers folded): accept (a) explicit valid X-Fleet-Preamble OR (b) auto-construct succeeds + identity_confidence ∈ {port_authoritative, header_confirmed} OR (c) CONNECT to allowed host with identity_confidence. EXCLUDED from auto-accept: token_informational (shared bot token across 4 v2 ministers). Critic-anticipated sneak path #2 caught + fixed live: CONNECT lacks chat_id by R3 design (TLS hides it); host-allowlist carve-out added. Initial host allowlist (2 hosts only — strict R4 reading).

canon: arch-gateway-preamble-auto-construct-2026-05-31 (Phase 5 ENFORCE LIVE section appended) · session 3143fe10.

2026-06-01 · 02:30 → 04:15 · ATHENA crash-loop discoveryshipped

Post-flip side-effects mapped — narrow R4 caused fleet-wide LLM blocking

Sabour flagged ATHENA web-console “Unauthorized” + HEPH dispatch (msg-3506 to Apollo for PnL-over-time plot) no progress. Investigation revealed: ATHENA crash-looped 427× over ~1h 45min on boot-time fetches to openrouter.ai + raw.githubusercontent.com (eventually stabilised when pricing-fetch became non-blocking). Apollo's anthropic LLM call failed at 02:21:29Z — that's 7 min BEFORE flip (unrelated network blip, but any retry would 407 under ENFORCE). Full host inventory mapped: api.anthropic.com + api.minimax.io + 127.0.0.1 (inference_proxy + claude-relay) + openrouter.ai + raw.githubusercontent.com + chatgpt.com + github.com all rejected. Original R4 (only TG + Pantheon Room) was written BEFORE cascade was wired — architecturally incomplete.

Sabour directive: incremental allowlist expansion. Hosts added: api.anthropic.com, api.minimax.io, openrouter.ai, raw.githubusercontent.com, chatgpt.com, github.com. Backup: task-enforce-allowlist-batch-*. Session 8730b9c9.

2026-06-01 · 04:30Z · Wildcard CONNECT pivot — R4 amendedshipped

Gateway pivots to observability — R4 firewalling dropped, ARGUS gets watcher role

Sabour directive: “the ministers will do web search, other api servers like elevenlabs, and manual tests that i will ask them. so accept all traffic using wildcard, and update all the plan to monitor access only. it's argus's task to flag suspicious activities/issues”. Wildcard CONNECT: any host accepted when identity_confidence ∈ {port_authoritative, header_confirmed}. Then extended to wildcard ALL METHODS (~04:56Z) via _is_identity_trusted path. R4 in canon amended: wildcard substring matching now permitted under identity-trusted model. R6 mismatch DETECTION still active — verified live: port 8912 + claim athena → mismatch event fires, gateway passes through (TG returns 401 to fake token), event in fleet_events for ARGUS. ARGUS Panoptes role expanded (canon:argus-myclaw): egress anomaly watcher — periodic fleet_events scan, DM Sabour on suspicious patterns.

canon: arch-gateway-preamble-auto-construct-2026-05-31 §R4 amended · argus-myclaw §all-seeing added. Backups: task-enforce-wildcard-pivot-* + task-wildcard-allmethods-*.

2026-06-01 · 05:00Z · Gateway = identity-trusted observability layercurrent

Functional split: gateway loses ONLY firewalling+enforcement, retains 11 other functions

Direct answer to Sabour's wildcard question: by going observatory, the gateway loses ONLY two functions (R4 destination filter + 407 ENFORCE rejection). Retained 11 functions: (1) Identity attribution R1+R9 port_authoritative · (2) Channel routing R2 · (3) Intent inference R4 classifier · (4) Preamble auto-construct A.2b · (5) R3 no-leak (TLS body sanitisation) · (6) R6 mismatch DETECTION (still emits events) · (7) R8 caller-preamble verification (still emits) · (8) R7 cache LKG · (9) R5 admin auth on __gateway/mode · (10) Audit emission (every request → fleet_events) · (11) Transport (HTTP/CONNECT proxy). Becomes: non-bypassable identity-attribution + audit chokepoint, the substrate ARGUS reads for anomaly flagging.

ARGUS observability impl PENDING (scheduled fleet_events scanner + 24h rolling baseline + 3σ volume detection + new-host flag + off-hours DM).

Week 0 · 2026-05-30 → 31shipped

Identity unblock + Week-0 canon locks (LANDED)

Identity unblock + Week-0 canon locks

Sabour fills five IDENTITY.md persona seeds (HEPH / ATHENA / PROM / APOLLO / Theo). Theo SOUL personalised + emoji decided. Every minister's "Fleet PM authority" block scoped to build phase only. Channel-routing canon authored. Precedence-lattice canon stamped. context.yaml schema locked.

Week 1 · 2026-05-30shipped

Schemas + structured monitoring (LANDED)

Tournament rubric schema (per-domain dimensions + weights). 5-field meta-review schema. Structured monitoring on inference_proxy + Pantheon Room — nine log event types reviewed daily.

Week 2 · 2026-05-30 → 31shipped

Context infrastructure + adversarial calibration (HARD gate, LANDED 32/32)

Gateway pre-injection hook with mechanical preamble verification. Recall namespace filter on recall_lib. Rebase cron. Five-test adversarial calibration suite: seeded contradiction · stale-context · namespace-leak · persona-portability · tournament-manipulation. All five MUST pass before any project runs the new protocol.

Week 3 · IN FLIGHT (Phase 5 ENFORCE prep)current

Tournament infra + protocol codified — partial; Phase 5 ENFORCE blocked on R9 implementation

Tournament endpoint + protocol codification still planned. Phase 5 ENFORCE transition: Steps 1+2 LANDED, Step 3 (A.3 explicit X-Fleet-Preamble emission + R9 port-per-minister) IN FLIGHT, Step 4 (live black-box #18 — now expanded to include port-attribution matrix per critic) PENDING, Step 5 (MONITOR→ENFORCE flip + 1h watch + 24h soak) PENDING. Today's R9 design pivot from proxy.headers (OpenClaw schema rejects) to port-per-minister at gateway: critic-vetted GO-WITH-CHANGES, canon amended, implementation queued.

Week 4planned

Four-project parallel pilot

Pilot on four archetypes simultaneously to expose layer conflicts: Trading Platform (engineering-heavy) · VicCrashRisk (governance + frontend mixed) · Pantheon Room infra iteration (ambiguous) · CallBridge AU iteration (medium-mixed). Single-project pilot was rejected as too narrow by critique #2.

Futureextensible

Subsequent fleet upgrades append here

This frame grows as we ship new milestones. Every future architectural step (new minister roles, protocol amendments, infrastructure migrations, critic-family rotations, schema bumps) appends as a new entry with date, deliverable, status, and canon / SPINE / project references.

Methodology note. Every milestone marked "shipped" passed through the two-critique-pass design discipline: draft → gpt-5.5 critique via PC chatgpt-bridge → revise. The bridge is the canonical critic route per canon:rule-critic-route-is-chatgpt-pc-bridge-only-2026-05-25. This is the methodology we want every future fleet upgrade to follow.

Minister	LLM	Role	Body
HEPHAESTUS PM	MiniMax-M2.7 / Sonnet fallback	Coordinate, dispatch, gate, close — PM-pure	VPS container · `hephaestus.goldnetgroup.com.au`
ATHENA ratifier	Claude Opus 4.7 (cost-gated)	Governance, strategy, canon ratification	VPS container · `athena.goldnetgroup.com.au`
PROMETHEUS engineer	MiniMax-M2.7 (thinking)	Hard engineering, long-context reasoning	VPS container · `prometheus.goldnetgroup.com.au`
APOLLO renderer	MiniMax / Sonnet cascade	Frontend, UX, render-anchored delivery	VPS container · `apollo.goldnetgroup.com.au`
Theo outside voice	MiniMax direct (no relay)	Market-watch, sanity-check, outside-in lens	VPS container · `theo.goldnetgroup.com.au`
ARGUS emergency	MiniMax (MyClaw seat)	Passive Emergency Officer — alerts to Sabour DM only	MyClaw cloud · @SabClawBot_bot

Minister	SOUL.md	IDENTITY.md	Voice professionalism	Historical conflict
HEPHAESTUS	filled · 145 lines · PM-pure	default template	Defensive-heavy — silence discipline, REDIRECT scripts. Personal voice nil.	Resolved 2026-05-18: was Master Craftsman + PM + Builder. Fixed to PM-only.
ATHENA	filled · 119 lines	default template	Governance voice clear. Personal voice nil.	Token-rotation 2026-05-21, resolved.
PROMETHEUS	filled · 126 lines	default template	Deep-engineering voice clear. Heavily anti-loop. Personal voice nil.	Resolved 2026-05-19: 89 turns signed "— ATHENA ⚖️". Hard rule installed.
APOLLO	filled · 88 lines	default template	Renderer voice clear. Personal voice nil.	Resolved 2026-05-27: off-role engineering critique on PROM's lane.
Theo	default openclaw template	default template	Persona only via TOOLS.md and MEMORY.md. Fragile.	Signature emoji ⚖️ collides with ATHENA.
ARGUS	ALERT ROUTING prepended 2026-05-28	n/a (MyClaw seat)	EO voice clear post-fix.	Resolved 2026-05-28: broadcasting to Pantheon despite directive.

Area	Today	Co-Scientist equivalent	Change to apply
Generation phase	One owner proposes one approach.	Generation agent.	NEW Step 2 — each minister 2-3 candidates, parallel.
Proximity / clustering	None.	Proximity agent.	NEW Step 3 — Cluster skill + protected singletons.
Reflection	Critique-bracket ad-hoc.	Reflection agent.	UPGRADE Mandatory two-critic Step 4.
Tournament	PM / Sabour picks.	Ranking agent · Elo.	NEW Step 5 — per-dimension Elo, ATHENA mechanical aggregator.
Evolution	None.	Evolution agent.	NEW Step 6 — merge top 2-3, frozen snapshot.
Meta-review	Ad-hoc retros.	Meta-review agent.	UPGRADE Typed 5-field canon card per close.
Supervisor	HEPH rigid PM.	Adaptive planner.	UPGRADE Cowork-Claude is inception supervisor; HEPH is build PM only.
Multi-LLM diversity	Opus / MiniMax / Sonnet / Direct / gpt-5.	Single Gemini family.	KEEP our advantage.
SOUL.md	Filled for 4 of 5.	n/a (stateless).	KEEP + scope PM-wait to build only.
IDENTITY.md	Empty default for all 5 VPS.	n/a	FILL NOW Sabour writes persona seed.
Theo identity	SOUL is default template.	n/a	FILL NOW author proper SOUL + emoji.
Project-context layer	None — only vector recall (similar, not authoritative).	n/a (project = paper).	NEW per-project context.yaml with schema + provenance + expiry.
Precedence lattice	Implicit / inconsistent.	n/a	NEW 8 levels, must_not beats persona, machine-checked.
Critic centralisation	One gpt-5 oracle.	n/a	UPGRADE Two critics, different families; disagreement = signal.
ATHENA role concentration	Voter + aggregator + ratifier.	n/a	UPGRADE Pantheon Room aggregates mechanically; ATHENA discloses voting.
Verification depth	Render + watchdog + SPINE.	Most compute on hypothesis verification.	EXTEND Hypothesis-verification gate before Step 8 via context.yaml.must_not.

Layer	Role	Lifetime
`SOUL.md`	role + discipline	eternal (overrideable only via canon supersession)
`IDENTITY.md`	persona	eternal (Sabour-seeded Week 0)
`projects/<id>/context.yaml`	vocab + invariants + must_not + verification + gaps	versioned per-class
task snapshot	hash-pinned frozen view	immutable until rebase

0. The path — journey so far + what's next

Co-Scientist gap analysis · v1

Live transaction-flow diagram added

Static SOULs + dynamic project-contexts — first draft

Critique pass #1 — 50 findings, no "Approach sound"

SOUL + context layer v2 — 8 requirements adopted

SOUL+context v2 bound to canon + SPINE

Synthesis — four layers + 9-step inception protocol interlocked

Critique pass #2 — 35 findings, "directionally strong but not yet integration-safe"

Integrated plan v2 — six load-bearing fixes locked

Weeks 0–2 LANDED — identity, schemas, gateway, calibration

Gateway preamble auto-construct + MONITOR orchestration

Marathon — trading-platform recovery + SOUL hardening + ARGUS rollback

Apollo doctrine wave — 3 canons + 5 SOUL updates + controlled HEPH restart

PCS Heartbeat v2 — DRY_RUN flipped after 16.2h soak (critic GO-WITH-CHANGES)

47-file fleet shadow purge + ARGUS canon tombstoned + 5-minister SOUL refresh

T1 cache_unready FIXED · T5 spine versions CLOSED · R9 mechanism PIVOTED (port-per-minister)

R9 Steps 1–10 LANDED — port-authoritative identity LIVE

ENFORCE LIVE — closes task #117

Post-flip side-effects mapped — narrow R4 caused fleet-wide LLM blocking

Gateway pivots to observability — R4 firewalling dropped, ARGUS gets watcher role

Functional split: gateway loses ONLY firewalling+enforcement, retains 11 other functions

Identity unblock + Week-0 canon locks (LANDED)

Identity unblock + Week-0 canon locks

Schemas + structured monitoring (LANDED)

Context infrastructure + adversarial calibration (HARD gate, LANDED 32/32)

Tournament infra + protocol codified — partial; Phase 5 ENFORCE blocked on R9 implementation

Four-project parallel pilot

Subsequent fleet upgrades append here

1. What Co-Scientist actually is

Generation agent

Proximity agent

Reflection agent

Ranking agent

Evolution agent

Meta-review agent

2. What we actually have

Live transaction flow — what happens between prompt and reply

3. What we already share with Co-Scientist

Multi-agent specialisation

Different model voices

Grounding in literature and data

Adversarial critique

Persistent identity + memory

Adaptive planning lives here too

4. The gaps that matter

5. Identity audit — character, conflicts, professionalism

6. Proposed inception phase design

7. Summary table — gaps and changes

8. Sequencing — what to do first

9. PCS — Pantheon Co-Scientist v3 (where we landed, 2026-05-31)

Four-layer stack (precedence-lattice locked, machine-enforced)

Six load-bearing fixes (post critic round 2)

What's LANDED (and where)

Pantheon Room proxy — bidirectional observability layer

Self-healing / auto-recovery layer