mirror of https://github.com/azaion/gps-denied-onboard.git synced 2026-06-21 05:51:13 +00:00

Files

T

Oleksandr Bezdieniezhnykh aa8b9f2ee9 [AZ-899] [AZ-900] [AZ-901] Baseline doc + retro gate + EVIDENCE_OUT fix

AZ-899: create _docs/02_document/architecture_compliance_baseline.md
seeded with 0 violations and the 2026-05-20 structural snapshot facts
(15 inventory entries, 0 import cycles, 5 contract files). Documents
the append-on-violation / mark-resolved-on-fix / snapshot-refresh
protocol so cumulative reviews can emit Baseline Delta sections.
Closes cycle-1 retro Top-3 #3 (third attempt).

AZ-900: codify LESSONS 2026-05-26 [process] in
.cursor/skills/autodev/flows/existing-code.md - Re-Entry After
Completion now hosts a Previous-Cycle Retro Existence Gate that
BLOCKS the cycle increment if no _docs/06_metrics/retro_*.md file
dated within [cycle_start, cycle_end] exists. Skipped on
state.cycle == 1. Presents Choose A (author retro) / B (stub +
leftover) / C (abort). state.md - Session Boundaries gains a
cross-reference bullet.

AZ-901: fix e2e/runner/conftest.py:56 EVIDENCE_OUT default - host
pytest now resolves <repo_root>/e2e-results/evidence/ instead of
/e2e-results/evidence (container-only path; crashed on macOS / non-
root Linux). Docker + Jetson harnesses unaffected (they pass
--evidence-out explicitly). Verified locally: 24 SKIPPED, exit 0,
evidence written. Closes leftover 2026-05-26_evidence_out_default_path.md.

Co-authored-by: Cursor <cursoragent@cursor.com>

2026-05-26 17:18:54 +03:00

10 KiB

Raw Blame History

Autodev State Management

State File: `_docs/_autodev_state.md`

The autodev persists its position to _docs/_autodev_state.md. This is a lightweight pointer — only the current step. All history lives in _docs/ artifacts and git log. Folder scanning is the fallback when the state file doesn't exist.

Template

Saved at: _docs/_autodev_state.md (workspace-relative, one file per project). Created on the first /autodev invocation; updated in place on every state transition; never deleted.

# Autodev State

## Current Step
flow: [greenfield | existing-code | meta-repo]
step: [1-17 for greenfield (incl. fractional 16.5), 1-17 for existing-code (incl. fractional 16.5), 1-6 for meta-repo (incl. fractional 2.5 and 3.5), or "done"]
name: [step name from the active flow's Step Reference Table]
status: [not_started / in_progress / completed / skipped / failed]
sub_step:
  phase: [integer — sub-skill internal phase/step number, or 0 if not started]
  name: [kebab-case short identifier from the sub-skill, or "awaiting-invocation"]
  detail: [optional free-text note, may be empty]
retry_count: [0-3 — consecutive auto-retry attempts, reset to 0 on success]
cycle: [1-N — feature cycle counter for existing-code flow; increments on each "Re-Entry After Completion" loop; always 1 for greenfield and meta-repo]

The sub_step field is structured. Every sub-skill must save both phase (integer) and name (kebab-case token matching the skill's documented phase names). detail is optional human-readable context. On re-entry the orchestrator parses phase and name to resume; if parsing fails, fall back to folder scan and log the parse failure.

Sub-Skill Phase Persistence — Rules (not a registry)

Each sub-skill is authoritative for its own phase list. Phase names and numbers live inside the sub-skill's own SKILL.md (and any steps/ / phases/ files). The orchestrator does not maintain a central phase table — it reads whatever phase / name the sub-skill last wrote.

Every sub-skill MUST follow these rules when persisting sub_step:

phase — a strictly monotonic integer per invocation, starting at 0 (awaiting-invocation) and incrementing by 1 at each internal save point. No fractional values are ever persisted. If the skill's own docs use half-step numbering (e.g., "Phase 4.5", decompose's "Step 1.5"), the persisted integer is simply the next integer, and all subsequent phases shift up by one in that skill's own file.
name — a kebab-case short identifier unique within that sub-skill. Use the phase's heading or step title in kebab-case (e.g., component-decomposition, auto-fix-gate, cross-task-consistency). Different modes of the same skill may reuse a phase integer with distinct name values (e.g., decompose phase 1 is bootstrap-structure in default mode, test-infrastructure-bootstrap in tests-only mode).
detail — optional free-text note (batch index, mode flag, retry hint); may be empty.
Reserved name — name: awaiting-invocation with phase: 0 is the universal "skill was chained but has not started" marker. Every sub-skill implicitly supports it; no sub-skill should reuse the token for anything else.

On re-entry, the orchestrator parses the structured field and resumes at (phase, name). If parsing fails, it falls back to folder scan and logs the parse error — it does NOT guess a phase.

The cycle counter is used by existing-code flow Step 10 (Implement) detection and by implementation report naming (implementation_report_{feature_slug}_cycle{N}.md). It starts at 1 when a project enters existing-code flow (either by routing from greenfield's Done branch, or by first invocation on an existing codebase). It increments on each completed Retrospective → New Task loop.

Examples

flow: greenfield
step: 3
name: Plan
status: in_progress
sub_step:
  phase: 4
  name: architecture-review-risk-assessment
  detail: ""
retry_count: 0
cycle: 1

flow: existing-code
step: 3
name: Test Spec
status: failed
sub_step:
  phase: 1
  name: test-case-generation
  detail: "variant 1b"
retry_count: 3
cycle: 1

flow: meta-repo
step: 2
name: Config Review
status: in_progress
sub_step:
  phase: 0
  name: awaiting-human-review
  detail: "awaiting review of _docs/_repo-config.yaml"
retry_count: 0
cycle: 1

flow: meta-repo
step: 3.5
name: Suite Implement
status: in_progress
sub_step:
  phase: 7
  name: batch-loop
  detail: "AZ-543 batch 1 of 1; suite-level"
retry_count: 0
cycle: 1

flow: existing-code
step: 10
name: Implement
status: in_progress
sub_step:
  phase: 7
  name: batch-loop
  detail: "batch 2 of ~4"
retry_count: 0
cycle: 3

State File Rules

Create on the first autodev invocation (after state detection determines Step 1)
Update after every change — this includes: batch completion, sub-step progress, step completion, session boundary, failed retry, or any meaningful state transition. The state file must always reflect the current reality.
Read as the first action on every invocation — before folder scanning
Cross-check: verify against actual _docs/ folder contents. If they disagree, trust the folder structure and update the state file. Parent suite docs/: on every invocation, also probe <workspace-root>/../docs (the parent directory’s docs folder — typical suite-level shared documentation next to a component repo). If it exists, mention it in the Status Summary footer per protocols.md; use it only as supplemental reading context unless a flow step explicitly ties detection to it. It never replaces workspace _docs/ for step detection by default.
Never delete the state file
Retry tracking: increment retry_count on each failed auto-retry; reset to 0 on success. If retry_count reaches 3, set status: failed
Failed state on re-entry: if status: failed with retry_count: 3, do NOT auto-retry — present the issue to the user first
Skill-internal state: when the active skill maintains its own state file (e.g., document skill's _docs/02_document/state.json), the autodev's sub_step field should reflect the skill's internal progress. On re-entry, cross-check the skill's state file against the autodev's sub_step for consistency.

State Detection

Read _docs/_autodev_state.md first. If it exists and is consistent with the folder structure, use the Current Step from the state file. If the state file doesn't exist or is inconsistent, fall back to folder scanning.

Folder Scan Rules (fallback)

Scan the workspace and _docs/ to determine the current workflow position. The detection rules are defined in each flow file (flows/greenfield.md, flows/existing-code.md, flows/meta-repo.md). Resolution order:

Apply the Flow Resolution rules in SKILL.md to pick the flow first (meta-repo detection takes priority over greenfield/existing-code).
Within the selected flow, check its detection rules in order — first match wins.

Re-Entry Protocol

When the user invokes /autodev and work already exists:

Read _docs/_autodev_state.md
Cross-check against _docs/ folder structure
Present Status Summary (render using the banner template in protocols.md → "Banner Template", filled in with the active flow's "Status Summary — Step List" fragment)
If the detected step has a sub-skill with built-in resumability, the sub-skill handles mid-step recovery
Continue execution from detected state

Session Boundaries

A session boundary is a transition that explicitly breaks auto-chain. Which transitions are boundaries is declared in each flow file's Auto-Chain Rules table — rows marked **Session boundary**. The details live with the steps they apply to; this section defines only the shared mechanism.

Invariant: a flow row without the Session boundary marker auto-chains unconditionally. Missing marker = missing boundary.

Cross-reference — content gates that can also stop auto-chain. Some flow files declare additional gates that block a transition even when the row would otherwise auto-chain. These are NOT session boundaries (they don't end the conversation); they BLOCK the next step until a content prerequisite is satisfied. The orchestrator must respect them in the same place it respects session boundaries — between completing one step and starting the next. Currently declared:

existing-code Re-Entry After Completion → Previous-Cycle Retro Existence Gate (AZ-900) — blocks the cycle: state.cycle + 1 increment if no _docs/06_metrics/retro_*.md file dated within the closing cycle's range exists. Skipped when state.cycle == 1. Presents an A/B/C choice (author now / stub-and-leftover / abort) when triggered. Full spec: .cursor/skills/autodev/flows/existing-code.md § "Previous-Cycle Retro Existence Gate".

Orchestrator mechanism at a boundary

Update the state file: mark the current step completed; set the next step with status: not_started; reset sub_step: {phase: 0, name: awaiting-invocation, detail: ""}; keep retry_count: 0.
Present a brief summary of what just finished (tasks produced, batches expected, etc., as relevant to the boundary).
Present the shared Choose block (template below) — or a flow-specific override if the flow file supplies one.
End the session — do not start the next skill in the same conversation.

Shared Choose template

══════════════════════════════════════
 DECISION REQUIRED: <what just completed> — start <next phase>?
══════════════════════════════════════
 A) Start a new conversation for <next phase> (recommended for context freshness)
 B) Continue in this conversation (NOT recommended — context may degrade)
    Warning: if context fills mid-<next phase>, state will be saved and you will
    still be asked to resume in a new conversation — option B only delays that.
══════════════════════════════════════
 Recommendation: A — <next phase> is long; fresh context helps
══════════════════════════════════════

Individual boundaries MAY override this template with a flow-specific Choose block when the pause has different semantics (e.g., meta-repo.md Step 2 Config Review pauses for human review of a config flag, not for context freshness). The flow file is authoritative for any such override.

10 KiB Raw Blame History Unescape Escape