Oleksandr Bezdieniezhnykh 59d9116d36 [AZ-406] Blackbox test harness bootstrap (Tier-1 + Tier-2 scaffold)
Bootstraps the public-boundary blackbox test harness owned by epic
AZ-262 (E-BBT). Establishes the e2e/ directory tree at the repo root,
fully separated from src/gps_denied_onboard/** and from the in-process
tests/** tree, and commits to the contracts every subsequent test
ticket (AZ-407..AZ-446) builds against.

Tier-1 (workstation Docker):
- docker/docker-compose.test.yml wires SUT + ArduPilot SITL + iNav SITL
  + mock Suite Sat Service + mavproxy listener + e2e-runner onto one
  e2e-net bridge with internal: true (enforces RESTRICT-SAT-1 /
  NFT-SEC-02 egress isolation at the network layer).
- docker/docker-compose.tier2-bridge.yml override disables the in-
  compose SUT so Tier-2 pairs SITLs + mock + runner on an x86 host
  while the SUT runs natively on the Jetson under systemd.
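
A minimal sketch of what a static egress-isolation check over the compose file could look like, assuming the YAML has already been parsed into a plain dict. The function names and the example service names are hypothetical and are not taken from the harness:

```python
from typing import Any


def network_is_internal(compose: dict[str, Any], network: str) -> bool:
    """Return True only if `network` is declared with `internal: true`."""
    networks = compose.get("networks") or {}
    spec = networks.get(network) or {}
    return spec.get("internal") is True


def services_off_network(compose: dict[str, Any], network: str) -> list[str]:
    """List services that are attached to anything other than `network` alone.

    A service without a `networks` key lands on the Compose default network,
    so it is reported as an offender as well.
    """
    offenders = []
    for name, svc in (compose.get("services") or {}).items():
        nets = svc.get("networks") or []
        # Compose accepts either a list of names or a mapping keyed by name;
        # list() yields the network names in both cases.
        if list(nets) != [network]:
            offenders.append(name)
    return sorted(offenders)


# Hypothetical parsed compose document (service names are illustrative).
example = {
    "networks": {"e2e-net": {"driver": "bridge", "internal": True}},
    "services": {
        "sut": {"networks": ["e2e-net"]},
        "e2e-runner": {"networks": {"e2e-net": {}}},
    },
}
```

With this shape, a structural test can assert that `network_is_internal(example, "e2e-net")` holds and that `services_off_network(example, "e2e-net")` is empty, without starting Docker.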

Tier-2 (Jetson):
- jetson/run-tier2.sh + tier2.service systemd unit + tegrastats /
  jtop parsers feed per-sample telemetry into the evidence bundle.

Runner image (e2e/runner/):
- Dockerfile + requirements.txt install ONLY ground-side libs
  (pymavlink, opencv-python>=4.12, numpy/scipy/geopy/pyproj, httpx,
  orjson, pydantic, structlog, pytest 8.x). The runner deliberately
  does NOT install the SUT package.
- conftest.py implements the AC-9 skip-rule mapping (tier2_only,
  chamber_only, vins_mono, deferred_ac) tied to environment.md
  parametrize axes.
- reporting/csv_reporter.py is a pytest plugin emitting one row per
  test with the exact 11-column schema from environment.md §
  Reporting (test_id, test_name, traces_to, fc_adapter, vio_strategy,
  tier, started_at_utc, execution_time_ms, result, error_message,
  evidence_paths). XFAIL surfaced only when a test carries
  @pytest.mark.deferred_ac(verdict="xfail", reason=...).
- reporting/evidence_bundler.py exposes the attach_evidence fixture
  that copies per-test artifacts (.tlog, FDR archives, screenshots,
  tegrastats / jtop CSVs) into the run bundle and records relative
  paths into the reporter's evidence_paths column.
- helpers/{frame_source_replay,imu_replay,sitl_observer,
  mavproxy_tlog_reader,fdr_reader}.py declare the public surfaces
  (concrete implementations owned by AZ-407 / AZ-408 / AZ-416 /
  AZ-417 / AZ-441 per the dependency table); helpers/geo.py ships
  today (no downstream task dep) — WGS84 distance / forward-bearing
  / offset via pyproj with NaN rejection.
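
As an illustration of the 11-column contract listed above, the following sketch renders rows with the standard-library `csv` module. It is not the reporter plugin itself. The column names come from the list above; `render_rows`, the `;` separator for multiple evidence paths, and the strict missing-column check are assumptions made for this example:

```python
import csv
import io

# Column order as listed in the commit message.
COLUMNS = (
    "test_id", "test_name", "traces_to", "fc_adapter", "vio_strategy",
    "tier", "started_at_utc", "execution_time_ms", "result",
    "error_message", "evidence_paths",
)


def render_rows(rows: list[dict]) -> str:
    """Render one CSV line per test, rejecting rows that break the schema."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=COLUMNS, extrasaction="raise", lineterminator="\n"
    )
    writer.writeheader()
    for row in rows:
        missing = set(COLUMNS) - row.keys()
        if missing:
            raise ValueError(f"missing columns: {sorted(missing)}")
        out = dict(row)
        # Assumption: several evidence paths are joined with ';' in one cell.
        out["evidence_paths"] = ";".join(row["evidence_paths"])
        writer.writerow(out)  # unknown keys raise ValueError (extrasaction)
    return buf.getvalue()
```

Unknown keys are rejected by `extrasaction="raise"` and missing keys by the explicit check, so a row either matches the schema exactly or fails loudly.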

Mock Suite Sat Service (e2e/fixtures/mock-suite-sat/):
- FastAPI app: POST /tiles (ingest contract from D-PROJ-2 follow-up),
  GET /tiles/audit + /mock/audit (per-run read-back), POST
  /mock/config (force-status, response delay), POST /mock/reset
  (clears audit between tests), GET /mock/health.

Fixture scaffolds (e2e/fixtures/{tile-cache-builder, age-injector,
injectors, cold-boot, secrets, security}/):
- Public surfaces only. Concrete builders land in AZ-407 (static
  fixtures), AZ-408 (runtime synthetic injection), AZ-419 (cold-boot
  fixture), AZ-439 (CVE-2025-53644 JPEG generator).

Test tree (e2e/tests/{positive,negative,performance,resilience,
security,resource_limit}/):
- Mirror of the test-spec category grouping in
  _docs/02_document/tests/*-tests.md.
- tests/positive/test_smoke.py is the AC-1 harness-boot smoke test, run
  inside the e2e-runner image once Docker brings everything up.

Out-of-container unit tests (e2e/_unit_tests/):
- Exercise the harness internals (CSV reporter plugin lifecycle,
  conftest skip rules, helper modules, parsers, mock app, compose
  YAML structural contract, public-boundary enforcement) without
  Docker / SITL. 97 unit tests, all passing.

Build / config:
- pyproject.toml: testpaths extended with e2e/_unit_tests; pythonpath
  extended with e2e; fastapi>=0.111,<0.120 added to dev extras for the
  mock-app TestClient unit test.

AC coverage:
- AC-1 (Tier-1 boot)         → compose YAML test + directory layout
                                + smoke test (Docker-bound)
- AC-2 (mock services)       → 6 FastAPI TestClient unit tests
- AC-3 (SITLs accept output) → contract present; concrete check
                                deferred to AZ-416 / AZ-417
- AC-4 (CSV columns)         → in-process plugin lifecycle test
                                emits the exact 11-column schema
- AC-5 (egress isolation)    → static config test + runtime probe
                                in Docker-bound smoke
- AC-6 (Tier-2 contract)     → tegrastats + jtop parser unit tests
                                + jetson/* layout test; full Tier-2
                                contract is AZ-444
- AC-7 (fixture reproducibility) → deferred to AZ-407 per task spec
- AC-8 (parametrize matrix)  → vins_mono skip-rule cases +
                                tests/positive/test_smoke
- AC-9 (skip semantics)      → 9 conftest skip-rule unit tests

The module layout entry for blackbox_tests was added in the 2026-05-16
preparatory commit d7a17a8 so this diff stays focused on the harness
scaffold. AZ-406 advances to In Testing on commit.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-16 16:22:44 +03:00


"""Evidence bundler pytest plugin.

For each test, collects supporting artifacts (`.tlog`, FDR archive snapshots,
screenshots, profiler traces, tegrastats / jtop CSVs) into a per-run bundle
at ``--evidence-out`` (default ``/e2e-results/<run-id>/evidence/``) and
records the resulting paths in the CSV reporter's ``evidence_paths`` column.

The bundler is INERT by default: tests opt in by calling the
``attach_evidence`` fixture with a file path. The runner conftest registers
this plugin via `pytest_plugins`.
"""

from __future__ import annotations

import shutil
from collections.abc import Callable
from pathlib import Path

import pytest

from .csv_reporter import reporter_for


def _safe_relpath(target: Path, base: Path) -> str:
    try:
        return str(target.relative_to(base))
    except ValueError:
        # If the target isn't under base, we still record its absolute path.
        # The bundle copy below makes the absolute fallback robust to
        # arbitrary source locations (e.g. /tlogs/<run>.tlog).
        return str(target)


@pytest.fixture
def attach_evidence(
    request: pytest.FixtureRequest,
    evidence_dir: Path,
) -> Callable[[str | Path], str]:
    """Copy a file into the run evidence bundle and record its CSV path.

    Returns a callable ``attach(path) -> str``; the test invokes it after
    capturing an artifact (e.g., the .tlog file or an FDR snapshot). The
    returned string is the path that will appear in the CSV
    ``evidence_paths`` column.

    The implementation copies the file (rather than moving it) so the same
    artifact can be referenced by multiple tests if needed.
    """
    nodeid = request.node.nodeid
    config = request.config
    reporter = reporter_for(config)
    bundle_root = evidence_dir / _slug(nodeid)
    bundle_root.mkdir(parents=True, exist_ok=True)

    def _attach(path: str | Path) -> str:
        src = Path(path)
        if not src.exists():
            raise FileNotFoundError(f"attach_evidence: {src} not found")
        dst = bundle_root / src.name
        # If a test attaches the same name twice in one run, disambiguate.
        if dst.exists():
            stem, suffix = src.stem, src.suffix
            counter = 1
            while dst.exists():
                dst = bundle_root / f"{stem}__{counter}{suffix}"
                counter += 1
        shutil.copy2(src, dst)
        rel = _safe_relpath(dst, evidence_dir.parent)
        if reporter is not None:
            reporter.attach_evidence(nodeid, rel)
        return rel

    return _attach


def _slug(nodeid: str) -> str:
    """Filesystem-safe slug for the nodeid (preserves uniqueness, no path chars)."""
    return (
        nodeid.replace("/", "_")
        .replace("::", "__")
        .replace("[", "_")
        .replace("]", "")
        .replace(" ", "")
    )
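
The slug and duplicate-name handling above can be tried outside pytest. The sketch below re-implements both steps as standalone functions (`slug`, `copy_unique`, and `demo` are names chosen for this example) and runs them against a temporary directory. The test node id, including `test_boot` and its parameter ids, is hypothetical:

```python
import shutil
import tempfile
from pathlib import Path


def slug(nodeid: str) -> str:
    """Same replacement chain as `_slug` in the module above."""
    return (
        nodeid.replace("/", "_")
        .replace("::", "__")
        .replace("[", "_")
        .replace("]", "")
        .replace(" ", "")
    )


def copy_unique(src: Path, bundle_root: Path) -> Path:
    """Copy `src` into `bundle_root`, adding `__N` when the name is taken."""
    bundle_root.mkdir(parents=True, exist_ok=True)
    dst = bundle_root / src.name
    counter = 1
    while dst.exists():
        dst = bundle_root / f"{src.stem}__{counter}{src.suffix}"
        counter += 1
    shutil.copy2(src, dst)
    return dst


def demo() -> list[str]:
    """Attach the same artifact three times and return the recorded paths."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        src = root / "flight.tlog"
        src.write_bytes(b"\x00")
        # Hypothetical node id; only the slug transformation matters here.
        nodeid = "tests/positive/test_smoke.py::test_boot[ardupilot-vins_mono]"
        bundle = root / "evidence" / slug(nodeid)
        return [
            copy_unique(src, bundle).relative_to(root).as_posix()
            for _ in range(3)
        ]
```

Under these assumptions the three copies land as `flight.tlog`, `flight__1.tlog`, and `flight__2.tlog` inside `evidence/tests_positive_test_smoke.py__test_boot_ardupilot-vins_mono/`, which mirrors how repeated attachments of one file name are kept apart within a single test's bundle directory.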