Oleksandr Bezdieniezhnykh 59d9116d36 [AZ-406] Blackbox test harness bootstrap (Tier-1 + Tier-2 scaffold)
Bootstraps the public-boundary blackbox test harness owned by epic
AZ-262 (E-BBT). Establishes the e2e/ directory tree at the repo root,
fully separated from src/gps_denied_onboard/** and from the in-process
tests/** tree, and commits to the contracts every subsequent test
ticket (AZ-407..AZ-446) builds against.

Tier-1 (workstation Docker):
- docker/docker-compose.test.yml wires SUT + ArduPilot SITL + iNav SITL
  + mock Suite Sat Service + mavproxy listener + e2e-runner onto one
  e2e-net bridge with internal: true (enforces RESTRICT-SAT-1 /
  NFT-SEC-02 egress isolation at the network layer).
- docker/docker-compose.tier2-bridge.yml override disables the in-
  compose SUT so Tier-2 pairs SITLs + mock + runner on an x86 host
  while the SUT runs natively on the Jetson under systemd.

Tier-2 (Jetson):
- jetson/run-tier2.sh + tier2.service systemd unit + tegrastats /
  jtop parsers feed per-sample telemetry into the evidence bundle.

Runner image (e2e/runner/):
- Dockerfile + requirements.txt install ONLY ground-side libs
  (pymavlink, opencv-python>=4.12, numpy/scipy/geopy/pyproj, httpx,
  orjson, pydantic, structlog, pytest 8.x). The runner deliberately
  does NOT install the SUT package.
- conftest.py implements the AC-9 skip-rule mapping (tier2_only,
  chamber_only, vins_mono, deferred_ac) tied to environment.md
  parametrize axes.
- reporting/csv_reporter.py is a pytest plugin emitting one row per
  test with the exact 11-column schema from environment.md §
  Reporting (test_id, test_name, traces_to, fc_adapter, vio_strategy,
  tier, started_at_utc, execution_time_ms, result, error_message,
  evidence_paths). XFAIL surfaced only when a test carries
  @pytest.mark.deferred_ac(verdict="xfail", reason=...).
- reporting/evidence_bundler.py exposes the attach_evidence fixture
  that copies per-test artifacts (.tlog, FDR archives, screenshots,
  tegrastats / jtop CSVs) into the run bundle and records relative
  paths into the reporter's evidence_paths column.
- helpers/{frame_source_replay,imu_replay,sitl_observer,
  mavproxy_tlog_reader,fdr_reader}.py declare the public surfaces
  (concrete implementations owned by AZ-407 / AZ-408 / AZ-416 /
  AZ-417 / AZ-441 per the dependency table); helpers/geo.py ships
  today (no downstream task dep) — WGS84 distance / forward-bearing
  / offset via pyproj with NaN rejection.

Mock Suite Sat Service (e2e/fixtures/mock-suite-sat/):
- FastAPI app: POST /tiles (ingest contract from D-PROJ-2 follow-up),
  GET /tiles/audit + /mock/audit (per-run read-back), POST
  /mock/config (force-status, response delay), POST /mock/reset
  (clears audit between tests), GET /mock/health.

Fixture scaffolds (e2e/fixtures/{tile-cache-builder, age-injector,
injectors, cold-boot, secrets, security}/):
- Public surfaces only. Concrete builders land in AZ-407 (static
  fixtures), AZ-408 (runtime synthetic injection), AZ-419 (cold-boot
  fixture), AZ-439 (CVE-2025-53644 JPEG generator).

Test tree (e2e/tests/{positive,negative,performance,resilience,
security,resource_limit}/):
- Mirror of the test-spec category grouping in
  _docs/02_document/tests/*-tests.md.
- tests/positive/test_smoke.py is the AC-1 harness-boot smoke test, run
  inside the e2e-runner image once Docker brings everything up.

Out-of-container unit tests (e2e/_unit_tests/):
- Exercise the harness internals (CSV reporter plugin lifecycle,
  conftest skip rules, helper modules, parsers, mock app, compose
  YAML structural contract, public-boundary enforcement) without
  Docker / SITL. 97 unit tests, all passing.

Build / config:
- pyproject.toml: testpaths extended with e2e/_unit_tests; pythonpath
  extended with e2e; fastapi>=0.111,<0.120 added to dev extras for the
  mock-app TestClient unit test.

AC coverage:
- AC-1 (Tier-1 boot)         → compose YAML test + directory layout
                                + smoke test (Docker-bound)
- AC-2 (mock services)       → 6 FastAPI TestClient unit tests
- AC-3 (SITLs accept output) → contract present; concrete check
                                deferred to AZ-416 / AZ-417
- AC-4 (CSV columns)         → in-process plugin lifecycle test
                                emits the exact 11-column schema
- AC-5 (egress isolation)    → static config test + runtime probe
                                in Docker-bound smoke
- AC-6 (Tier-2 contract)     → tegrastats + jtop parser unit tests
                                + jetson/* layout test; full Tier-2
                                contract is AZ-444
- AC-7 (fixture reproducibility) → deferred to AZ-407 per task spec
- AC-8 (parametrize matrix)  → vins_mono skip-rule cases +
                                tests/positive/test_smoke
- AC-9 (skip semantics)      → 9 conftest skip-rule unit tests

The module layout entry for blackbox_tests was added in the 2026-05-16
preparatory commit d7a17a8 so this diff stays focused on the harness
scaffold. AZ-406 advances to In Testing on commit.

Co-authored-by: Cursor <cursoragent@cursor.com>
2026-05-16 16:22:44 +03:00

130 lines
4.4 KiB
Python
Executable File

"""Sample jtop (jetson-stats) Python API → per-sample CSV rows.

Unlike tegrastats, which is a stdout stream, jtop exposes a Python API
that emits a polled state dictionary. We poll at a caller-supplied
cadence and convert the relevant fields to CSV columns aligned with the
tegrastats output where the two overlap.

Schema (CSV columns):
    timestamp_utc_iso, ram_used_mb, ram_total_mb, gpu_load_pct,
    gpu_freq_mhz, cpu_load_avg_pct, soc_temp_c, gpu_temp_c, power_mw,
    extras_json

Usage:
    python3 jtop_parser.py --out out.csv --interval 1.0
"""

from __future__ import annotations

import argparse
import csv
import json
import time
from datetime import datetime, timezone
from pathlib import Path

UTC = timezone.utc

CSV_COLUMNS = (
    "timestamp_utc_iso",
    "ram_used_mb",
    "ram_total_mb",
    "gpu_load_pct",
    "gpu_freq_mhz",
    "cpu_load_avg_pct",
    "soc_temp_c",
    "gpu_temp_c",
    "power_mw",
    "extras_json",
)


def state_to_row(state: object) -> dict[str, object]:
    """Convert one jtop polled-state object to a CSV row.

    `state` is whatever `jtop.jtop().stats` returns; on real Jetson runs it
    is a `JtopStats` dataclass-ish object exposing `ram`, `gpu`, `cpu`,
    `temperature`, `power`. We extract defensively because the jetson-stats
    schema has shifted across versions.
    """

    def _get(obj: object, *path: str, default: object = "") -> object:
        # Walk `path` through nested dicts / attributes. A missing or None
        # hop yields `default`; the default itself is never traversed.
        cur = obj
        for key in path:
            if cur is None:
                return default
            if isinstance(cur, dict):
                cur = cur.get(key)
            else:
                cur = getattr(cur, key, None)
        return cur if cur is not None else default

    row: dict[str, object] = {
        "timestamp_utc_iso": datetime.now(UTC).isoformat(timespec="milliseconds"),
        "ram_used_mb": _get(state, "ram", "used"),
        "ram_total_mb": _get(state, "ram", "tot"),
        "gpu_load_pct": _get(state, "gpu", "load"),
        "gpu_freq_mhz": _get(state, "gpu", "freq", "cur"),
        "cpu_load_avg_pct": _get(state, "cpu", "load_avg", default=""),
        "soc_temp_c": _get(state, "temperature", "SOC", default=""),
        "gpu_temp_c": _get(state, "temperature", "GPU", default=""),
        "power_mw": _get(state, "power", "total", default=""),
        "extras_json": "",
    }
    return row


def run(out_path: Path, interval_s: float, samples_max: int | None = None) -> int:
    """Poll jtop and write rows to ``out_path``. Returns rows written.

    On hosts without jetson-stats installed (e.g., unit-test runs on dev
    workstations), the import raises ImportError. The function then writes
    a single "stub" row naming the missing dependency and returns 1. This
    keeps Tier-2 dry runs and CI smoke happy without forcing CI to install
    jetson-stats.
    """
    out_path.parent.mkdir(parents=True, exist_ok=True)
    rows_written = 0
    try:
        from jtop import jtop  # type: ignore[import-untyped]
    except ImportError as exc:
        # Stub path: header plus one otherwise-empty row flagged in extras_json.
        with out_path.open("w", newline="", encoding="utf-8") as fh:
            writer = csv.DictWriter(fh, fieldnames=list(CSV_COLUMNS))
            writer.writeheader()
            writer.writerow(
                {
                    **{col: "" for col in CSV_COLUMNS},
                    "timestamp_utc_iso": datetime.now(UTC).isoformat(timespec="milliseconds"),
                    "extras_json": json.dumps(
                        {"stub": True, "missing_dep": "jetson-stats", "import_error": str(exc)}
                    ),
                }
            )
        return 1

    with jtop() as poll, out_path.open("w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=list(CSV_COLUMNS))
        writer.writeheader()
        while poll.ok():
            row = state_to_row(poll.stats)
            writer.writerow(row)
            fh.flush()  # keep the file usable if the run is interrupted
            rows_written += 1
            if samples_max is not None and rows_written >= samples_max:
                break
            time.sleep(interval_s)
    return rows_written


def main() -> int:
    parser = argparse.ArgumentParser(description="Sample jtop → CSV.")
    parser.add_argument("--out", type=Path, required=True)
    parser.add_argument("--interval", type=float, default=1.0, help="Poll interval in seconds.")
    parser.add_argument("--samples-max", type=int, default=None)
    args = parser.parse_args()
    n = run(args.out, args.interval, args.samples_max)
    print(f"jtop_parser: wrote {n} rows to {args.out}")
    return 0


if __name__ == "__main__":
    raise SystemExit(main())