[AZ-697] [AZ-702] tlog GPS truth + KHP20S30 factory calibration

Batch 98 (cycle 2) — first two PBIs of epic AZ-696 (real-flight validation harness): AZ-697: direct binary-tlog GPS-truth extractor - New src/gps_denied_onboard/replay_input/tlog_ground_truth.py reads GLOBAL_POSITION_INT (with GPS_RAW_INT fallback) from a binary ArduPilot tlog via pymavlink.mavutil and returns a frozen+slotted TlogGroundTruth DTO with per-record ts_ns / lat_deg / lon_deg / alt_m / hdg_deg / vx_m_s / vy_m_s / vz_m_s. - Promoted l2_horizontal_m + match_percentage + GroundTruthRow from tests/e2e/replay/_helpers.py into the new production module src/gps_denied_onboard/helpers/gps_compare.py. The e2e helper now re-exports the same objects (identity, not copies) so existing test imports continue working untouched. - tests/e2e/replay/conftest.py prefers the real derkachi.tlog when present, falls back to the CSV synth path otherwise. - 22 new unit tests cover AC-1..AC-5 (mypy --strict subprocess test included). All passing. AZ-702: Topotek KHP20S30 factory-sheet camera calibration - New _docs/00_problem/input_data/flight_derkachi/khp20s30_factory.json: fx = fy = 4644.444, cx = 960, cy = 540, HFOV ~ 23.3 deg, VFOV ~ 13.2 deg, computed from the published 8.5 mm focal length + 1/2.8" sensor + 1920x1080 capture at lowest zoom step. Distortion zeroed, body_to_camera_se3 = identity with nadir convention. Acquisition method explicitly recorded as factory_sheet so downstream code can expect higher residual error than a lab calibration. - _docs/00_problem/input_data/flight_derkachi/camera_info.md updated to document the assumptions, expected residual error window, and conftest pick-up rule. - tests/e2e/replay/conftest.py::_calibration_path() prefers khp20s30_factory.json when present, falls back to adti26.json. - 9 new unit tests cover AC-1..AC-4 (schema, intrinsics traceback, doc reference, conftest pick-up). All passing. Test run: 45 new tests, all passing. Full-suite gate deferred to Step 16 (after the last batch in cycle 2 per the implement skill). Adjacent note (not fixed in this batch, recorded in the batch report): auto_sync.py has the same redundant pymavlink type:ignore + a few numpy/cv2 mypy --strict issues. None on this batch's path. Refs: _docs/03_implementation/batch_98_cycle2_report.md Refs: _docs/02_tasks/done/AZ-697_tlog_ground_truth_extractor.md Refs: _docs/02_tasks/done/AZ-702_khp20s30_calibration.md Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-23 00:41:13 +00:00 · 2026-05-20 16:09:03 +03:00
parent a12638dd92
commit 64d961f60c
16 changed files with 1503 additions and 134 deletions
@@ -16,6 +16,11 @@ from gps_denied_onboard.helpers.engine_filename_schema import (
    EngineFilenameSchema,
    EngineFilenameSchemaError,
 )
+from gps_denied_onboard.helpers.gps_compare import (
+    GroundTruthRow,
+    l2_horizontal_m,
+    match_percentage,
+)
 from gps_denied_onboard.helpers.imu_preintegrator import (
    CombinedImuFactor,
    ImuPreintegrationError,
@@ -71,6 +76,7 @@ __all__ = [
    "DescriptorNormaliserError",
    "EngineFilenameSchema",
    "EngineFilenameSchemaError",
+    "GroundTruthRow",
    "ImuPreintegrationError",
    "ImuPreintegrator",
    "LightGlueConcurrentAccessError",
@@ -89,7 +95,9 @@ __all__ = [
    "is_valid_rotation",
    "iso_ts_from_clock",
    "iso_ts_now",
+    "l2_horizontal_m",
    "log_map",
+    "match_percentage",
    "make_imu_preintegrator",
    "matrix_to_se3",
    "se3_to_matrix",
@@ -0,0 +1,120 @@
+"""WGS84 GPS comparison helpers (AZ-697 / E-DEMO-REPLAY).
+
+Production helpers for comparing estimator GPS emissions against a
+ground-truth track. Promoted from the AZ-404 e2e test helpers so the
+AZ-699 (real-flight validation runner) and AZ-701 (HTTP replay API)
+code paths can consume them without dragging ``tests/`` into the
+import graph.
+
+The numerical kernels are identical to the prior test-helpers location;
+the snapshot test in ``tests/unit/helpers/test_gps_compare.py`` pins
+that equivalence so a future change to either side breaks loudly.
+"""
+
+from __future__ import annotations
+
+import math
+from dataclasses import dataclass
+from typing import Any
+
+__all__ = [
+    "GroundTruthRow",
+    "l2_horizontal_m",
+    "match_percentage",
+]
+
+
+# WGS84 mean Earth radius. Matches the value used by
+# `helpers/wgs_converter.py` (AZ-279) so this comparison stays
+# consistent with the production geodesy converter.
+_EARTH_RADIUS_M: float = 6_371_008.8
+
+
+@dataclass(frozen=True)
+class GroundTruthRow:
+    """One row of GPS ground-truth (lat/lon/alt at a time)."""
+
+    t_s: float
+    lat_deg: float
+    lon_deg: float
+    alt_m: float
+
+
+def l2_horizontal_m(
+    lat1_deg: float, lon1_deg: float, lat2_deg: float, lon2_deg: float
+) -> float:
+    """WGS84-spherical great-circle distance in metres.
+
+    Haversine with the C5/AZ-279 mean Earth radius. The spherical
+    approximation diverges from the WGS84 ellipsoid by < 0.5 % in the
+    [-60°, 60°] latitude band — sufficient for the AZ-696 epic's
+    ≤ 100 m AC-3 threshold.
+    """
+    phi1 = math.radians(lat1_deg)
+    phi2 = math.radians(lat2_deg)
+    dphi = phi2 - phi1
+    dlam = math.radians(lon2_deg - lon1_deg)
+    a = (
+        math.sin(dphi / 2.0) ** 2
+        + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2.0) ** 2
+    )
+    c = 2.0 * math.asin(min(1.0, math.sqrt(a)))
+    return _EARTH_RADIUS_M * c
+
+
+def match_percentage(
+    emissions: list[dict[str, Any]],
+    ground_truth: list[GroundTruthRow],
+    *,
+    threshold_m: float,
+) -> float:
+    """Share of emissions within ``threshold_m`` of the closest GT row.
+
+    For each emitted ``EstimatorOutput`` JSONL record, finds the
+    nearest-in-time ground-truth row, computes the horizontal L2
+    distance, and counts it as a hit when ≤ ``threshold_m``. Returns the
+    hit ratio in ``[0.0, 1.0]``.
+
+    Nearest-in-time is sufficient when GT cadence (5–10 Hz for tlog
+    GPS) places the candidate row within ~100 ms of the emit timestamp,
+    well below typical drone-replay error budgets.
+    """
+    if not emissions:
+        return 0.0
+    if not ground_truth:
+        raise AssertionError("ground_truth must be non-empty")
+    gt_sorted = sorted(ground_truth, key=lambda r: r.t_s)
+    gt_times = [r.t_s for r in gt_sorted]
+    hits = 0
+    for emit in emissions:
+        emit_ts_ns = int(emit["emitted_at"])
+        emit_t_s = emit_ts_ns / 1e9
+        idx = _bisect_left(gt_times, emit_t_s)
+        candidates = []
+        if idx > 0:
+            candidates.append(gt_sorted[idx - 1])
+        if idx < len(gt_sorted):
+            candidates.append(gt_sorted[idx])
+        nearest = min(candidates, key=lambda r: abs(r.t_s - emit_t_s))
+        emit_pos = emit["position_wgs84"]
+        d = l2_horizontal_m(
+            emit_pos["lat_deg"],
+            emit_pos["lon_deg"],
+            nearest.lat_deg,
+            nearest.lon_deg,
+        )
+        if d <= threshold_m:
+            hits += 1
+    return hits / len(emissions)
+
+
+def _bisect_left(seq: list[float], target: float) -> int:
+    """Stdlib bisect_left, inlined to keep this module's import surface narrow."""
+    lo, hi = 0, len(seq)
+    while lo < hi:
+        mid = (lo + hi) // 2
+        if seq[mid] < target:
+            lo = mid + 1
+        else:
+            hi = mid
+    return lo