Files
gps-denied-desktop/docs/03_tests/00_test_summary.md
T
Oleksandr Bezdieniezhnykh b12f37ab01 update tests
2025-11-30 16:21:03 +02:00

6.8 KiB

ASTRAL-Next Test Specifications Summary

Overview

Comprehensive test specifications for the GPS-denied navigation system following the QA testing pyramid approach.

Total Test Specifications: 49

Test Organization

Integration Tests (01-16): Component Level

Tests individual system components in isolation with their dependencies.

Vision Pipeline (01-04):

  • 01: Sequential Visual Odometry (F07 - SuperPoint + LightGlue)
  • 02: Global Place Recognition (F08 - AnyLoc/DINOv2)
  • 03: Metric Refinement (F09 - LiteSAM)
  • 04: Factor Graph Optimizer (F10 - GTSAM)

Data Management (05-08):

  • 05: Satellite Data Manager (F04)
  • 06: Coordinate Transformer (F13)
  • 07: Image Input Pipeline (F05)
  • 08: Image Rotation Manager (F06)

Service Infrastructure (09-12):

  • 09: REST API (F01 - FastAPI endpoints)
  • 10: SSE Event Streamer (F15 - real-time streaming)
  • 11a: Flight Lifecycle Manager (F02.1 - CRUD, initialization, API delegation)
  • 11b: Flight Processing Engine (F02.2 - processing loop, recovery coordination)
  • 12: Result Manager (F14)

Support Components (13-16):

  • 13: Model Manager (F16 - TensorRT)
  • 14: Failure Recovery Coordinator (F11)
  • 15: Configuration Manager (F17)
  • 16: Database Layer (F03)

System Integration Tests (21-25): Multi-Component Flows

Tests integration between multiple components.

  • 21: End-to-End Normal Flight
  • 22: Satellite to Vision Pipeline (F04 → F07/F08/F09)
  • 23: Vision to Optimization Pipeline (F07/F08/F09 → F10)
  • 24: Multi-Component Error Propagation
  • 25: Real-Time Streaming Pipeline (F02 → F14 → F15)

Acceptance Tests (31-50): Requirements Validation

Tests mapped to 10 acceptance criteria.

Accuracy (31-33):

  • 31: AC-1 - 80% < 50m error (baseline)
  • 32: AC-1 - 80% < 50m error (varied terrain)
  • 33: AC-2 - 60% < 20m error (high precision)

Robustness - Outliers (34-35):

  • 34: AC-3 - Single 350m outlier handling
  • 35: AC-3 - Multiple outliers handling

Robustness - Sharp Turns (36-38):

  • 36: AC-4 - Sharp turn zero overlap recovery
  • 37: AC-4 - Sharp turn minimal overlap (<5%)
  • 38: Outlier anchor detection

Multi-Fragment (39):

  • 39: AC-5 - Multi-fragment route connection (chunk architecture)

User Interaction (40):

  • 40: AC-6 - User input after 3 consecutive failures

Performance (41-44):

  • 41: AC-7 - <5s single image processing
  • 42: AC-7 - Sustained throughput performance
  • 43: AC-8 - Real-time streaming results
  • 44: AC-8 - Async refinement delivery

Quality Metrics (45-47):

  • 45: AC-9 - Registration rate >95% (baseline)
  • 46: AC-9 - Registration rate >95% (challenging conditions)
  • 47: AC-10 - Mean Reprojection Error <1.0 pixels

Cross-Cutting (48-50):

  • 48: Long flight (3000 images)
  • 49: Degraded satellite data
  • 50: Complete system acceptance validation

Chunk-Based Recovery (55-56):

  • 55: Chunk rotation recovery (rotation sweeps for chunks)
  • 56: Multi-chunk simultaneous processing (Atlas architecture)

GPS-Analyzed Scenario Tests (51-54): Real Data

Tests using GPS-analyzed test datasets.

  • 51: Test_Baseline (AD000001-030) - Standard flight
  • 52: Test_Outlier_350m (AD000045-050) - Outlier scenario
  • 53: Test_Sharp_Turn - Multiple sharp turn datasets
  • 54: Test_Long_Flight (AD000001-060) - Full dataset

Test Data

GPS Analysis Results

  • Mean distance: 120.8m
  • Min distance: 24.2m
  • Max distance: 268.6m

Identified Sharp Turns (>200m):

  • AD000003 → AD000004: 202.2m
  • AD000032 → AD000033: 220.6m
  • AD000042 → AD000043: 234.2m
  • AD000044 → AD000045: 230.2m
  • AD000047 → AD000048: 268.6m (largest outlier)

Test Datasets

Test_Baseline: AD000001-030 (30 images, normal spacing) Test_Outlier_350m: AD000045-050 (6 images, 268.6m outlier) Test_Sharp_Turn_A: AD000042, AD000044, AD000045, AD000046 (skip 043) Test_Sharp_Turn_B: AD000032-035 (220m jump) Test_Sharp_Turn_C: AD000003, AD000009 (5-frame gap) Test_Long_Flight: AD000001-060 (all 60 images, all variations)

Acceptance Criteria Coverage

AC Requirement Test Specs Status
AC-1 80% < 50m error 31, 32, 50, 51, 54 ✓ Covered
AC-2 60% < 20m error 33, 50, 51, 54 ✓ Covered
AC-3 350m outlier robust 34, 35, 50, 52, 54 ✓ Covered
AC-4 Sharp turn <5% overlap 36, 37, 50, 53, 54, 55 ✓ Covered
AC-5 Multi-fragment connection 39, 50, 56 ✓ Covered
AC-6 User input after 3 failures 40, 50 ✓ Covered
AC-7 <5s per image 41, 42, 50, 51, 54 ✓ Covered
AC-8 Real-time + refinement 43, 44, 50 ✓ Covered
AC-9 Registration >95% 45, 46, 50, 51, 54 ✓ Covered
AC-10 MRE <1.0px 47, 50 ✓ Covered

Component to Test Mapping

Component ID Integration Test
Flight API F01 09
Flight Lifecycle Manager F02.1 11a
Flight Processing Engine F02.2 11b
Flight Database F03 16
Satellite Data Manager F04 05
Image Input Pipeline F05 07
Image Rotation Manager F06 08
Sequential Visual Odometry F07 01
Global Place Recognition F08 02
Metric Refinement F09 03
Factor Graph Optimizer F10 04
Failure Recovery Coordinator F11 14
Route Chunk Manager F12 39, 55, 56
Coordinate Transformer F13 06
Result Manager F14 12
SSE Event Streamer F15 10
Model Manager F16 13
Configuration Manager F17 15

Test Execution Strategy

Phase 1: Component Integration (01-16)

  • Validate each component individually
  • Verify interfaces and dependencies
  • Establish baseline performance metrics

Phase 2: System Integration (21-25)

  • Test multi-component interactions
  • Validate end-to-end flows
  • Verify error handling across components

Phase 3: Acceptance Testing (31-50)

  • Validate all acceptance criteria
  • Use GPS-analyzed real data
  • Measure against requirements

Phase 4: Special Scenarios (51-56)

  • Test specific GPS-identified situations
  • Validate outliers and sharp turns
  • Chunk-based recovery scenarios
  • Full system validation

Success Criteria Summary

Integration Tests: All components pass individual tests, interfaces work correctly System Tests: Multi-component flows work, errors handled properly Acceptance Tests: All 10 ACs met with real data Overall: System meets all requirements, ready for deployment

Test Metrics to Track

  • Accuracy: Mean error, RMSE, percentiles
  • Performance: Processing time per image, total time
  • Reliability: Registration rate, success rate
  • Quality: MRE, confidence scores
  • Robustness: Outlier handling, error recovery

Notes

  • All test specs follow standard format (Integration vs Acceptance)
  • GPS-analyzed datasets based on actual test data coordinates
  • Specifications ready for QA team implementation
  • No code included per requirement
  • Tests cover all components and all acceptance criteria