# ASTRAL-Next Test Specifications Summary ## Overview Comprehensive test specifications for the GPS-denied navigation system following the QA testing pyramid approach. **Total Test Specifications**: 49 ## Test Organization ### Integration Tests (01-16): Component Level Tests individual system components in isolation with their dependencies. **Vision Pipeline (01-04)**: - 01: Sequential Visual Odometry (F07 - SuperPoint + LightGlue) - 02: Global Place Recognition (F08 - AnyLoc/DINOv2) - 03: Metric Refinement (F09 - LiteSAM) - 04: Factor Graph Optimizer (F10 - GTSAM) **Data Management (05-08)**: - 05: Satellite Data Manager (F04) - 06: Coordinate Transformer (F13) - 07: Image Input Pipeline (F05) - 08: Image Rotation Manager (F06) **Service Infrastructure (09-12)**: - 09: REST API (F01 - FastAPI endpoints) - 10: SSE Event Streamer (F15 - real-time streaming) - 11a: Flight Lifecycle Manager (F02.1 - CRUD, initialization, API delegation) - 11b: Flight Processing Engine (F02.2 - processing loop, recovery coordination) - 12: Result Manager (F14) **Support Components (13-16)**: - 13: Model Manager (F16 - TensorRT) - 14: Failure Recovery Coordinator (F11) - 15: Configuration Manager (F17) - 16: Database Layer (F03) ### System Integration Tests (21-25): Multi-Component Flows Tests integration between multiple components. - 21: End-to-End Normal Flight - 22: Satellite to Vision Pipeline (F04 → F07/F08/F09) - 23: Vision to Optimization Pipeline (F07/F08/F09 → F10) - 24: Multi-Component Error Propagation - 25: Real-Time Streaming Pipeline (F02 → F14 → F15) ### Acceptance Tests (31-50): Requirements Validation Tests mapped to 10 acceptance criteria. **Accuracy (31-33)**: - 31: AC-1 - 80% < 50m error (baseline) - 32: AC-1 - 80% < 50m error (varied terrain) - 33: AC-2 - 60% < 20m error (high precision) **Robustness - Outliers (34-35)**: - 34: AC-3 - Single 350m outlier handling - 35: AC-3 - Multiple outliers handling **Robustness - Sharp Turns (36-38)**: - 36: AC-4 - Sharp turn zero overlap recovery - 37: AC-4 - Sharp turn minimal overlap (<5%) - 38: Outlier anchor detection **Multi-Fragment (39)**: - 39: AC-5 - Multi-fragment route connection (chunk architecture) **User Interaction (40)**: - 40: AC-6 - User input after 3 consecutive failures **Performance (41-44)**: - 41: AC-7 - <5s single image processing - 42: AC-7 - Sustained throughput performance - 43: AC-8 - Real-time streaming results - 44: AC-8 - Async refinement delivery **Quality Metrics (45-47)**: - 45: AC-9 - Registration rate >95% (baseline) - 46: AC-9 - Registration rate >95% (challenging conditions) - 47: AC-10 - Mean Reprojection Error <1.0 pixels **Cross-Cutting (48-50)**: - 48: Long flight (3000 images) - 49: Degraded satellite data - 50: Complete system acceptance validation **Chunk-Based Recovery (55-56)**: - 55: Chunk rotation recovery (rotation sweeps for chunks) - 56: Multi-chunk simultaneous processing (Atlas architecture) ### GPS-Analyzed Scenario Tests (51-54): Real Data Tests using GPS-analyzed test datasets. - 51: Test_Baseline (AD000001-030) - Standard flight - 52: Test_Outlier_350m (AD000045-050) - Outlier scenario - 53: Test_Sharp_Turn - Multiple sharp turn datasets - 54: Test_Long_Flight (AD000001-060) - Full dataset ## Test Data ### GPS Analysis Results - Mean distance: 120.8m - Min distance: 24.2m - Max distance: 268.6m **Identified Sharp Turns (>200m)**: - AD000003 → AD000004: 202.2m - AD000032 → AD000033: 220.6m - AD000042 → AD000043: 234.2m - AD000044 → AD000045: 230.2m - AD000047 → AD000048: 268.6m (largest outlier) ### Test Datasets **Test_Baseline**: AD000001-030 (30 images, normal spacing) **Test_Outlier_350m**: AD000045-050 (6 images, 268.6m outlier) **Test_Sharp_Turn_A**: AD000042, AD000044, AD000045, AD000046 (skip 043) **Test_Sharp_Turn_B**: AD000032-035 (220m jump) **Test_Sharp_Turn_C**: AD000003, AD000009 (5-frame gap) **Test_Long_Flight**: AD000001-060 (all 60 images, all variations) ## Acceptance Criteria Coverage | AC | Requirement | Test Specs | Status | |----|-------------|------------|--------| | AC-1 | 80% < 50m error | 31, 32, 50, 51, 54 | ✓ Covered | | AC-2 | 60% < 20m error | 33, 50, 51, 54 | ✓ Covered | | AC-3 | 350m outlier robust | 34, 35, 50, 52, 54 | ✓ Covered | | AC-4 | Sharp turn <5% overlap | 36, 37, 50, 53, 54, 55 | ✓ Covered | | AC-5 | Multi-fragment connection | 39, 50, 56 | ✓ Covered | | AC-6 | User input after 3 failures | 40, 50 | ✓ Covered | | AC-7 | <5s per image | 41, 42, 50, 51, 54 | ✓ Covered | | AC-8 | Real-time + refinement | 43, 44, 50 | ✓ Covered | | AC-9 | Registration >95% | 45, 46, 50, 51, 54 | ✓ Covered | | AC-10 | MRE <1.0px | 47, 50 | ✓ Covered | ## Component to Test Mapping | Component | ID | Integration Test | |-----------|-----|------------------| | Flight API | F01 | 09 | | Flight Lifecycle Manager | F02.1 | 11a | | Flight Processing Engine | F02.2 | 11b | | Flight Database | F03 | 16 | | Satellite Data Manager | F04 | 05 | | Image Input Pipeline | F05 | 07 | | Image Rotation Manager | F06 | 08 | | Sequential Visual Odometry | F07 | 01 | | Global Place Recognition | F08 | 02 | | Metric Refinement | F09 | 03 | | Factor Graph Optimizer | F10 | 04 | | Failure Recovery Coordinator | F11 | 14 | | Route Chunk Manager | F12 | 39, 55, 56 | | Coordinate Transformer | F13 | 06 | | Result Manager | F14 | 12 | | SSE Event Streamer | F15 | 10 | | Model Manager | F16 | 13 | | Configuration Manager | F17 | 15 | ## Test Execution Strategy ### Phase 1: Component Integration (01-16) - Validate each component individually - Verify interfaces and dependencies - Establish baseline performance metrics ### Phase 2: System Integration (21-25) - Test multi-component interactions - Validate end-to-end flows - Verify error handling across components ### Phase 3: Acceptance Testing (31-50) - Validate all acceptance criteria - Use GPS-analyzed real data - Measure against requirements ### Phase 4: Special Scenarios (51-56) - Test specific GPS-identified situations - Validate outliers and sharp turns - Chunk-based recovery scenarios - Full system validation ## Success Criteria Summary **Integration Tests**: All components pass individual tests, interfaces work correctly **System Tests**: Multi-component flows work, errors handled properly **Acceptance Tests**: All 10 ACs met with real data **Overall**: System meets all requirements, ready for deployment ## Test Metrics to Track - **Accuracy**: Mean error, RMSE, percentiles - **Performance**: Processing time per image, total time - **Reliability**: Registration rate, success rate - **Quality**: MRE, confidence scores - **Robustness**: Outlier handling, error recovery ## Notes - All test specs follow standard format (Integration vs Acceptance) - GPS-analyzed datasets based on actual test data coordinates - Specifications ready for QA team implementation - No code included per requirement - Tests cover all components and all acceptance criteria