# ASTRAL-Next Test Specifications Summary

## Overview
Comprehensive test specifications for the GPS-denied navigation system following the QA testing pyramid approach.

**Total Test Specifications**: 48

## Test Organization

### Integration Tests (01-16): Component Level
Tests each system component individually, together with its direct dependencies.

**Vision Pipeline (01-04)**:
- 01: Sequential Visual Odometry (F07 - SuperPoint + LightGlue)
- 02: Global Place Recognition (F08 - AnyLoc/DINOv2)
- 03: Metric Refinement (F09 - LiteSAM)
- 04: Factor Graph Optimizer (F10 - GTSAM)

**Data Management (05-08)**:
- 05: Satellite Data Manager (F04)
- 06: Coordinate Transformer (F13)
- 07: Image Input Pipeline (F05)
- 08: Image Rotation Manager (F06)

**Service Infrastructure (09-12)**:
- 09: REST API (F01 - FastAPI endpoints)
- 10: SSE Event Streamer (F15 - real-time streaming)
- 11a: Flight Lifecycle Manager (F02.1 - CRUD, initialization, API delegation)
- 11b: Flight Processing Engine (F02.2 - processing loop, recovery coordination)
- 12: Result Manager (F14)

**Support Components (13-16)**:
- 13: Model Manager (F16 - TensorRT)
- 14: Failure Recovery Coordinator (F11)
- 15: Configuration Manager (F17)
- 16: Database Layer (F03)

### System Integration Tests (21-25): Multi-Component Flows
Tests integration between multiple components.

- 21: End-to-End Normal Flight
- 22: Satellite to Vision Pipeline (F04 → F07/F08/F09)
- 23: Vision to Optimization Pipeline (F07/F08/F09 → F10)
- 24: Multi-Component Error Propagation
- 25: Real-Time Streaming Pipeline (F02 → F14 → F15)

### Acceptance Tests (31-50): Requirements Validation
Tests mapped to 10 acceptance criteria.

**Accuracy (31-33)**:
- 31: AC-1 - 80% < 50m error (baseline)
- 32: AC-1 - 80% < 50m error (varied terrain)
- 33: AC-2 - 60% < 20m error (high precision)
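
The AC-1 and AC-2 thresholds above can be sketched as a simple pass/fail check. This is a minimal illustration, not the project's harness: the function names are hypothetical, "80%" and "60%" are assumed to mean "at least that share of images", and the sample errors are made-up values rather than measured results.

```python
from typing import Dict, Sequence


def fraction_within(errors_m: Sequence[float], threshold_m: float) -> float:
    """Return the share of per-image position errors strictly below threshold_m."""
    if not errors_m:
        raise ValueError("errors_m must not be empty")
    return sum(1 for e in errors_m if e < threshold_m) / len(errors_m)


def check_accuracy(errors_m: Sequence[float]) -> Dict[str, bool]:
    """Evaluate AC-1 (80% < 50m) and AC-2 (60% < 20m) on a list of errors in metres."""
    return {
        "AC-1": fraction_within(errors_m, 50.0) >= 0.80,  # assumption: ">=" semantics
        "AC-2": fraction_within(errors_m, 20.0) >= 0.60,  # assumption: ">=" semantics
    }


if __name__ == "__main__":
    # Illustrative values only; real errors come from comparing estimates to ground truth.
    sample_errors_m = [5, 12, 18, 19, 15, 8, 30, 45, 60, 120]
    print(check_accuracy(sample_errors_m))
```

Keeping the threshold logic in one helper means tests 31-33 can share it and differ only in the dataset they feed in.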

**Robustness - Outliers (34-35)**:
- 34: AC-3 - Single 350m outlier handling
- 35: AC-3 - Multiple outliers handling

**Robustness - Sharp Turns (36-38)**:
- 36: AC-4 - Sharp turn zero overlap recovery
- 37: AC-4 - Sharp turn minimal overlap (<5%)
- 38: Outlier anchor detection

**Multi-Fragment (39)**:
- 39: AC-5 - Multi-fragment route connection (chunk architecture)

**User Interaction (40)**:
- 40: AC-6 - User input after 3 consecutive failures

**Performance (41-44)**:
- 41: AC-7 - <5s single image processing
- 42: AC-7 - Sustained throughput performance
- 43: AC-8 - Real-time streaming results
- 44: AC-8 - Async refinement delivery

**Quality Metrics (45-47)**:
- 45: AC-9 - Registration rate >95% (baseline)
- 46: AC-9 - Registration rate >95% (challenging conditions)
- 47: AC-10 - Mean Reprojection Error <1.0 pixels

**Cross-Cutting (48-50)**:
- 48: Long flight (3000 images)
- 49: Degraded satellite data
- 50: Complete system acceptance validation

### GPS-Analyzed Scenario Tests (51-54): Real Data
Tests using GPS-analyzed test datasets.

- 51: Test_Baseline (AD000001-030) - Standard flight
- 52: Test_Outlier_350m (AD000045-050) - Outlier scenario
- 53: Test_Sharp_Turn - Multiple sharp turn datasets
- 54: Test_Long_Flight (AD000001-060) - Full dataset

### Chunk-Based Recovery Tests (55-56)
- 55: Chunk rotation recovery (rotation sweeps for chunks)
- 56: Multi-chunk simultaneous processing (Atlas architecture)

## Test Data

### GPS Analysis Results
- Mean distance between consecutive frames: 120.8m
- Min distance between consecutive frames: 24.2m
- Max distance between consecutive frames: 268.6m

**Identified Sharp Turns (>200m)**:
- AD000003 → AD000004: 202.2m
- AD000032 → AD000033: 220.6m
- AD000042 → AD000043: 234.2m
- AD000044 → AD000045: 230.2m
- AD000047 → AD000048: 268.6m (largest outlier)
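
One way such an analysis could be reproduced is sketched below. The document does not state how the distances were computed, so the haversine formula, the function names, and the frame tuple layout are all assumptions, and the coordinates in the usage example are placeholders rather than values from the AD0000xx dataset.

```python
import math
from typing import List, Sequence, Tuple

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres

Frame = Tuple[str, float, float]  # (image id, latitude in degrees, longitude in degrees)


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two WGS84 points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def find_large_jumps(frames: Sequence[Frame], threshold_m: float = 200.0) -> List[Tuple[str, str, float]]:
    """Return (from_id, to_id, distance_m) for consecutive frames further apart than threshold_m."""
    jumps = []
    for (id_a, lat_a, lon_a), (id_b, lat_b, lon_b) in zip(frames, frames[1:]):
        distance = haversine_m(lat_a, lon_a, lat_b, lon_b)
        if distance > threshold_m:
            jumps.append((id_a, id_b, distance))
    return jumps


if __name__ == "__main__":
    # Placeholder coordinates: roughly 111m, then roughly 334m, between frames.
    frames = [("A", 48.000, 37.0), ("B", 48.001, 37.0), ("C", 48.004, 37.0)]
    for from_id, to_id, dist in find_large_jumps(frames):
        print(f"{from_id} -> {to_id}: {dist:.1f}m")
```

The 200m default mirrors the threshold used in the list above; the mean, minimum, and maximum can be taken from the same per-pair distances.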

### Test Datasets
- **Test_Baseline**: AD000001-030 (30 images, normal spacing)
- **Test_Outlier_350m**: AD000045-050 (6 images, 268.6m outlier)
- **Test_Sharp_Turn_A**: AD000042, AD000044, AD000045, AD000046 (skip 043)
- **Test_Sharp_Turn_B**: AD000032-035 (220m jump)
- **Test_Sharp_Turn_C**: AD000003, AD000009 (5-frame gap)
- **Test_Long_Flight**: AD000001-060 (all 60 images, all variations)

## Acceptance Criteria Coverage

| AC | Requirement | Test Specs | Status |
|----|-------------|------------|--------|
| AC-1 | 80% < 50m error | 31, 32, 50, 51, 54 | ✓ Covered |
| AC-2 | 60% < 20m error | 33, 50, 51, 54 | ✓ Covered |
| AC-3 | Robust to 350m outlier | 34, 35, 50, 52, 54 | ✓ Covered |
| AC-4 | Sharp turn <5% overlap | 36, 37, 50, 53, 54, 55 | ✓ Covered |
| AC-5 | Multi-fragment connection | 39, 50, 56 | ✓ Covered |
| AC-6 | User input after 3 failures | 40, 50 | ✓ Covered |
| AC-7 | <5s per image | 41, 42, 50, 51, 54 | ✓ Covered |
| AC-8 | Real-time + refinement | 43, 44, 50 | ✓ Covered |
| AC-9 | Registration >95% | 45, 46, 50, 51, 54 | ✓ Covered |
| AC-10 | MRE <1.0px | 47, 50 | ✓ Covered |
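
A coverage table like this can be kept honest with a small consistency check. The sketch below transcribes the table into a dictionary and looks for criteria with no tests, references to spec IDs outside the 31-56 range, and specs in that range that no criterion references. The function names are hypothetical, and treating 31-56 as the full set of acceptance and scenario specs is an assumption based on the ranges listed in this summary.

```python
from typing import Dict, Iterable, List, Set

# Transcribed from the coverage table above.
AC_COVERAGE: Dict[str, List[int]] = {
    "AC-1": [31, 32, 50, 51, 54],
    "AC-2": [33, 50, 51, 54],
    "AC-3": [34, 35, 50, 52, 54],
    "AC-4": [36, 37, 50, 53, 54, 55],
    "AC-5": [39, 50, 56],
    "AC-6": [40, 50],
    "AC-7": [41, 42, 50, 51, 54],
    "AC-8": [43, 44, 50],
    "AC-9": [45, 46, 50, 51, 54],
    "AC-10": [47, 50],
}

# Assumption: acceptance (31-50) and scenario (51-56) spec IDs form one contiguous range.
KNOWN_SPECS: Set[int] = set(range(31, 57))


def coverage_gaps(coverage: Dict[str, List[int]], expected_acs: Iterable[str]) -> List[str]:
    """Return acceptance criteria that have no test spec assigned."""
    return [ac for ac in expected_acs if not coverage.get(ac)]


def unknown_specs(coverage: Dict[str, List[int]], known: Set[int]) -> List[int]:
    """Return referenced spec IDs that are not in the known set."""
    referenced = {spec for specs in coverage.values() for spec in specs}
    return sorted(referenced - known)


def unmapped_specs(coverage: Dict[str, List[int]], known: Set[int]) -> List[int]:
    """Return known spec IDs that no acceptance criterion references."""
    referenced = {spec for specs in coverage.values() for spec in specs}
    return sorted(known - referenced)


if __name__ == "__main__":
    all_acs = [f"AC-{i}" for i in range(1, 11)]
    print("ACs without tests:", coverage_gaps(AC_COVERAGE, all_acs))
    print("Unknown spec IDs:", unknown_specs(AC_COVERAGE, KNOWN_SPECS))
    print("Specs not tied to an AC:", unmapped_specs(AC_COVERAGE, KNOWN_SPECS))
```

Run against the table as written, the first two checks come back empty, and the third reports specs 38, 48, and 49 (outlier anchor detection, long flight, and degraded satellite data), which are not tied to a specific AC.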

## Component to Test Mapping

| Component | ID | Integration Test |
|-----------|-----|------------------|
| Flight API | F01 | 09 |
| Flight Lifecycle Manager | F02.1 | 11a |
| Flight Processing Engine | F02.2 | 11b |
| Flight Database | F03 | 16 |
| Satellite Data Manager | F04 | 05 |
| Image Input Pipeline | F05 | 07 |
| Image Rotation Manager | F06 | 08 |
| Sequential Visual Odometry | F07 | 01 |
| Global Place Recognition | F08 | 02 |
| Metric Refinement | F09 | 03 |
| Factor Graph Optimizer | F10 | 04 |
| Failure Recovery Coordinator | F11 | 14 |
| Route Chunk Manager | F12 | 39, 55, 56 (acceptance and scenario tests; no dedicated integration test) |
| Coordinate Transformer | F13 | 06 |
| Result Manager | F14 | 12 |
| SSE Event Streamer | F15 | 10 |
| Model Manager | F16 | 13 |
| Configuration Manager | F17 | 15 |

## Test Execution Strategy

### Phase 1: Component Integration (01-16)
- Validate each component individually
- Verify interfaces and dependencies
- Establish baseline performance metrics

### Phase 2: System Integration (21-25)
- Test multi-component interactions
- Validate end-to-end flows
- Verify error handling across components

### Phase 3: Acceptance Testing (31-50)
- Validate all acceptance criteria
- Use GPS-analyzed real data
- Measure against requirements

### Phase 4: Special Scenarios (51-56)
- Test specific GPS-identified situations
- Validate outliers and sharp turns
- Chunk-based recovery scenarios
- Full system validation

## Success Criteria Summary

- **Integration Tests**: All components pass individual tests, interfaces work correctly
- **System Tests**: Multi-component flows work, errors handled properly
- **Acceptance Tests**: All 10 ACs met with real data
- **Overall**: System meets all requirements, ready for deployment

## Test Metrics to Track

- **Accuracy**: Mean error, RMSE, percentiles
- **Performance**: Processing time per image, total time
- **Reliability**: Registration rate, success rate
- **Quality**: MRE, confidence scores
- **Robustness**: Outlier handling, error recovery
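
As a rough illustration of how several of these metrics could be aggregated per test run, the sketch below computes mean error, RMSE, a nearest-rank percentile, registration rate, and mean time per image. The function names, the result keys, and the choice of the nearest-rank percentile method are assumptions made for this example; the specifications do not prescribe them.

```python
import math
from statistics import mean
from typing import Dict, Sequence


def percentile(values: Sequence[float], pct: int) -> float:
    """Nearest-rank percentile for an integer pct in 1..100 (assumed method)."""
    if not values:
        raise ValueError("values must not be empty")
    if not 1 <= pct <= 100:
        raise ValueError("pct must be between 1 and 100")
    ordered = sorted(values)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


def summarize_run(errors_m: Sequence[float], registered: int, total: int,
                  elapsed_s: float) -> Dict[str, float]:
    """Aggregate accuracy, reliability, and performance metrics for one run."""
    if total <= 0:
        raise ValueError("total must be positive")
    return {
        "mean_error_m": mean(errors_m),
        "rmse_m": math.sqrt(mean([e * e for e in errors_m])),
        "p80_error_m": percentile(errors_m, 80),
        "registration_rate": registered / total,
        "mean_time_per_image_s": elapsed_s / total,
    }


if __name__ == "__main__":
    # Illustrative inputs only, not measured results.
    print(summarize_run([10, 20, 30, 40, 50], registered=29, total=30, elapsed_s=90.0))
```

Reporting the 80th percentile alongside the mean lines up with how AC-1 is phrased, since a few large outliers can inflate the mean without affecting the share of images under 50m.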

## Notes

- Each test spec follows one of two standard formats (integration or acceptance)
- GPS-analyzed datasets based on actual test data coordinates
- Specifications ready for QA team implementation
- The test specifications themselves contain no code, per requirement
- Tests cover all components and all acceptance criteria