Wrap up cycle 5 verification + documentation: - Steps 10/11 wrap-up reports (implementation_completeness + implementation_report) for the AZ-503-foundation + AZ-504 batch. - Step 12 test-spec sync: AZ-503-foundation/AZ-504 ACs appended; AZ-505 deferred ACs recorded. - Step 13 update-docs: architecture, data-model, glossary, module- layout, uav-tile-upload contract (v1.1.0), DataAccess + Services + Tests module docs synced; new common_uuidv5.md module doc. - Step 14 security audit: PASS_WITH_WARNINGS; 0 new Critical/High; 2 new Low informational (F1 flightId provenance, F2 pgcrypto deploy gap). - Step 15 performance test: PASS_WITH_INFRA_WARNINGS; PT-08 passed twice (AZ-504 fix verified); PT-01/02 failed due to recurring local Docker/colima DNS cold-start (not an app regression). Cycle-3 perf-harness leftover stays OPEN with replay #5 documented. - Autodev state moved to Step 16 (Deploy). Co-authored-by: Cursor <cursoragent@cursor.com>
19 KiB
Satellite Provider — Architecture
Architecture Vision
Satellite Provider is a self-hosted .NET 10 backend service that pre-downloads, caches, and composites Google Maps satellite imagery for offline use. It runs as a single containerized monolith with PostgreSQL, processing requests asynchronously via in-process queues. The dominant pattern is a layered architecture (API → Services → DataAccess → PostgreSQL) with background hosted services for long-running work.
Components & responsibilities (each owns its own .csproj since AZ-309):
- Common (
SatelliteProvider.Common) — Shared contracts: DTOs, service interfaces, common exceptions, configuration models, geospatial math - DataAccess (
SatelliteProvider.DataAccess) — PostgreSQL persistence via Dapper + DbUp migrations - TileDownloader (
SatelliteProvider.Services.TileDownloader) — Provider-agnostic tile acquisition viaISatelliteDownloaderinterface (first implementation: Google Maps) with deduplication, concurrency control, and an in-memory tile-byte cache owned byTileService - RegionProcessing (
SatelliteProvider.Services.RegionProcessing) — Batch tile downloads for geographic areas, stitching, output generation - RouteManagement (
SatelliteProvider.Services.RouteManagement) — Route interpolation, geofenced region generation, consolidated map output
The three Layer-3 service components are compile-time siblings: each only references SatelliteProvider.Common and SatelliteProvider.DataAccess. Cross-component runtime calls flow exclusively through interfaces in SatelliteProvider.Common.Interfaces.
Major data flows:
- Tile acquisition: HTTP request → cache check → Google Maps download → disk + DB persistence
- Region processing: Request queued → background worker calculates tile grid → downloads all tiles → produces CSV/summary/stitched image
- Route expansion: Waypoints → interpolated points every ~200m → geofence filtering → region requests per point → optional ZIP archive
Architectural principles (inferred):
- Single-instance deployment, no horizontal scaling requirements (
inferred-from: Channel-based queue, no distributed state) - Append-by-source-and-flight tile storage (AZ-503; refines AZ-484) — multiple producers (Google Maps, UAV upload from N flights, future SatAR, …) can each persist a row per
(tile_zoom, tile_x, tile_y, tile_size_meters)cell. Reads return the most-recent row across sources, ordered bycaptured_at DESCwith deterministic(updated_at DESC, id DESC)tie-breaks. The single-row-per-cell-per-source-per-flight invariant is enforced byidx_tiles_unique_identity(6-column integer-only,COALESCE(flight_id, '00000000-...'::uuid)) introduced in migration 014; this supersedes the AZ-484 float-basedidx_tiles_unique_location_source. Identity is deterministic across re-ingests:tiles.id = Uuidv5(TileNamespace, "{z}/{x}/{y}/{source}/{flight_id or zero-uuid}")andtiles.location_hash = Uuidv5(TileNamespace, "{z}/{x}/{y}"). Thetiles.versioncolumn remains vestigial since AZ-357 dropped year-based cache invalidation in favour of cell-level overwrite. (inferred-from: tiles table + AZ-484/AZ-357/AZ-503 migrations + tile-storage contract v1.0.0) - Cross-repo deterministic tile identity (AZ-503) — the
TileNamespaceUUID5b8d0c2e-7f1a-4d3b-9c5e-1f3a8e7d2b6cand the canonical name format are shared with the sibling workspacegps-denied-onboard(components/c6_tile_cache/_uuid.py:TILE_NAMESPACE). Both sides MUST produce byte-identical UUIDv5 output so an onboard-cached tile and a server-cached tile for the same(z, x, y, source, flight_id)are recognized as the same artifact without a round-trip. Changing the namespace constant on either side is a coordinated cross-repo break. (inferred-from: Uuidv5.cs, AZ-503 task spec § Constraints) - Fire-and-forget async processing with status polling (
inferred-from: queue + background service + status endpoint) - JWT-validated callers only — every HTTP endpoint requires a valid HS256-signed Bearer token, validated locally against a shared
JWT_SECRETper the suite-level auth contract (suite/_docs/10_auth.md). Issuer/audience are intentionally not validated yet; signature + lifetime + ≥32-byte key are. Per-endpoint permission claims (e.g.permissions: ["GPS"]on the UAV upload) layer on top of this baseline.
Authentication & Authorization (AZ-487):
- Validation library:
Microsoft.AspNetCore.Authentication.JwtBearer10.0.7 (matchesMicrosoft.AspNetCore.OpenApi10.0.7; AZ-496 bumped both packages from 8.0.21 → 8.0.25 in cycle 3 to close the cycle-1 D1 + cycle-2 D3 supply-chain findings, then AZ-500 bumped both 8.0.25 → 10.0.7 in cycle 4 as part of the .NET 8 → .NET 10 migration). TheTokenValidationParametersshape is unchanged across the JwtBearer 8 → 10 jump — AZ-487/AZ-494 integration tests are the gate and all pass on .NET 10. - Signing key: read from the
JWT_SECRETenvironment variable (preferred) or theJwt:Secretconfiguration key. Startup fails fast if the resolved secret is unset, empty, or shorter than 32 bytes (HMAC-SHA256 minimum per RFC 2104 §3). - Token contract:
ValidateIssuerSigningKey = true,ValidateLifetime = true,RequireSignedTokens = true,RequireExpirationTime = true,ValidateIssuer = true+ValidIssuer = $JWT_ISSUER,ValidateAudience = true+ValidAudience = $JWT_AUDIENCE(AZ-494),ClockSkew = 30s. The 5-minute JwtBearer default is intentionally tightened. - Authorization model: every endpoint registered in
Program.csis decorated with.RequireAuthorization(). AZ-488 addspermissions-claim policies on top of this baseline (UAV upload requiresGPS). - Test infrastructure:
JwtTokenFactory(unit tests) andJwtTestHelpers(integration tests) mint deterministic tokens against the sameJWT_SECRET; the integration test runner attaches a default Bearer token to its sharedHttpClientso legacy non-auth tests continue to exercise the protected endpoints unchanged.
Planned features (confirmed by user, currently stubs):
- MGRS endpoint — tile access via Military Grid Reference System coordinates
Multi-source tile producers (live as of AZ-488; per-flight evidence isolation as of AZ-503):
- Google Maps —
TileService.DownloadAndStoreTilesAsync/DownloadAndStoreSingleTileAsyncstampsource='google_maps',flight_id=NULL, and a deterministic UUIDv5idon every persisted row; tile JPEGs live under{StorageConfig.TilesDirectory}/{zoom}/...per the legacy grandfathered layout.content_sha256is computed from the on-disk JPEG body. - UAV —
POST /api/satellite/upload(AZ-488; per-flight key extended by AZ-503) accepts a multipart batch of UAV-captured tiles, runs each item through a 5-rule quality gate (UavTileQualityGate), and persists accepted items viaITileRepository.InsertAsyncwithsource='uav',flight_id = metadata.flightId(or NULL for anonymous uploads), and a deterministic UUIDv5id. UAV JPEGs live under{StorageConfig.TilesDirectory}/uav/{flight_id or 'none'}/{zoom}/{x}/{y}.jpg, sorm -rf ./tiles/uav/{flight_id}/removes one flight's evidence without touching other flights at overlapping cells. Requires theGPSpermission claim on top of the JWT baseline.
The N-source storage contract is authoritative in _docs/02_document/contracts/data-access/tile-storage.md (v1.0.0; v2.0.0 bump tracking the AZ-503 identity columns is deferred to AZ-505 when the new identity surface freezes for external consumers). The UAV upload contract is authoritative in _docs/02_document/contracts/api/uav-tile-upload.md (v1.0.0; AZ-503 added an optional flightId field to per-item metadata — backward-compatible). Anything that reads or writes tiles MUST follow those contracts rather than re-deriving the rules from prose here.
Drift signals:
geofence_polygonsmentioned in AGENTS.md as a routes table column but does not exist in schema or entity — documentation drift
1. System Context
Problem being solved: A GPS-denied UAV navigation service requires satellite imagery for positioning and route planning without GPS. This service pre-downloads satellite tiles from one or more imagery sources (currently Google Maps; future sources including UAV nadir camera upload and additional providers such as SatAR) for specified regions and routes, stores them alongside each other under a per-source storage key, and serves the most-recent row across sources on access. Tiles are stitched into composite images and packaged for offline use.
System boundaries: The Satellite Provider is a self-contained backend service. It receives HTTP requests (region/route definitions), downloads tiles from Google Maps, stores them on disk and in PostgreSQL, and produces output files (images, CSVs, ZIPs).
External systems:
| System | Integration Type | Direction | Purpose |
|---|---|---|---|
| Satellite imagery provider (e.g., Google Maps) | HTTPS (tile download) | Outbound | First implementation of the multi-source tiles storage; provider-agnostic via ISatelliteDownloader. Stamps source='google_maps' on every persisted row. |
| GPS-Denied Service (UAV) | REST API (multipart) | Inbound | Producer of source='uav' rows via POST /api/satellite/upload (AZ-488). Authenticates with a JWT carrying the GPS permission claim; items pass through the 5-rule quality gate before persistence. |
| PostgreSQL | TCP (Npgsql) | Both | Tile metadata, region/route state |
| File System | Local disk | Both | Tile image storage, output artifacts |
| HTTP Clients | REST API | Inbound | Region/route requests, tile queries |
2. Technology Stack
| Layer | Technology | Version | Rationale |
|---|---|---|---|
| Language | C# | 14.0 (was 12.0 through cycle 3 — AZ-500) | .NET ecosystem, strong typing |
| Framework | ASP.NET Core (Minimal API) | 10.0 (was 8.0 through cycle 3 — AZ-500) | Lightweight HTTP hosting |
| Database | PostgreSQL | 15+ | Reliable RDBMS, spatial-friendly |
| ORM | Dapper | latest | Micro-ORM, raw SQL control |
| Migrations | DbUp | latest | Simple SQL-file-based schema migrations |
| Image Processing | SixLabors.ImageSharp | 3.1.11 | Cross-platform image manipulation |
| Logging | Serilog | 8.0.3 | Structured logging with file sinks |
| Hosting | Docker (docker-compose) | — | Containerized deployment |
| CI/CD | Woodpecker CI | — | Lightweight self-hosted CI |
3. Deployment Model
Environments: Development (docker-compose), Production (Docker)
Infrastructure:
- Docker-based containerized deployment
- PostgreSQL as a separate container
- Shared volumes for tile storage and output artifacts
- No cloud provider dependency (self-hosted capable)
Environment-specific configuration:
| Config | Development | Production |
|---|---|---|
| Database | localhost:5432 (Docker) | Container network db:5432 |
| Secrets | appsettings.Development.json | Environment variables |
| Logging | Console + File | File (./logs/) |
| API URL | http://localhost:5100 | http://0.0.0.0:5100 |
4. Data Model Overview
Core entities:
| Entity | Description | Owned By Component |
|---|---|---|
| Tile | A single satellite image tile with coordinates and zoom | TileDownloader |
| Region | A square area request with processing status | RegionProcessing |
| Route | A named path with geofence polygons | RouteManagement |
| RoutePoint | An individual point (original or interpolated) on a route | RouteManagement |
Key relationships:
- Route → RoutePoint: one-to-many (a route has many sequential points)
- Route → Region: many-to-many via
route_regions(each route point generates a region) - Region → Tile: implicit (a processed region references tiles by coordinate/zoom)
Data flow summary:
- Client → API → Queue → BackgroundService → GoogleMaps → FileSystem + DB: tile acquisition pipeline
- Client → API → RouteService → PointInterpolation → RegionCreation → Queue: route-to-region expansion
5. Integration Points
Internal Communication
| From | To | Protocol | Pattern | Notes |
|---|---|---|---|---|
| WebApi | RegionProcessing | In-process queue (Channel) | Fire-and-forget | Request queued, status polled. Uses IRegionService / IRegionRequestQueue from Common. |
| WebApi | TileDownloader | ITileService (Common interface) |
Request-Response | Single-tile reads (GetOrDownloadTileAsync) and writes (DownloadAndStoreSingleTileAsync) flow through ITileService since AZ-310 / AZ-311. No direct dependency on the concrete GoogleMapsDownloaderV2. |
| RegionProcessing | TileDownloader | ITileService (Common interface) |
Request-Response | Per-tile within region processing. Resolved through DI; no compile-time ProjectReference between RegionProcessing and TileDownloader csprojs. |
| RouteManagement | RegionProcessing | IRegionService / IRegionRequestQueue (Common interfaces) |
Fire-and-forget | Route regions submitted to queue. No compile-time ProjectReference between RouteManagement and RegionProcessing csprojs. |
| All Services | DataAccess | Direct method call (via repository interfaces) | Repository pattern | Dapper queries |
External Integrations
| External System | Protocol | Auth | Rate Limits | Failure Mode |
|---|---|---|---|---|
Satellite imagery provider (abstracted via ISatelliteDownloader; first implementation: Google Maps) |
HTTPS GET | Provider-specific (e.g., session token) | Configured concurrency (MaxConcurrentDownloads) | Retry with backoff, mark region failed |
6. Non-Functional Requirements
| Requirement | Target | Measurement | Priority |
|---|---|---|---|
| Concurrent Downloads | 4 (configurable) | SemaphoreSlim limit | High |
| Concurrent Regions | 20 (configurable) | Processing config | Medium |
| Queue Capacity | 1000 requests | Channel bounded capacity | Medium |
| Tile Deduplication | 100% (no re-download) | DB lookup before fetch | High |
| Max Zip Size | 50 MB | Route zip output | Medium |
7. Security Architecture
Authentication: HS256 JWT Bearer tokens (AZ-487 + AZ-494). Signing key from JWT_SECRET env var (≥ 32 bytes, validated at startup). Issuer and audience claims are validated against JWT_ISSUER / JWT_AUDIENCE env vars (AZ-494) — both required, fail-fast at startup if unset. Microsoft.AspNetCore.Authentication.JwtBearer validates signature, lifetime, signing key, issuer, and audience. ClockSkew tightened from JwtBearer default (5 min) to 30 s. Tokens are minted by the centralized Admin API per suite/_docs/10_auth.md; their iss and aud claims MUST match the satellite-provider configured values or validation rejects with 401.
Authorization: Every endpoint requires authentication via .RequireAuthorization(). Permission-claim enforcement is layered on top through the PermissionsRequirement authorization handler, which reads the permissions claim (accepting either repeated string claims OR a single JSON-array string). AZ-488 wires the RequiresGpsPermission policy on POST /api/satellite/upload — callers without GPS receive HTTP 403; other endpoints accept any authenticated principal.
Data protection:
- At rest: No encryption (tiles stored as plain JPEG files)
- In transit: HTTPS for Google Maps calls; API itself runs HTTP behind Kestrel (TLS termination is a deployment-layer concern)
- Secrets management:
JWT_SECRETandGOOGLE_MAPS_API_KEYfrom environment variables /.env(gitignored);.env.exampledocuments the required keys. Production deployments MUST supply both via the host environment, never via the appsettings files.
Audit logging: Serilog writes to file; logs exceptions and processing state transitions. 401/403 responses are emitted by the JwtBearer middleware via the WWW-Authenticate header; no body leakage of internal details.
8. Key Architectural Decisions
ADR-001: Minimal API over Controller-based
Context: Project needed a lightweight HTTP layer for a small set of endpoints.
Decision: Use ASP.NET Core Minimal APIs (no controllers, no MVC).
Consequences: Less ceremony, all routing in Program.cs, but less structure for future growth.
ADR-002: Dapper over Entity Framework
Context: Database access is straightforward CRUD with some spatial queries.
Decision: Use Dapper for raw SQL control and performance, paired with DbUp for schema migrations.
Consequences: Full SQL control, no ORM overhead; trade-off is manual mapping and no change tracking.
ADR-003: In-Process Queue over External Message Broker
Context: Region/route processing needs to be asynchronous but the system is a single service.
Decision: Use System.Threading.Channels as an in-process bounded queue.
Consequences: Simple, no external dependencies; but limited to single-instance deployment — no horizontal scaling of workers.
ADR-004: File-Based Tile Storage
Context: Tiles are immutable JPEG images that need fast random access.
Decision: Store tiles as files in a directory hierarchy with metadata in PostgreSQL. The layout is per-source (and per-flight for UAV since AZ-503) so the bytes for distinct producers / flights at the same cell remain individually addressable on disk:
- Google Maps (legacy, grandfathered):
{StorageConfig.TilesDirectory}/{zoom}/{x_bucket}/{y_bucket}/tile_{zoom}_{x}_{y}_{timestamp}.jpg - UAV anonymous (AZ-488 baseline):
{StorageConfig.TilesDirectory}/uav/none/{zoom}/{x}/{y}.jpg - UAV per-flight (AZ-503):
{StorageConfig.TilesDirectory}/uav/{flight_id}/{zoom}/{x}/{y}.jpg
The authoritative source/flight markers are the tiles.source and tiles.flight_id columns; the per-source / per-flight on-disk path matters only for write isolation and bulk-delete granularity.
Consequences: Fast reads, easy backup/migration, producers can run without colliding on bytes, and per-flight rm -rf becomes safe. Requires shared filesystem for multi-instance (not currently needed). No migration of pre-AZ-488 Google Maps files is shipped — the legacy layout stays intact. Pre-AZ-503 UAV files written by the AZ-488 baseline at ./tiles/uav/{z}/{x}/{y}.jpg (no flight segment) are not relocated by the migration; the post-AZ-503 code writes anonymous uploads to ./tiles/uav/none/{z}/{x}/{y}.jpg and the original AZ-488-era files stay where they were. This is acceptable because AZ-488 only landed in cycle 2 and the volume of pre-AZ-503 UAV bytes is small (no production UAV upload traffic yet).
ADR-005: Background Hosted Services for Processing
Context: Region and route processing is long-running and should not block HTTP requests.
Decision: Use IHostedService implementations that consume from the in-process queue.
Consequences: Clean separation of request handling and processing; lifecycle managed by the host.