Update configuration and test structure for improved clarity and functionality

- Modified `.gitignore` to include test fixture data while excluding test results.
- Updated `config.yaml` to change the model from 'yolo11m.yaml' to 'yolo26m.pt'.
- Enhanced `.cursor/rules/coderule.mdc` with additional guidelines for test environment consistency and infrastructure handling.
- Revised autopilot state management in `_docs/_autopilot_state.md` to reflect current progress and tasks.
- Removed outdated augmentation tests and adjusted dataset formation tests to align with the new structure.

These changes streamline the configuration and testing processes, ensuring better organization and clarity in the project.
Author: Oleksandr Bezdieniezhnykh
Date: 2026-03-28 06:11:55 +02:00
Parent: cdcd1f6ea7
Commit: a47fa135de
119 changed files with 824 additions and 774 deletions
+1 -53
@@ -1,61 +1,9 @@
# Blackbox Test Scenarios
## BT-AUG: Augmentation Pipeline
### BT-AUG-01: Single image produces 8 outputs
- **Input**: 1 image + 1 valid label from fixture dataset
- **Action**: Run `Augmentator.augment_inner()` on the image
- **Expected**: Returns list of exactly 8 ImageLabel objects
- **Traces**: AC: 8× augmentation ratio
### BT-AUG-02: Augmented filenames follow naming convention
- **Input**: Image with stem "test_image"
- **Action**: Run `augment_inner()`
- **Expected**: Output filenames: `test_image.jpg`, `test_image_1.jpg` through `test_image_7.jpg`; matching `.txt` labels
- **Traces**: AC: Augmentation output format
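A minimal sketch of this naming rule (the helper name `augmented_names` is illustrative, not part of the project's API):

```python
def augmented_names(stem: str, count: int = 8, ext: str = ".jpg") -> list[str]:
    """First output keeps the original stem; variants get _1.._7 suffixes."""
    return [f"{stem}{ext}"] + [f"{stem}_{i}{ext}" for i in range(1, count)]

names = augmented_names("test_image")
# matching labels: augmented_names("test_image", ext=".txt")
```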
### BT-AUG-03: All output bounding boxes in valid range
- **Input**: 1 image + label with multiple bboxes
- **Action**: Run `augment_inner()`
- **Expected**: Every bbox coordinate in every output label is in [0.0, 1.0]
- **Traces**: AC: Bounding boxes clipped to [0, 1]
### BT-AUG-04: Bounding box correction clips edge bboxes
- **Input**: Label with bbox near edge: `0 0.99 0.5 0.2 0.1`
- **Action**: Run `correct_bboxes()`
- **Expected**: Width reduced so bbox fits within [margin, 1-margin]; no coordinate exceeds bounds
- **Traces**: AC: Bounding boxes clipped to [0, 1]
### BT-AUG-05: Tiny bounding boxes removed after correction
- **Input**: Label with tiny bbox that becomes < 0.01 after clipping
- **Action**: Run `correct_bboxes()`
- **Expected**: Bbox removed from output (area < correct_min_bbox_size)
- **Traces**: AC: Bounding boxes with area < 0.01% discarded
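The clipping and tiny-box removal exercised by BT-AUG-04/05 can be sketched as follows (a dependency-free approximation of `correct_bboxes()`; the real implementation may use a nonzero margin and an area-based threshold rather than the per-side `min_size` shown here):

```python
def correct_bboxes(bboxes, margin=0.0, min_size=0.01):
    """Clip YOLO-format (cls, cx, cy, w, h) boxes into [margin, 1 - margin];
    drop boxes whose clipped width or height falls below min_size."""
    out = []
    for cls, cx, cy, w, h in bboxes:
        x1 = max(margin, cx - w / 2)
        y1 = max(margin, cy - h / 2)
        x2 = min(1.0 - margin, cx + w / 2)
        y2 = min(1.0 - margin, cy + h / 2)
        nw, nh = x2 - x1, y2 - y1
        if nw < min_size or nh < min_size:
            continue  # tiny box after clipping: discard
        out.append((cls, (x1 + x2) / 2, (y1 + y2) / 2, nw, nh))
    return out
```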
### BT-AUG-06: Empty label produces 8 outputs with empty labels
- **Input**: 1 image + empty label file
- **Action**: Run `augment_inner()`
- **Expected**: 8 ImageLabel objects returned; all have empty labels lists
- **Traces**: AC: Augmentation handles empty annotations
### BT-AUG-07: Full augmentation pipeline (filesystem integration)
- **Input**: 5 images + labels copied to data/ directory in tmp_path
- **Action**: Run `augment_annotations()` with patched paths
- **Expected**: 40 images (5 × 8) in processed images dir; 40 matching labels in processed labels dir
- **Traces**: AC: 8× augmentation, filesystem output
### BT-AUG-08: Augmentation skips already-processed images
- **Input**: 5 images in data/; 3 already present in processed/ dir
- **Action**: Run `augment_annotations()`
- **Expected**: Only 2 new images processed (16 new outputs); existing 3 untouched
- **Traces**: AC: Augmentation processes only unprocessed images
---
## BT-DSF: Dataset Formation
### BT-DSF-01: 70/20/10 split ratio
- **Input**: 100 images + labels in data/ dir
- **Action**: Run `form_dataset()` with patched paths
- **Expected**: train: 70 images+labels, valid: 20, test: 10
- **Traces**: AC: Dataset split 70/20/10
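The expected counts follow from a simple ratio computation; a sketch (assuming the test bucket absorbs rounding, which the real `form_dataset()` may handle differently):

```python
def split_counts(n: int, ratios=(0.7, 0.2, 0.1)) -> tuple[int, int, int]:
    """Return (train, valid, test) counts for n items under a 70/20/10 split."""
    train = int(n * ratios[0])
    valid = int(n * ratios[1])
    test = n - train - valid  # remainder goes to the test split
    return train, valid, test
```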
@@ -1,18 +1,5 @@
# Performance Test Scenarios
## PT-AUG-01: Augmentation throughput
- **Input**: 10 images from fixture dataset
- **Action**: Run `augment_annotations()`, measure wall time
- **Expected**: Completes within 60 seconds (10 images × 8 outputs = 80 files)
- **Traces**: Restriction: Augmentation runs continuously
- **Note**: Threshold is generous; actual performance depends on CPU
## PT-AUG-02: Parallel augmentation speedup
- **Input**: 10 images from fixture dataset
- **Action**: Run with ThreadPoolExecutor vs sequential, compare times
- **Expected**: Parallel is ≥ 1.5× faster than sequential
- **Traces**: AC: Parallelized per-image processing
## PT-DSF-01: Dataset formation throughput
- **Input**: 100 images + labels
- **Action**: Run `form_dataset()`, measure wall time
+2 -20
@@ -1,25 +1,7 @@
# Resilience Test Scenarios
## RT-AUG-01: Augmentation handles corrupted image gracefully
- **Input**: 1 valid image + 1 corrupted image file (truncated JPEG) in data/ dir
- **Action**: Run `augment_annotations()`
- **Expected**: Valid image produces 8 outputs; corrupted image skipped without crashing pipeline; total output: 8 files
- **Traces**: Restriction: Augmentation exception handling per-image
## RT-AUG-02: Augmentation handles missing label file
- **Input**: 1 image with no matching label file
- **Action**: Run `augment_annotation()` on the image
- **Expected**: Exception caught per-thread; does not crash pipeline
- **Traces**: Restriction: Augmentation exception handling
## RT-AUG-03: Augmentation transform failure produces fewer variants
- **Input**: 1 image + label that causes some transforms to fail (extremely narrow bbox)
- **Action**: Run `augment_inner()`
- **Expected**: Returns 1-8 ImageLabel objects (original always present; failed variants skipped); no crash
- **Traces**: Restriction: Transform failure handling
## RT-DSF-01: Dataset formation with empty data directory
- **Input**: Empty data images dir
- **Action**: Run `form_dataset()`
- **Expected**: Creates empty train/valid/test directories; no crash
- **Traces**: Restriction: Edge case handling
@@ -1,11 +1,5 @@
# Resource Limit Test Scenarios
## RL-AUG-01: Augmentation output count bounded
- **Input**: 1 image
- **Action**: Run `augment_inner()`
- **Expected**: Returns exactly 8 outputs (never more, even with retries)
- **Traces**: AC: 8× augmentation ratio (1 original + 7 augmented)
## RL-DSF-01: Dataset split ratios sum to 100%
- **Input**: Any number of images
- **Action**: Check `train_set + valid_set + test_set`
+2 -2
@@ -4,8 +4,8 @@
| ID | Data Item | Source | Format | Preparation |
|----|-----------|--------|--------|-------------|
| FD-01 | Annotated images (20) | `tests/data/images/` | JPEG | Copy subset to tmp_path at test start |
| FD-02 | YOLO labels (20) | `tests/data/labels/` | TXT | Copy subset to tmp_path at test start |
| FD-03 | ONNX model | `_docs/00_problem/input_data/azaion.onnx` | ONNX | Read bytes at test start |
| FD-04 | Class definitions | `classes.json` (project root) | JSON | Copy to tmp_path at test start |
| FD-05 | Corrupted labels | Generated at test time | TXT | Create labels with coords > 1.0 |
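FD-05 can be generated in a few lines; a sketch (the helper name is illustrative):

```python
import tempfile
from pathlib import Path

def write_corrupted_label(path: Path) -> Path:
    """FD-05: write a YOLO label whose coordinates exceed the valid [0, 1] range."""
    path.write_text("0 1.5 0.5 2.0 0.3\n")  # cx=1.5 and w=2.0 are deliberately out of range
    return path

label = write_corrupted_label(Path(tempfile.mkdtemp()) / "corrupted.txt")
```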
+19 -17
@@ -4,15 +4,6 @@
| AC / Restriction | Test IDs | Coverage |
|------------------|----------|----------|
| 8× augmentation ratio | BT-AUG-01, BT-AUG-06, BT-AUG-07, RL-AUG-01 | Full |
| Augmentation naming convention | BT-AUG-02 | Full |
| Bounding boxes clipped to [0,1] | BT-AUG-03, BT-AUG-04 | Full |
| Tiny bboxes (< 0.01) discarded | BT-AUG-05 | Full |
| Augmentation skips already-processed | BT-AUG-08 | Full |
| Augmentation parallelized | PT-AUG-02 | Full |
| Augmentation handles corrupted images | RT-AUG-01 | Full |
| Augmentation handles missing labels | RT-AUG-02 | Full |
| Transform failure graceful | RT-AUG-03 | Full |
| Dataset split 70/20/10 | BT-DSF-01, RL-DSF-01 | Full |
| Dataset directory structure | BT-DSF-02 | Full |
| Dataset integrity (no data loss) | BT-DSF-03, RL-DSF-02 | Full |
@@ -34,6 +25,17 @@
| Static model encryption key | ST-ENC-03 | Full |
| Random IV per encryption | ST-ENC-01 | Full |
## Removed (augmentation now built into YOLO training)
The following tests were removed because external augmentation (`augmentation.py`) is no longer part of the training pipeline. YOLO's built-in augmentation replaces it.
| Removed Test IDs | Reason |
|-------------------|--------|
| BT-AUG-01 to BT-AUG-08 | External augmentation replaced by YOLO built-in |
| PT-AUG-01, PT-AUG-02 | Augmentation performance no longer relevant |
| RT-AUG-01 to RT-AUG-03 | Augmentation resilience no longer relevant |
| RL-AUG-01 | Augmentation resource limits no longer relevant |
## Uncovered (Require External Services)
| AC / Restriction | Reason |
@@ -50,18 +52,18 @@
| Metric | Value |
|--------|-------|
| Total AC + Restrictions | 27 |
| Covered by tests | 20 |
| Uncovered (external deps) | 7 |
| **Coverage** | **74.1%** |
## Test Count Summary
| Category | Count |
|----------|-------|
| Blackbox tests | 21 |
| Performance tests | 3 |
| Resilience tests | 3 |
| Security tests | 7 |
| Resource limit tests | 4 |
| **Total** | **38** |
+32 -4
@@ -38,8 +38,36 @@ AZ-151 (Epic: Blackbox Tests)
└── AZ-163 test_annotation_queue
```
## Implementation Strategy
---
- **Batch 1**: AZ-152 (test infrastructure) — must be implemented first
- **Batch 2**: AZ-153 to AZ-163 (all test tasks) — can be implemented in parallel after infrastructure is ready
- **Estimated batches**: 2
## Refactoring Tasks (Epic: AZ-164)
**Date**: 2026-03-28
**Total Tasks**: 5
**Total Complexity Points**: 13
| Task | Name | Complexity | Dependencies | Epic |
|------|------|-----------|-------------|------|
| AZ-165 | refactor_unify_config | 3 | None | AZ-164 |
| AZ-166 | refactor_yolo_model | 2 | None | AZ-164 |
| AZ-167 | refactor_builtin_augmentation | 3 | AZ-166 | AZ-164 |
| AZ-168 | refactor_remove_processed_dir | 3 | AZ-167 | AZ-164 |
| AZ-169 | refactor_hard_symlinks | 2 | AZ-168 | AZ-164 |
### Dependency Graph
```
AZ-164 (Epic: Code Improvements Refactoring)
├── AZ-165 refactor_unify_config (independent)
└── AZ-166 refactor_yolo_model
└── AZ-167 refactor_builtin_augmentation
└── AZ-168 refactor_remove_processed_dir
└── AZ-169 refactor_hard_symlinks
```
### Implementation Strategy
- **Batch 1**: AZ-165 (unify config) + AZ-166 (YOLO model) — independent, can be parallel
- **Batch 2**: AZ-167 (built-in aug) + AZ-168 (remove processed dir) — sequential chain
- **Batch 3**: AZ-169 (hard links) — depends on batch 2
- **Estimated batches**: 3
@@ -0,0 +1,54 @@
# Unify Configuration
**Task**: AZ-165_refactor_unify_config
**Name**: Unify configuration — remove annotation-queue/config.yaml
**Description**: Consolidate two config files into one shared Config model
**Complexity**: 3 points
**Dependencies**: None
**Component**: Configuration
**Tracker**: AZ-165
**Epic**: AZ-164
## Problem
Two separate `config.yaml` files exist (root and `src/annotation-queue/`) with overlapping content but different `dirs` values. The annotation queue handler parses YAML manually instead of using the shared `Config` Pydantic model, creating drift risk.
## Outcome
- Single `Config` model in `constants.py` covers all configuration including queue settings
- `annotation_queue_handler.py` uses the shared `Config` instead of parsing its own YAML
- `src/annotation-queue/config.yaml` is deleted
## Scope
### Included
- Add Pydantic models for `ApiConfig`, `QueueConfig`; extend `DirsConfig` with all directory fields (data, data_seed, data_processed, data_deleted, images, labels)
- Add these to the `Config` Pydantic model in `constants.py`
- Refactor `annotation_queue_handler.py` constructor to accept/import the shared Pydantic `Config`
- Delete `src/annotation-queue/config.yaml`
### Excluded
- Changing queue connection logic or message handling
- Modifying root `config.yaml` structure beyond adding queue section (it already has it)
## Acceptance Criteria
**AC-1: Single config source**
Given the root `config.yaml` contains queue and dirs settings
When `annotation_queue_handler.py` initializes
Then it reads configuration from the shared `Config` model, not a local YAML file
**AC-2: No duplicate config file**
Given the refactoring is complete
When listing `src/annotation-queue/`
Then `config.yaml` does not exist
**AC-3: Annotation queue behavior preserved**
Given the unified configuration
When the annotation queue handler processes messages
Then it uses the correct directory paths from configuration
## Constraints
- Root `config.yaml` already has the `queue` section — reuse it
- `annotation_queue_handler.py` runs as a separate process — config import path must work from its working directory
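A dependency-free sketch of the unified model shape (plain dataclasses standing in for the Pydantic models named in Scope; field names follow this task description, not the actual `constants.py`):

```python
from dataclasses import dataclass, field

@dataclass
class ApiConfig:
    base_url: str = "http://localhost:8000"  # illustrative default

@dataclass
class QueueConfig:
    host: str = "localhost"
    queue_name: str = "annotations"  # illustrative default

@dataclass
class DirsConfig:
    # All directory fields listed in Scope, unified in one place
    data: str = "data"
    data_seed: str = "data_seed"
    data_processed: str = "data_processed"
    data_deleted: str = "data_deleted"
    images: str = "images"
    labels: str = "labels"

@dataclass
class Config:
    api: ApiConfig = field(default_factory=ApiConfig)
    queue: QueueConfig = field(default_factory=QueueConfig)
    dirs: DirsConfig = field(default_factory=DirsConfig)
```

With this shape, `annotation_queue_handler.py` would take a `Config` instance instead of parsing its own YAML.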
@@ -0,0 +1,56 @@
# Update YOLO Model
**Task**: AZ-166_refactor_yolo_model
**Name**: Update YOLO model to 26m variant (supports both from-scratch and pretrained)
**Description**: Update model references from YOLO11m to YOLO26m; support both training from scratch (`.yaml`) and from pretrained weights (`.pt`)
**Complexity**: 2 points
**Dependencies**: None
**Component**: Training Pipeline
**Tracker**: AZ-166
**Epic**: AZ-164
## Problem
Current `TrainingConfig.model` is set to `yolo11m.yaml` which defines a YOLO11 architecture. YOLO26m is the latest model variant. The system should support both training modes:
1. **From scratch** — using `yolo26m.yaml` (architecture definition, trains from random weights)
2. **From pretrained** — using `yolo26m.pt` (pretrained weights, faster convergence)
## Outcome
- `TrainingConfig` default model updated to `yolo26m.pt` (pretrained, recommended default)
- `config.yaml` updated to `yolo26m.pt`
- Both `yolo26m.pt` and `yolo26m.yaml` work when set in `config.yaml`
- `train_dataset()` and `resume_training()` work with either model reference
## Scope
### Included
- Update `TrainingConfig.model` default from `yolo11m.yaml` to `yolo26m.pt`
- Update `config.yaml` training.model from `yolo11m.yaml` to `yolo26m.pt`
- Verify `train_dataset()` works with both `.pt` and `.yaml` model values
### Excluded
- Changing training hyperparameters (epochs, batch, imgsz)
- Updating ultralytics library version
## Acceptance Criteria
**AC-1: Default model config updated**
Given the training configuration
When reading `TrainingConfig.model`
Then the default value is `yolo26m.pt`
**AC-2: config.yaml updated**
Given the root `config.yaml`
When reading `training.model`
Then the value is `yolo26m.pt`
**AC-3: From-scratch training supported**
Given `config.yaml` sets `training.model: yolo26m.yaml`
When `YOLO(constants.config.training.model)` is called
Then a YOLO26m model is built from the architecture definition
**AC-4: Pretrained training supported**
Given `config.yaml` sets `training.model: yolo26m.pt`
When `YOLO(constants.config.training.model)` is called
Then a YOLO26m model is loaded from pretrained weights
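The `.pt`/`.yaml` distinction that AC-3 and AC-4 rely on can be sketched with a small helper (illustrative only; ultralytics' `YOLO()` makes this decision internally based on the file suffix):

```python
from pathlib import Path

def training_mode(model_ref: str) -> str:
    """Classify a model reference: .yaml builds the architecture from
    random weights, .pt loads pretrained weights."""
    suffix = Path(model_ref).suffix
    if suffix == ".yaml":
        return "from-scratch"
    if suffix == ".pt":
        return "pretrained"
    raise ValueError(f"unsupported model reference: {model_ref}")
```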
@@ -0,0 +1,55 @@
# Replace External Augmentation with YOLO Built-in
**Task**: AZ-167_refactor_builtin_augmentation
**Name**: Replace external augmentation with YOLO built-in
**Description**: Remove albumentations pipeline and use YOLO model.train() built-in augmentation parameters
**Complexity**: 3 points
**Dependencies**: AZ-166_refactor_yolo_model
**Component**: Training Pipeline
**Tracker**: AZ-167
**Epic**: AZ-164
## Problem
`augmentation.py` uses the `albumentations` library to augment images into a `processed_dir` before training. This creates a separate processing step, uses extra disk space (8x original), and adds complexity. YOLO's built-in augmentation applies on-the-fly during training.
## Outcome
- `train_dataset()` passes augmentation parameters directly to `model.train()`
- Each augmentation parameter is on its own line with a descriptive comment
- The external augmentation step is removed from the training pipeline
- `augmentation.py` is no longer called during training
## Scope
### Included
- Add YOLO built-in augmentation parameters to `model.train()` call in `train_dataset()`
- Parameters to add: hsv_h, hsv_s, hsv_v, degrees, translate, scale, shear, fliplr, mosaic (each with comment)
- Remove call to augmentation from training flow
### Excluded
- Deleting `augmentation.py` file (may still be useful standalone)
- Changing training hyperparameters unrelated to augmentation
## Acceptance Criteria
**AC-1: Built-in augmentation parameters with comments**
Given the `train_dataset()` function
When `model.train()` is called
Then every parameter (including augmentation: hsv_h, hsv_s, hsv_v, degrees, scale, shear, fliplr, mosaic, and training: data, epochs, batch, imgsz, etc.) is on its own line with an inline comment explaining what the parameter controls
**AC-2: No external augmentation in training flow**
Given the training pipeline
When `train_dataset()` runs
Then it does not call `augment_annotations()` or any albumentations-based augmentation
## Constraints
- Every parameter row in the `model.train()` call MUST have an inline comment describing what it does (e.g. `hsv_h=0.015, # hue shift fraction of the color wheel`)
- This applies to ALL parameters, not just augmentation — training params (data, epochs, batch, imgsz, save_period, workers) also need comments
- Augmentation parameter values should approximate the current albumentations settings:
- fliplr=0.6 (was HorizontalFlip p=0.6)
- degrees=35.0 (was Affine rotate=(-35,35))
- shear=10.0 (was Affine shear=(-10,10))
- hsv_h=0.015, hsv_s=0.7, hsv_v=0.4 (approximate HSV shifts)
- mosaic=1.0 (new YOLO built-in, recommended default)
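The constraint values above, collected into one commented mapping (parameter names are real `model.train()` arguments; grouping them in a dict is an illustration, not the required call shape):

```python
# Augmentation kwargs mirroring the old albumentations setup, each commented
# as the constraints require.
AUG_PARAMS = dict(
    hsv_h=0.015,   # hue shift as a fraction of the color wheel
    hsv_s=0.7,     # saturation shift fraction
    hsv_v=0.4,     # value (brightness) shift fraction
    degrees=35.0,  # max rotation; was Affine(rotate=(-35, 35))
    shear=10.0,    # max shear; was Affine(shear=(-10, 10))
    fliplr=0.6,    # horizontal flip probability; was HorizontalFlip(p=0.6)
    mosaic=1.0,    # YOLO built-in mosaic, recommended default
)
# model.train(data=..., epochs=..., **AUG_PARAMS)  # training kwargs also need comments
```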
@@ -0,0 +1,60 @@
# Remove Processed Directory
**Task**: AZ-168_refactor_remove_processed_dir
**Name**: Remove processed directory — use data dir directly
**Description**: Eliminate processed_dir concept from Config and all consumers; read from data dir directly; update e2e test fixture
**Complexity**: 3 points
**Dependencies**: AZ-167_refactor_builtin_augmentation
**Component**: Training Pipeline, Data Utilities
**Tracker**: AZ-168
**Epic**: AZ-164
## Problem
`Config` exposes `processed_dir`, `processed_images_dir`, `processed_labels_dir` properties. Multiple files read from the processed directory: `train.py::form_dataset()`, `exports.py::form_data_sample()`, `dataset-visualiser.py::visualise_processed_folder()`. With built-in augmentation, the processed directory is no longer populated.
The e2e test fixture (`tests/test_training_e2e.py`) currently copies images to both `data_images_dir` and `processed_images_dir` as a workaround — this needs cleanup once `form_dataset()` reads from data dirs.
## Outcome
- `Config` no longer has `processed_dir`/`processed_images_dir`/`processed_labels_dir` properties
- `form_dataset()` reads images/labels from `data_images_dir`/`data_labels_dir`
- `form_data_sample()` reads from `data_images_dir`
- `visualise_processed_folder()` reads from `data_images_dir`/`data_labels_dir`
- E2e test fixture copies images only to `data_images_dir`/`data_labels_dir` (no more processed dir population)
## Scope
### Included
- Remove `processed_dir`, `processed_images_dir`, `processed_labels_dir` from `Config`
- Update `form_dataset()` in `train.py` to use `data_images_dir` and `data_labels_dir`
- Update `copy_annotations()` in `train.py` to look up labels from `data_labels_dir` instead of `processed_labels_dir`
- Update `form_data_sample()` in `exports.py` to use `data_images_dir`
- Update `visualise_processed_folder()` in `dataset-visualiser.py`
- Update `tests/test_training_e2e.py` e2e fixture: remove processed dir population (only copy to data dirs)
### Excluded
- Removing `augmentation.py` file
- Changing `corrupted_dir` handling
## Acceptance Criteria
**AC-1: No processed dir in Config**
Given the `Config` class
When inspecting its properties
Then `processed_dir`, `processed_images_dir`, `processed_labels_dir` do not exist
**AC-2: Dataset formation reads data dir**
Given images and labels in `data_images_dir` / `data_labels_dir`
When `form_dataset()` runs
Then it reads from `data_images_dir` and validates labels from `data_labels_dir`
**AC-3: Data sample reads data dir**
Given images in `data_images_dir`
When `form_data_sample()` runs
Then it reads from `data_images_dir`
**AC-4: E2e test uses data dirs only**
Given the e2e test fixture
When setting up test data
Then it copies images/labels only to `data_images_dir`/`data_labels_dir` (no processed dir)
@@ -0,0 +1,42 @@
# Use Hard Links for Dataset
**Task**: AZ-169_refactor_hard_symlinks
**Name**: Use hard links instead of file copies for dataset formation
**Description**: Replace shutil.copy() with os.link() in dataset split creation to save disk space
**Complexity**: 2 points
**Dependencies**: AZ-168_refactor_remove_processed_dir
**Component**: Training Pipeline
**Tracker**: AZ-169
**Epic**: AZ-164
## Problem
`copy_annotations()` in `train.py` uses `shutil.copy()` to duplicate images and labels into train/valid/test splits. For large datasets this wastes significant disk space.
## Outcome
- Dataset formation uses `os.link()` (hard links) instead of `shutil.copy()`
- Fallback to `shutil.copy()` when hard links fail (cross-filesystem)
- No change in training behavior — YOLO reads hard-linked files identically
## Scope
### Included
- Replace `shutil.copy()` with `os.link()` in `copy_annotations()` inner `copy_image()` function
- Add try/except fallback to `shutil.copy()` for `OSError` (cross-filesystem)
### Excluded
- Changing `form_data_sample()` in exports.py (separate utility, lower priority)
- Changing corrupted file handling
## Acceptance Criteria
**AC-1: Hard links used**
Given images and labels in the data directory
When `copy_annotations()` creates train/valid/test splits
Then files are hard-linked via `os.link()`, not copied
**AC-2: Fallback on failure**
Given a cross-filesystem scenario where `os.link()` raises `OSError`
When `copy_annotations()` encounters the error
Then it falls back to `shutil.copy()` transparently
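A sketch of the link-with-fallback behavior AC-1/AC-2 describe (the helper name `link_or_copy` is illustrative; the real change lives inside `copy_annotations()`):

```python
import os
import shutil
import tempfile
from pathlib import Path

def link_or_copy(src: Path, dst: Path) -> None:
    """Hard-link dst to src; fall back to a plain copy if linking fails
    (e.g. src and dst are on different filesystems)."""
    try:
        os.link(src, dst)
    except OSError:
        shutil.copy(src, dst)

root = Path(tempfile.mkdtemp())
src = root / "img.jpg"
src.write_bytes(b"fake-jpeg")
link_or_copy(src, root / "train_img.jpg")
```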
@@ -0,0 +1,33 @@
# Refactoring Roadmap
**Run**: 01-code-improvements
**Date**: 2026-03-28
## Execution Order
All 5 changes are grouped into a single phase (straightforward, low-to-medium risk).
| Priority | Change | Risk | Effort |
|----------|--------|------|--------|
| 1 | C05: Unify configuration | medium | 3 pts |
| 2 | C01: Update YOLO model | medium | 2 pts |
| 3 | C02: Replace external augmentation | medium | 3 pts |
| 4 | C03: Remove processed directory | medium | 3 pts |
| 5 | C04: Hard links | low | 2 pts |
**Total estimated effort**: 13 points across 5 tasks
## Dependency Graph
```
C05 (config unification) ─── independent
C01 (YOLO update) ← C02 (built-in aug) ← C03 (remove processed dir) ← C04 (hard links)
```
C05 can be done in parallel with the C01→C04 chain.
## Risk Mitigation
- Existing test suite (83 tests) provides safety net
- Each change committed separately for easy rollback
- C02 is the highest-risk change (training pipeline behavior change) — validate by running a short training sanity check after implementation
@@ -0,0 +1,50 @@
# Research Findings
**Run**: 01-code-improvements
**Date**: 2026-03-28
## Current State Analysis
### Training Pipeline
- Uses `yolo11m.yaml` (architecture-only config, trains from scratch)
- External augmentation via `albumentations` library in `src/augmentation.py`
- Two-step process: augment → form dataset → train
- Dataset formation copies files with `shutil.copy()`, duplicating ~8x storage
### Configuration
- Two config files: `config.yaml` (root) and `src/annotation-queue/config.yaml`
- Annotation queue handler parses YAML manually instead of using shared `Config` model
- Config drift risk between the two files
## YOLO 26 Model Update
Ultralytics YOLO26 is the latest model family. The medium variant `yolo26m.pt` replaces `yolo11m.yaml`:
- Uses pretrained weights (`.pt`) rather than architecture-only (`.yaml`)
- Faster convergence with transfer learning
- Improved accuracy on detection benchmarks
## Built-in Augmentation Parameters
YOLO's `model.train()` supports the following augmentation parameters that replace the external `albumentations` pipeline:
| Parameter | Default | Equivalent to current external aug |
|-----------|---------|-----------------------------------|
| `hsv_h` | 0.015 | HueSaturationValue(hue_shift_limit=10) |
| `hsv_s` | 0.7 | HueSaturationValue(sat_shift_limit=10) |
| `hsv_v` | 0.4 | RandomBrightnessContrast + HSV |
| `degrees` | 0.0 | Affine(rotate=(-35,35)) → set to 35.0 |
| `translate` | 0.1 | Default is sufficient |
| `scale` | 0.5 | Affine(scale=(0.8,1.2)) → default covers this |
| `shear` | 0.0 | Affine(shear=(-10,10)) → set to 10.0 |
| `fliplr` | 0.5 | HorizontalFlip(p=0.6) → set to 0.6 |
| `flipud` | 0.0 | Not used currently |
| `mosaic` | 1.0 | New — YOLO built-in |
| `mixup` | 0.0 | New — optional |
## Hard Links
`os.link()` creates hard links sharing the same inode. Benefits:
- Zero additional disk usage for dataset splits
- Same read performance as regular files
- Works on same filesystem (which is the case here — all under `/azaion/`)
- Fallback to `shutil.copy()` for cross-filesystem edge cases
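The inode-sharing claim can be verified directly:

```python
import os
import tempfile
from pathlib import Path

d = Path(tempfile.mkdtemp())
original = d / "frame.jpg"
original.write_bytes(b"pixels")
linked = d / "frame_train.jpg"
os.link(original, linked)  # second directory entry, same inode, no extra data blocks

same_inode = os.stat(original).st_ino == os.stat(linked).st_ino
link_count = os.stat(original).st_nlink  # one inode, two directory entries
```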
@@ -0,0 +1,66 @@
# Baseline Metrics
**Run**: 01-code-improvements
**Date**: 2026-03-28
**Mode**: Guided
**Source**: `_docs/02_document/refactoring_notes.md`
## Goals
Apply 5 improvements identified during documentation:
1. Update YOLO to the 26m model variant
2. Replace external augmentation with YOLO built-in augmentation
3. Remove processed folder — use data dir directly
4. Use hard links instead of file copies for dataset formation
5. Unify constants directories — remove `src/annotation-queue/config.yaml`
## Code Metrics
| Metric | Value |
|--------|-------|
| Source files (src/) | 24 Python files |
| Source LOC | 2,945 |
| Test files | 21 Python files |
| Test LOC | 1,646 |
| Total tests | 83 (77 blackbox/unit + 6 performance) |
| Test execution time | ~130s (120s unit + 10s perf) |
| Python version | 3.10.8 |
| Ultralytics version | 8.4.30 |
| Pip packages | ~76 |
## Files Affected by Refactoring
| File | LOC | Refactoring Items |
|------|-----|-------------------|
| `src/constants.py` | 118 | #3 (remove processed_dir), #5 (unify config) |
| `src/train.py` | 178 | #1 (YOLO version), #2 (built-in aug), #3 (data dir), #4 (symlinks) |
| `src/augmentation.py` | 152 | #2 (replace with YOLO built-in), #3 (processed dir) |
| `src/exports.py` | 118 | #3 (processed dir references) |
| `src/convert-annotations.py` | 119 | #3 (processed dir references) |
| `src/dataset-visualiser.py` | 52 | #3 (processed dir references) |
| `src/annotation-queue/annotation_queue_handler.py` | 173 | #5 (remove separate config.yaml) |
| `src/annotation-queue/config.yaml` | 21 | #5 (delete — duplicated config) |
| `config.yaml` | 30 | #5 (single source of truth) |
## Test Suite Baseline
```
77 passed, 0 failed, 0 skipped (blackbox/unit)
6 passed, 0 failed, 0 skipped (performance)
Total: 83 passed in ~130s
```
## Functionality Inventory
| Feature | Status | Affected by Refactoring |
|---------|--------|------------------------|
| Augmentation pipeline | Working | Yes (#2, #3) |
| Dataset formation | Working | Yes (#3, #4) |
| Training | Working | Yes (#1, #2) |
| Model export (ONNX) | Working | No |
| Inference (ONNX/TensorRT) | Working | No |
| Annotation queue | Working | Yes (#5) |
| API client | Working | No |
| CDN manager | Working | No |
| Security/encryption | Working | No |
| Label validation | Working | No |
@@ -0,0 +1,26 @@
# Training Pipeline
## Files
- `src/train.py` (178 LOC)
- `src/augmentation.py` (152 LOC)
- `src/constants.py` (118 LOC)
## Current Flow
```mermaid
graph TD
A[augmentation.py] -->|reads from| B[data_dir]
A -->|writes to| C[processed_dir]
D[train.py::form_dataset] -->|reads from| C
D -->|shutil.copy to| E[datasets_dir/today/train,valid,test]
F[train.py::train_dataset] -->|YOLO.train| E
```
## Issues
- External augmentation (albumentations) runs as separate step, writing to `processed_dir`
- `form_dataset()` copies files from `processed_dir` to dataset splits using `shutil.copy`
- YOLO has built-in augmentation that runs during training (mosaic, mixup, flips, etc.)
- Using built-in aug eliminates the need for `processed_dir` and the full `augmentation.py` pipeline
- `copy_annotations()` uses `shutil.copy` — wasteful for large datasets
- Global mutable `total_files_copied` variable in `copy_annotations`
- Model config `yolo11m.yaml` trains from scratch; it should likely use pretrained weights or an updated variant
@@ -0,0 +1,18 @@
# Configuration System
## Files
- `src/constants.py` (118 LOC)
- `config.yaml` (root, 30 lines)
- `src/annotation-queue/config.yaml` (21 lines)
- `src/annotation-queue/annotation_queue_handler.py` (173 LOC)
## Current State
- `constants.py` defines `Config` (Pydantic model) loaded from root `config.yaml`
- `annotation_queue_handler.py` reads its own `config.yaml` with raw `yaml.safe_load`
- Both config files share `api`, `queue`, `dirs` sections but with different `dirs` values
- Annotation queue config has `data: 'data-test'` vs root `data: 'data'`
## Issues
- Two config files with overlapping content — drift risk
- `annotation_queue_handler.py` parses config manually instead of using `Config` model
- `constants.py` still has `processed_dir` properties that become obsolete after removing external augmentation
@@ -0,0 +1,10 @@
# Data Utilities
## Files
- `src/exports.py`: `form_data_sample()` reads from `processed_images_dir`
- `src/dataset-visualiser.py`: `visualise_processed_folder()` reads from `processed_images_dir`/`processed_labels_dir`
## Impact
- Both files reference `processed_dir` via `constants.config`
- After removing `processed_dir`, these must switch to `data_images_dir`/`data_labels_dir`
- `form_data_sample()` also uses `shutil.copy` — candidate for hard links
@@ -0,0 +1,52 @@
# List of Changes
**Run**: 01-code-improvements
**Mode**: guided
**Source**: `_docs/02_document/refactoring_notes.md`
**Date**: 2026-03-28
## Summary
Apply 5 improvements from documentation review: update YOLO model, switch to built-in augmentation, remove processed directory, use hard symlinks for dataset formation, and unify configuration files.
## Changes
### C01: Update YOLO model to 26m variant
- **File(s)**: `src/constants.py`, `src/train.py`
- **Problem**: Current model config uses `yolo11m.yaml` which trains from a YAML architecture definition
- **Change**: Update `TrainingConfig.model` to the YOLO 26m variant; ensure `train_dataset()` uses the updated model reference
- **Rationale**: Use updated model version as requested; pretrained weights improve convergence
- **Risk**: medium
- **Dependencies**: None
### C02: Replace external augmentation with YOLO built-in
- **File(s)**: `src/train.py`, `src/augmentation.py`
- **Problem**: `augmentation.py` uses albumentations to augment images into a separate `processed_dir` before training — adds complexity, disk usage, and a separate processing step
- **Change**: Remove the `augment_annotations()` call from the training pipeline; add YOLO built-in augmentation parameters (hsv_h, hsv_s, hsv_v, degrees, translate, scale, shear, flipud, fliplr, mosaic, mixup) to the `model.train()` call in `train_dataset()`, each on its own line with a descriptive comment; `augmentation.py` remains in codebase but is no longer called during training
- **Rationale**: YOLO's built-in augmentation applies on-the-fly during training, eliminating the pre-processing step and processed directory
- **Risk**: medium
- **Dependencies**: C01
### C03: Remove processed directory — use data dir directly
- **File(s)**: `src/constants.py`, `src/train.py`, `src/exports.py`, `src/dataset-visualiser.py`
- **Problem**: `processed_dir`, `processed_images_dir`, `processed_labels_dir` properties in `Config` are no longer needed when built-in augmentation is used; `form_dataset()` reads from processed dir; `form_data_sample()` reads from processed dir; `visualise_processed_folder()` reads from processed dir
- **Change**: Remove `processed_dir`/`processed_images_dir`/`processed_labels_dir` properties from `Config`; update `form_dataset()` to read from `data_images_dir`/`data_labels_dir`; update `form_data_sample()` similarly; update `visualise_processed_folder()` similarly
- **Rationale**: Processed directory is unnecessary without external augmentation step
- **Risk**: medium
- **Dependencies**: C02
### C04: Use hard links instead of file copies for dataset
- **File(s)**: `src/train.py`
- **Problem**: `copy_annotations()` uses `shutil.copy()` to duplicate images and labels into train/valid/test splits — wastes disk space on large datasets
- **Change**: Replace `shutil.copy()` with `os.link()` to create hard links; add fallback to `shutil.copy()` for cross-filesystem scenarios
- **Rationale**: Hard links share the same inode, saving disk space while maintaining independent directory entries
- **Risk**: low
- **Dependencies**: C03
### C05: Unify configuration — remove annotation-queue/config.yaml
- **File(s)**: `src/constants.py`, `src/annotation-queue/annotation_queue_handler.py`, `src/annotation-queue/config.yaml`
- **Problem**: `src/annotation-queue/config.yaml` duplicates root `config.yaml` with different `dirs` values; `annotation_queue_handler.py` parses config manually via `yaml.safe_load` instead of using the shared `Config` model
- **Change**: Extend `Config` in `constants.py` to include queue and annotation-queue directory settings; refactor `annotation_queue_handler.py` to accept a `Config` instance (or import from constants); delete `src/annotation-queue/config.yaml`
- **Rationale**: Single source of truth for configuration eliminates drift risk and inconsistency
- **Risk**: medium
- **Dependencies**: None
+4 -4
@@ -2,8 +2,8 @@
## Current Step
flow: existing-code
step: 7
name: Refactor
status: in_progress
sub_step: 4 — Execution (Batch 1 done: AZ-165, AZ-166, AZ-167; next: Batch 2 AZ-168)
retry_count: 0