detections

mirror of https://github.com/azaion/detections.git synced 2026-06-21 04:21:08 +00:00

Author	SHA1	Message	Date
Roman Meshko	7d897df380	Fix dynamic ONNX input; update docs with correct file name for tests	2026-04-19 20:55:51 +03:00
Oleksandr Bezdieniezhnykh	8116b55813	[AZ-180] Refactor inference and engine factory for improved model handling - Updated the autopilot state to reflect the current task status as in progress. - Refactored the inference module to streamline model downloading and conversion processes, replacing the download_model method with a more efficient load_source method. - Introduced asynchronous model building in the inference module to enhance performance during model conversion. - Enhanced the engine factory to include a new method for building and caching models, improving error handling and logging during the upload process. - Added calibration cache handling in the Jetson TensorRT engine for better resource management. Made-with: Cursor	2026-04-03 06:41:11 +03:00
Oleksandr Bezdieniezhnykh	834f846dc8	[AZ-180] Enhance setup and improve inference logging - Added a new Cython extension for the engine factory to the setup configuration. - Updated the inference module to include additional logging for video batch processing and annotation callbacks. - Refactored test cases to standardize the detection endpoint responses and include channel IDs in headers for better event handling.	2026-04-03 05:58:55 +03:00
Oleksandr Bezdieniezhnykh	8baa96978b	[AZ-180] Refactor detection event handling and improve SSE support - Updated the detection image endpoint to require a channel ID for event streaming. - Introduced a new endpoint for streaming detection events, allowing clients to receive real-time updates. - Enhanced the internal buffering mechanism for detection events to manage multiple channels. - Refactored the inference module to support the new event handling structure. Made-with: Cursor	2026-04-03 02:42:05 +03:00
Oleksandr Bezdieniezhnykh	3984507221	[AZ-180] Fix INT8 conversion: set FP16 flag alongside INT8 for TensorRT 10.x In TensorRT 10.x, INT8 conversion requires FP16 to be set as a fallback for network layers (e.g. normalization ops in detection models) that have no INT8 kernel implementation. Without FP16, build_serialized_network can return None on Jetson for YOLO-type models. INT8 flag is still the primary precision; FP16 is only the layer-level fallback within the same engine. Made-with: Cursor	2026-04-02 07:32:16 +03:00
Oleksandr Bezdieniezhnykh	2149cd6c08	[AZ-180] Add Jetson Orin Nano support with INT8 TensorRT engine - Dockerfile.jetson: JetPack 6.x L4T base image (aarch64), TensorRT and PyCUDA from apt - requirements-jetson.txt: derived from requirements.txt, no pip tensorrt/pycuda - docker-compose.jetson.yml: runtime: nvidia for NVIDIA Container Runtime - tensorrt_engine.pyx: convert_from_source accepts optional calib_cache_path; INT8 used when cache present, FP16 fallback; get_engine_filename encodes precision suffix to avoid engine cache confusion - inference.pyx: init_ai tries INT8 engine then FP16 on lookup; downloads calibration cache before conversion thread; passes cache path through to convert_from_source - constants_inf: add INT8_CALIB_CACHE_FILE constant - Unit tests for AC-3 (INT8 flag set when cache provided) and AC-4 (FP16 when no cache) Made-with: Cursor	2026-04-02 07:12:45 +03:00
Oleksandr Bezdieniezhnykh	097811a67b	[AZ-178] Fix Critical/High security findings: auth, CVEs, non-root containers, per-job SSE - Pin all deps; h11==0.16.0 (CVE-2025-43859), python-multipart>=1.3.1 (CVE-2026-28356), PyJWT==2.12.1 - Add HMAC JWT verification (require_auth FastAPI dependency, JWT_SECRET-gated) - Fix TokenManager._refresh() to use ADMIN_API_URL instead of ANNOTATIONS_URL - Rename POST /detect → POST /detect/image (image-only, rejects video files) - Replace global SSE stream with per-job SSE: GET /detect/{media_id} with event replay buffer - Apply require_auth to all 4 protected endpoints - Fix on_annotation/on_status closure to use mutable current_id for correct post-upload event routing - Add non-root appuser to Dockerfile and Dockerfile.gpu - Add JWT_SECRET to e2e/docker-compose.test.yml and run-tests.sh - Update all e2e tests and unit tests for new endpoints and HMAC token signing - 64/64 tests pass Made-with: Cursor	2026-04-02 06:32:12 +03:00
Oleksandr Bezdieniezhnykh	be4cab4fcb	[AZ-178] Implement streaming video detection endpoint - Added `/detect/video` endpoint for true streaming video detection, allowing inference to start as upload bytes arrive. - Introduced `run_detect_video_stream` method in the inference module to handle video processing from a file-like object. - Updated media hashing to include a new function for computing hashes directly from files with minimal I/O. - Enhanced documentation to reflect changes in video processing and API behavior. Made-with: Cursor	2026-04-01 03:11:43 +03:00
Oleksandr Bezdieniezhnykh	e65d8da6a3	[AZ-177] Remove redundant synchronous video pre-writes in /detect endpoint Made-with: Cursor	2026-04-01 01:12:05 +03:00
Oleksandr Bezdieniezhnykh	948b50ae3a	[AZ-175] Restore image validation for corrupt/oversized uploads in /detect endpoint Made-with: Cursor	2026-03-31 06:46:34 +03:00
Oleksandr Bezdieniezhnykh	9411103041	[AZ-176] Remove obsolete path-based detection code from inference pipeline Made-with: Cursor	2026-03-31 06:39:19 +03:00
Oleksandr Bezdieniezhnykh	40be55ac03	[AZ-175] Media table integration with XxHash64 content hashing and status lifecycle Made-with: Cursor	2026-03-31 06:36:56 +03:00
Oleksandr Bezdieniezhnykh	6c24d09eab	[AZ-173] [AZ-174] Stream-based detection API and DB-driven AI config Made-with: Cursor	2026-03-31 06:30:22 +03:00
Oleksandr Bezdieniezhnykh	8ce40a9385	Add AIAvailabilityStatus and AIRecognitionConfig classes for AI model management - Introduced `AIAvailabilityStatus` class to manage the availability status of AI models, including methods for setting status and logging messages. - Added `AIRecognitionConfig` class to encapsulate configuration parameters for AI recognition, with a static method for creating instances from dictionaries. - Implemented enums for AI availability states to enhance clarity and maintainability. - Updated related Cython files to support the new classes and ensure proper type handling. These changes aim to improve the structure and functionality of the AI model management system, facilitating better status tracking and configuration handling.	2026-03-31 05:49:51 +03:00

14 Commits
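
Commits 2149cd6c08 and 3984507221 describe the precision rule for engine builds: INT8 is used when a calibration cache is present, FP16 otherwise, FP16 is also set alongside INT8 as a layer-level fallback, and the engine filename carries a precision suffix so cached engines are not confused. The following is a minimal pure-Python sketch of that rule only. The function names, the `exists` parameter, and the filename layout are illustrative assumptions, not the repository's actual code, and no TensorRT calls are made.

```python
import os
from typing import Callable, List, Optional


def select_precision_flags(
    calib_cache_path: Optional[str],
    exists: Callable[[str], bool] = os.path.isfile,
) -> List[str]:
    """Return precision flag names, primary precision first.

    Hypothetical helper: INT8 is chosen only when a calibration cache
    file is available; FP16 is then added as the fallback for layers
    that have no INT8 kernel. Without a cache, FP16 is used alone.
    """
    if calib_cache_path and exists(calib_cache_path):
        return ["INT8", "FP16"]
    return ["FP16"]


def engine_filename(model_stem: str, flags: List[str]) -> str:
    """Encode the primary precision in the engine filename.

    The '<stem>.<precision>.engine' layout is an assumption made for
    this sketch; the point is only that INT8 and FP16 builds of the
    same model get distinct cache names.
    """
    return f"{model_stem}.{flags[0].lower()}.engine"
```

With a cache present, `select_precision_flags` returns both flags and `engine_filename` yields an `int8` suffix; without one, only FP16 is returned and the suffix is `fp16`. A real build would map these names onto the builder configuration of the TensorRT version in use.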