[AZ-180] Fix INT8 conversion: set FP16 flag alongside INT8 for TensorRT 10.x

In TensorRT 10.x, INT8 conversion needs the FP16 flag set as a fallback for network layers that have no INT8 kernel implementation (e.g. normalization ops in detection models). Without FP16, build_serialized_network can return None on Jetson for YOLO-type models. The INT8 flag remains the primary precision; FP16 is only the layer-level fallback within the same engine.

Made-with: Cursor
2026-06-21 05:51:08 +00:00 · 2026-04-02 07:32:16 +03:00
parent 7a7f2a4cdd
commit 3984507221
2 changed files with 3 additions and 0 deletions
@@ -73,6 +73,7 @@ def test_convert_from_source_uses_int8_when_cache_provided():

        # Assert
        mock_config.set_flag.assert_any_call("INT8")
+        mock_config.set_flag.assert_any_call("FP16")
        assert mock_config.int8_calibrator is not None
    finally:
        os.unlink(cache_path)
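
The hunk above covers only the test; the conversion-side change is not shown here. As a rough sketch of the flag setup the commit message describes, and not the repository's actual code, something like the following could apply. The helper name `apply_int8_precision` and the `fake_*` stand-ins are hypothetical; the stand-ins mirror the test's mocked flags so the snippet runs without TensorRT installed.

```python
from types import SimpleNamespace
from unittest import mock


def apply_int8_precision(config, builder_flag, calibrator):
    """Hypothetical helper: INT8 as primary precision, FP16 as fallback."""
    # INT8 is the precision we actually want for the engine.
    config.set_flag(builder_flag.INT8)
    # FP16 lets layers without an INT8 kernel build at half precision.
    config.set_flag(builder_flag.FP16)
    # The calibrator supplies INT8 scales (here: loaded from a cache).
    config.int8_calibrator = calibrator


# Stand-ins (assumptions) so the sketch runs without TensorRT:
# the flag values are plain strings, as in the mocked test above.
fake_flags = SimpleNamespace(INT8="INT8", FP16="FP16")
fake_config = mock.MagicMock()
fake_calibrator = object()

apply_int8_precision(fake_config, fake_flags, fake_calibrator)

# Collect the flags in the order they were set on the mocked config.
set_flags = [call.args[0] for call in fake_config.set_flag.call_args_list]
print(set_flags)
```

With a real installation, the same helper would presumably be called with `trt.BuilderFlag` and the config returned by `builder.create_builder_config()`; because `build_serialized_network` can return None, the caller should still check its result.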