Files
Oleksandr Bezdieniezhnykh 8e2ecf50fd Initial commit
Made-with: Cursor
2026-03-26 00:20:30 +02:00

12 KiB

Source Registry

Source #1

  • Title: Ultralytics YOLO26 Documentation
  • Link: https://docs.ultralytics.com/models/yolo26/
  • Tier: L1
  • Publication Date: 2026-01-14
  • Timeliness Status: Currently valid
  • Version Info: YOLO26, Ultralytics 8.4.x
  • Summary: Official YOLO26 docs — NMS-free, edge-first, MuSGD optimizer, improved small object detection, instance segmentation with semantic loss.

Source #2

  • Title: YOLOE: Real-Time Seeing Anything — Ultralytics Docs
  • Link: https://docs.ultralytics.com/models/yoloe/
  • Tier: L1
  • Publication Date: 2025-2026
  • Timeliness Status: Currently valid
  • Version Info: YOLOE, YOLOE-26 (yoloe-26n-seg.pt through yoloe-26x-seg.pt)
  • Summary: Official YOLOE docs — open-vocabulary detection/segmentation, text/visual/prompt-free modes, RepRTA, SAVPE, LRPC, zero inference overhead when re-parameterized.

Source #3

  • Title: YOLOE-26 Paper
  • Link: https://arxiv.org/abs/2602.00168
  • Tier: L1
  • Publication Date: 2026-02
  • Timeliness Status: Currently valid
  • Summary: Integration of YOLO26 with YOLOE for real-time open-vocabulary instance segmentation. NMS-free, end-to-end.

Source #4

  • Title: Ultralytics YOLO26 Jetson Benchmarks
  • Link: https://docs.ultralytics.com/guides/nvidia-jetson
  • Tier: L1
  • Publication Date: 2026
  • Timeliness Status: Currently valid
  • Version Info: YOLO11 benchmarks on Jetson Orin Nano Super, TensorRT FP16
  • Summary: YOLO11n TensorRT FP16 on Jetson Orin Nano Super: 6.93ms at 640px. YOLO11s: 13.50ms. YOLO11m: 17.48ms.

Source #5

Source #6

  • Title: UAV-VL-R1 Paper
  • Link: https://arxiv.org/pdf/2508.11196
  • Tier: L1
  • Publication Date: 2025
  • Timeliness Status: Currently valid
  • Summary: Lightweight VLM for aerial reasoning. 48% better zero-shot than Qwen2-VL-2B. 2.5GB INT8, 3.9GB FP16. Open source.

Source #7

  • Title: SmolVLM 256M & 500M Blog
  • Link: https://huggingface.co/blog/smolervlm
  • Tier: L1
  • Publication Date: 2025-01
  • Timeliness Status: Currently valid
  • Summary: SmolVLM-500M: 1.8GB GPU RAM, ONNX/WebGPU support, 93M SigLIP vision encoder.

Source #8

Source #9

Source #10

Source #11

  • Title: UAV-YOLO12 Road Segmentation
  • Link: https://www.mdpi.com/2072-4292/17/9/1539
  • Tier: L1
  • Publication Date: 2025
  • Summary: F1=0.825 for paths from UAV imagery. 11.1ms inference. SKNet + PConv modules.

Source #12

  • Title: FootpathSeg GitHub
  • Link: https://github.com/WennyXY/FootpathSeg
  • Tier: L3
  • Publication Date: 2025
  • Summary: DINO-MC pre-training + UNet fine-tuning for footpath segmentation. GIS layer generation.

Source #13

  • Title: Herbivore Trail Segmentation (UNet+MambaOut)
  • Link: https://arxiv.org/pdf/2504.12121
  • Tier: L1
  • Publication Date: 2025-04
  • Summary: UNet+MambaOut achieves best accuracy for trail detection from aerial photographs.

Source #14

  • Title: Open-Vocabulary Camouflaged Object Segmentation
  • Link: https://arxiv.org/html/2506.19300v1
  • Tier: L1
  • Publication Date: 2025
  • Summary: VLM + SAM cascaded approach for camouflage detection. VLM-derived features as prompts to SAM.

Source #15

Source #16

Source #17

  • Title: servopilot Python Library
  • Link: https://pypi.org/project/servopilot/
  • Tier: L3
  • Publication Date: 2025
  • Summary: Anti-windup PID controller for gimbal control. Dual-axis support. Zero dependencies.

Source #18

Source #19

  • Title: TensorRT Multiple Engines on Single GPU
  • Link: https://github.com/NVIDIA/TensorRT/issues/4358
  • Tier: L2
  • Publication Date: 2025
  • Summary: NVIDIA recommends single engine with async CUDA streams over multiple separate engines. CUDA context push/pop needed for multiple engines.

Source #20

  • Title: TensorRT High Memory Usage on Jetson Orin Nano (Ultralytics)
  • Link: https://github.com/ultralytics/ultralytics/issues/21562
  • Tier: L2
  • Publication Date: 2025
  • Summary: YOLOv8-OBB TRT engine consumes ~2.6GB on Jetson Orin Nano. cuDNN/CUDA binary loading adds ~940MB-1.1GB overhead per engine.

Source #21

Source #22

Source #23

  • Title: YOLO26 INT8 TensorRT Export Fails on Jetson Orin (Ultralytics Issue #23841)
  • Link: https://github.com/ultralytics/ultralytics/issues/23841
  • Tier: L2
  • Publication Date: 2026
  • Summary: YOLO26n INT8 TRT export fails with checkLinks error during calibration on Jetson Orin with TensorRT 10.3.0 / JetPack 6.

Source #24

  • Title: PatchBlock: Lightweight Defense Against Adversarial Patches for Edge AI
  • Link: https://arxiv.org/abs/2601.00367
  • Tier: L1
  • Publication Date: 2026-01
  • Summary: CPU-based preprocessing module recovers up to 77% model accuracy under adversarial patch attacks. Minimal clean accuracy loss. Suitable for edge deployment.

Source #25

Source #26

Source #27

  • Title: Synthetic Data for Military Camouflaged Object Detection (IEEE)
  • Link: https://ieeexplore.ieee.org/document/10660900/
  • Tier: L1
  • Publication Date: 2024
  • Summary: Synthetic data generation approach for military camouflage detection training.

Source #28

  • Title: GenCAMO: Environment-Aware Camouflage Image Generation
  • Link: https://arxiv.org/abs/2601.01181
  • Tier: L1
  • Publication Date: 2026-01
  • Summary: Scene graph + generative models for synthetic camouflage data with multi-modal annotations. Improves complex scene detection.

Source #29

  • Title: Camouflage Anything (CVPR 2025)
  • Link: https://openaccess.thecvf.com/content/CVPR2025/html/Das_Camouflage_Anything_...
  • Tier: L1
  • Publication Date: 2025
  • Summary: Controlled out-painting for realistic camouflage dataset generation. CamOT metric. Improves detection baselines when used for fine-tuning.

Source #30

  • Title: YOLOE Visual+Text Multimodal Fusion PR (Ultralytics)
  • Link: https://github.com/ultralytics/ultralytics/pull/21966
  • Tier: L2
  • Publication Date: 2025
  • Summary: Multimodal fusion of text + visual prompts for YOLOE. Concat mode (zero overhead) and weighted-sum mode (fuse_alpha). Merged into Ultralytics.

Source #31

  • Title: Learnable Morphological Skeleton for Remote Sensing (IEEE TGRS 2025)
  • Link: https://ui.adsabs.harvard.edu/abs/2025ITGRS..63S1458X
  • Tier: L1
  • Publication Date: 2025
  • Summary: Learnable morphological skeleton priors integrated into SAM for slender object segmentation. Addresses downsampling information loss.

Source #32

  • Title: GraphMorph: Topologically Accurate Tubular Structure Extraction
  • Link: https://arxiv.org/pdf/2502.11731
  • Tier: L1
  • Publication Date: 2025
  • Summary: Branch-level graph decoder + SkeletonDijkstra for centerline extraction. Reduces false positives vs pixel-level segmentation.

Source #33

  • Title: UAV Gimbal PID Control for Camera Stabilization (IEEE 2024)
  • Link: https://ieeexplore.ieee.org/document/10569310/
  • Tier: L1
  • Publication Date: 2024
  • Summary: PID controllers applied in gimbal construction for stabilization and tracking.

Source #34

Source #35

Source #36

Source #37

Source #38