• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight
 
  • Details
  • Full
Options
2025
Conference Paper
Title

High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight

Abstract
Semantic segmentation from RGB cameras is essential to the perception of autonomous flying vehicles. The stability of predictions through the captured videos is paramount to their reliability and, by extension, to the trustworthiness of the agents. In this paper, we propose a lightweight video semantic segmentation approach - suited to onboard real-time inference - achieving high temporal consistency on aerial data through Semantic Similarity Propagation across frames. SSP temporally propagates the predictions of an efficient image segmentation model with global registration alignment to compensate for camera movements. It combines the current estimation and the prior prediction with linear interpolation using weights computed from the features similarities of the two frames. Because data availability is a challenge in this domain, we propose a consistency-aware Knowledge Distillation training procedure for sparsely labeled datasets with few annotations. Using a large image segmentation model as a teacher to train the efficient SSP, we leverage the strong correlations between labeled and unlabeled frames in the same training videos to obtain high-quality supervision on all frames. KD-SSP obtains a significant temporal consistency increase over the base image segmentation model of 12.5% and 6.7% TC on UAVid and RuralScapes respectively, with higher accuracy and comparable inference speed. On these aerial datasets, KD-SSP provides a superior segmentation quality and inference speed trade-off than other video methods proposed for general applications and shows considerably higher consistency. Project page: https://github.com/FraunhoferIVI/SSP.
Author(s)
Vincent, Cédric
Fraunhofer-Institut für Verkehrs- und Infrastruktursysteme IVI  
Kim, Taehyoung
Fraunhofer-Institut für Verkehrs- und Infrastruktursysteme IVI  
Meeß, Henri
Fraunhofer-Institut für Verkehrs- und Infrastruktursysteme IVI  
Mainwork
IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025. Proceedings  
Conference
Conference on Computer Vision and Pattern Recognition 2025  
DOI
10.1109/CVPR52734.2025.00144
Language
English
Fraunhofer-Institut für Verkehrs- und Infrastruktursysteme IVI  
Keyword(s)
  • semantic segmentation

  • autonomous flying vehicles

  • UAV

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024