• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Spatio-Temporal Convolutional Neural Network for Enhanced Inter Prediction in Video Coding
 
  • Details
  • Full
Options
2024
Journal Article
Title

Spatio-Temporal Convolutional Neural Network for Enhanced Inter Prediction in Video Coding

Abstract
This paper presents a convolutional neural network (CNN)-based enhancement to inter prediction in Versatile Video Coding (VVC). Our approach aims at improving the prediction signal of inter blocks with a residual CNN that incorporates spatial and temporal reference samples. It is motivated by the theoretical consideration that neural network-based methods have a higher degree of signal adaptivity than conventional signal processing methods and that spatially neighboring reference samples have the potential to improve the prediction signal by adapting it to the reconstructed signal in its immediate vicinity. We show that adding a polyphase decomposition stage to the CNN results in a significantly better trade-off between computational complexity and coding performance. Incorporating spatial reference samples in the inter prediction process is challenging: The fact that the input of the CNN for one block may depend on the output of the CNN for preceding blocks prohibits parallel processing. We solve this by introducing a novel signal plane that contains specifically constrained reference samples, enabling parallel decoding while maintaining a high compression efficiency. Overall, experimental results show average bit rate savings of 4.07% and 3.47% for the random access (RA) and low-delay B (LB) configurations of the JVET common test conditions, respectively.
Author(s)
Merkle, Philipp  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Winken, Martin  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Pfaff, Jonathan
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Schwarz, Heiko  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Marpe, Detlev  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Wiegand, Thomas  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Journal
IEEE transactions on image processing  
Open Access
File(s)
Download (5.14 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.1109/TIP.2024.3446228
10.24406/publica-6288
Additional link
Full text
Language
English
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Keyword(s)
  • convolutional neural network

  • Inter prediction

  • intra reference samples

  • versatile video coding standard

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024