Here you will find scientific publications from the Fraunhofer Institutes.

An optimized parallel IDCT on graphics processing units

Authors: Wang, B.; Alvarez-Mesa, M.; Chi, C.C.; Juurlink, B.


Caragiannis, I.:
Euro-Par 2012. Parallel processing workshops : BDMC, CGWS, HeteroPar, HiBB, OMHI, Paraphrase, PROPER, Resilience, UCHPC, VHPC, Rhodes Island, Greece, August 27 - 31, 2012; Revised selected papers
Berlin: Springer, 2013 (Lecture Notes in Computer Science 7640)
ISBN: 3-642-36948-0
ISBN: 978-3-642-36948-3 (Print)
ISBN: 978-3-642-36949-0 (Online)
International Conference "Euro-Par" <18, 2012, Rhodes>
International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar) <10, 2012, Rhodes>
Fraunhofer HHI

In this paper, we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. Because most of the IDCT input coefficients for real videos are zero-valued, we create a new compacted data representation that enables several optimizations. Experimental evaluations on different GPUs show average speedups from 1.7x to 7.4x compared to an optimized single-threaded SIMD CPU version.
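The abstract does not spell out the compacted layout, so the following is only an illustrative sketch of the general idea of storing non-zero coefficients alone. It assumes 4x4 blocks flattened to 16 integers in row-major order; the names `compact` and `expand` and the triple format are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# Assumption: one block is 16 integer coefficients of a 4x4 transform block,
# stored in row-major order. This layout is illustrative, not the paper's.
Block = List[int]
Triple = Tuple[int, int, int]  # (block index, position in block, value)


def compact(blocks: List[Block]) -> List[Triple]:
    """Keep only the non-zero coefficients, tagged with their location."""
    return [
        (b, pos, coef)
        for b, block in enumerate(blocks)
        for pos, coef in enumerate(block)
        if coef != 0
    ]


def expand(triples: List[Triple], num_blocks: int, block_size: int = 16) -> List[Block]:
    """Rebuild the dense blocks from the compacted triples."""
    blocks = [[0] * block_size for _ in range(num_blocks)]
    for b, pos, coef in triples:
        blocks[b][pos] = coef
    return blocks


# Hypothetical input: an all-zero block, a DC-only block, and a sparse block.
example_blocks = [
    [0] * 16,
    [12] + [0] * 15,
    [7, -3] + [0] * 13 + [1],
]
example_triples = compact(example_blocks)
```

In this example only 4 of the 48 coefficients are stored, and the all-zero block produces no entries at all, so a later transform stage could skip it entirely. How the paper's GPU kernels actually consume the compacted data is not described in the abstract.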