Fraunhofer-Gesellschaft
2009
Conference Paper
Title

Cross-modal identification of audiovisual streams directly from the compressed domain

Abstract
In recent years, a number of search and retrieval methods for audio and visual content have been described in the literature, and cross-modal approaches have recently started to emerge. All of these search methods are based on audiovisual fingerprints, which are extracted from the audiovisual data prior to the actual search. Since most data are available in the compressed domain, they must normally be decompressed before feature extraction. This paper describes an audiovisual search engine that extracts its features directly from the compressed domain, without running a decoding algorithm. The direct video feature extraction is based on motion vectors, and the direct audio feature extraction is based on a polyphase matrix description. The paper gives an overview of the search engine, which operates cross-modally on audio and visual features, and describes the evaluation of the direct feature extraction methods in detail.
Author(s)
Gruhne, M.
Dunker, P.
Mikhalev, A.
Fedotov, I.
Andritsopoulos, F.
Mainwork
IEEE 13th International Symposium on Consumer Electronics, ISCE 2009. Proceedings. Vol.2  
Conference
International Symposium on Consumer Electronics (ISCE) 2009  
DOI
10.1109/ISCE.2009.5157028
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • audio visual system
  • data compression
  • decoding
  • feature extraction
  • information retrieval
  • media streaming
