• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol
 
  • Details
  • Full
Options
2024
Conference Paper
Title

Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol

Abstract
This paper presents a baseline approach and an experimental protocol for a specific content verification problem: detecting discrepancies between the audio and video modalities in multimedia content. We first design and optimize an audio-visual scene classifier, to compare with existing classification baselines that use both modalities. Then, by applying this classifier separately to the audio and the visual modality, we can detect scene-class inconsistencies between them. To facilitate further research and provide a common evaluation platform, we introduce an experimental protocol and a benchmark dataset simulating such inconsistencies. Our approach achieves state-of-the-art results in scene classification and promising outcomes in audio-visual discrepancies detection, highlighting its potential in content verification applications.
Author(s)
Apostolidis, Konstantinos
Abeßer, Jakob  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Cuccovillo, Luca  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Vasileios, Mezaris
Mainwork
Proceedings of the 3rd ACM International Workshop on Multimedia AI against Disinformation, MAD'24  
Conference
International Workshop on Multimedia AI against Disinformation 2024  
Open Access
DOI
10.1145/3643491.3660287
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Media Forensics

  • audio-visual forensics

  • audio-visual scene classification

  • content verification

  • self-attention

  • Environmental Sound Analysis

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024