2007
Conference Paper
Title

Supporting radio archive workflows with vocabulary independent spoken keyword search

Abstract
Archive departments of large radio broadcasters stand to benefit greatly from speech recognition technology and other audio processing techniques. To move towards a practical understanding of how these technologies can support archive staff, two large German radio broadcasters, Deutsche Welle and Westdeutscher Rundfunk, commissioned Fraunhofer IAIS to build a German-language radio archive prototype. This paper discusses the development and assessment of the spoken keyword search module of this prototype. The search module was designed and tested by a project group consisting of both multimedia researchers and archive professionals. As a result, the prototype is unique in that its design and evaluation are tuned explicitly to the requirements of archivists. The paper discusses the special needs of radio archive staff and how they were accommodated in the design of the keyword search functionality. In particular, the archive staff required a vocabulary-independent search facility capable of finding keywords in an archive containing a high proportion of spontaneous speech. Keyword search is implemented using a fuzzy-matching algorithm that performs a similarity search on syllable transcripts generated by the speech recognizer. An evaluation assesses whether the radio archive prototype fulfills the needs of archivists.
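
The abstract does not specify the fuzzy-matching algorithm, so the following is only a minimal sketch of one common way to run a similarity search over syllable transcripts: approximate subsequence matching with edit distance. The function name `fuzzy_find`, the `max_cost` threshold, the unit edit costs, and the example syllables are all assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def fuzzy_find(query: Sequence[str], transcript: Sequence[str],
               max_cost: int) -> List[Tuple[int, int]]:
    """Return (end_index, cost) for each transcript position where the query
    syllables match some span ending there with edit distance <= max_cost.

    Hypothetical helper: unit costs for substitution, insertion and deletion
    are an assumption, not the scoring used in the paper.
    """
    m = len(query)
    # prev[i] = cheapest cost of aligning query[:i] with a span that ends at
    # the previous transcript position. A match may start anywhere in the
    # transcript, so aligning the empty query prefix always costs 0.
    prev = list(range(m + 1))
    hits: List[Tuple[int, int]] = []
    for j, syllable in enumerate(transcript):
        curr = [0] * (m + 1)
        for i in range(1, m + 1):
            substitute = prev[i - 1] + (query[i - 1] != syllable)
            skip_transcript = prev[i] + 1      # extra syllable in the transcript
            skip_query = curr[i - 1] + 1       # query syllable missing from it
            curr[i] = min(substitute, skip_transcript, skip_query)
        if curr[m] <= max_cost:
            hits.append((j + 1, curr[m]))      # end index is exclusive
        prev = curr
    return hits


# The recognizer output contains "tes" where the query has "des";
# the keyword is still found with one substitution.
query = ["bun", "des", "tag"]
transcript = ["der", "bun", "tes", "tag", "hat"]
print(fuzzy_find(query, transcript, max_cost=1))
```

Because the query is compared as a sequence of syllables rather than looked up in a fixed word list, a search of this kind does not depend on the recognizer's vocabulary, and the cost threshold lets it tolerate recognition errors of the sort that spontaneous speech tends to produce.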

Author(s)
Larson, M.
Eickeler, Stefan  
Köhler, Joachim  
Mainwork
Proceedings of the ACM SIGIR Workshop on Searching Spontaneous Conversational Speech  
Conference
Workshop "Searching Spontaneous Conversational Speech" 2007  
International Conference on Research and Development in Information Retrieval (SIGIR) 2007  
Language
English
Institute
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
Keyword(s)
  • spoken document retrieval
  • broadcast news
  • speech recognition
