Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Audio clips content comparison using latent semantic indexing

: Biatov, K.; Köhler, J.; Schneider, D.


Institute of Electrical and Electronics Engineers -IEEE-:
ICSC 2009, Third IEEE International Conference on Semantic Computing : Berkeley, CA, USA - September 14-16, 2009
Piscataway/NJ: IEEE, 2009
ISBN: 978-1-4244-4962-0
International Conference on Semantic Computing (ICSC) <3, 2009, Berkeley/Calif.>
Conference Paper
Fraunhofer IAIS ()
semantic analysis; Latent Semantic Indexing; social tags; Singular Value Decomposition; large vocabulary continuous speech recognition

This paper describes experiments for audio clips comparison based on spoken context. The spoken content is obtained using automatic speech recognition. The social tags that are available for most of the audio clips are used as keywords. These keywords are mapped to the spoken transcription representing the audio clips on the base of the social tags-keywords. The clips are described using the term frequency-inverse document frequency weighting. This description statistically evaluates how important are the keywords for the documents. The Latent Semantic Indexing (LSI) is applied on audio clips-feature vectors matrix mapping the clips content into low dimensional latent semantic space. The clips are compared using document-document comparison measure based in LSI. The similarity based on LSI is compared with the results obtained by using the standard vector space model.