• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Selective Augmentation: Improving Universal Automatic Phonetic Transcription via G2P Bootstrapping
 
  • Details
  • Full
Options
May 2026
Conference Paper
Title

Selective Augmentation: Improving Universal Automatic Phonetic Transcription via G2P Bootstrapping

Abstract
In the field of universal automatic phonetic transcription (APT), clean and diverse training transcriptions are required. However, such high-quality data is limited. We propose the bootstrapping approach Selective Augmentation to improve the available training transcriptions by selectively transferring distinctions between languages. Based on the model MultIPA, we exemplarily show that we could increase the accuracy of an existing feature (plosive voicing) and add a new feature (plosive aspiration) by augmenting the existing training data using information from a separate helper language (Hindi). We describe intrinsic challenges of the evaluation and develop objective metrics to determine the success: Voicing accuracy was increased by 17.6% by reducing the number of false positives. Additionally, aspiration recognition was introduced: While the baseline transcribed 0% of German /p, t, k/ as aspirated, our approach transcribed them as aspirated in 61.2% of the cases. Introducing aspiration recognition to APT models allowed for the tenuis class to be successfully reduced by 32.2%, which also reduces the conflations between the test language’s plosives.
Author(s)
Bystrich, Tobias
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Pritzen, Julia  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Schmidt, Christoph Andreas  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Wich-Reif, Claudia
Universität Bonn  
Mainwork
Fifteenth Language Resources and Evaluation Conference, LREC 2026. Proceedings  
Conference
Language Resources and Evaluation Conference 2026  
Open Access
File(s)
Download (491.39 KB)
Rights
CC BY-NC 4.0: Creative Commons Attribution-NonCommercial
DOI
10.63317/53t62v2i3f8m
10.24406/publica-8666
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • automatic speech recognition

  • phonetics

  • phonetic transcription

  • bootstrapping

  • G2P

  • NLP

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024