• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition
 
  • Details
  • Full
Options
2023
Journal Article
Title

Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition

Abstract
Cross-modal representation learning learns a shared embedding between two or more modalities to improve performance in a given task compared to using only one of the modalities. Cross-modal representation learning from different data types - such as images and time-series data (e.g., audio or text data) - requires a deep metric learning loss that minimizes the distance between the modality embeddings. In this paper, we propose to use the contrastive or triplet loss, which uses positive and negative identities to create sample pairs with different labels, for cross-modal representation learning between image and time-series modalities (CMR-IS). By adapting the triplet loss for cross-modal representation learning, higher accuracy in the main (time-series classification) task can be achieved by exploiting additional information of the auxiliary (image classification) task. We present a triplet loss with a dynamic margin for single label and sequence-to-sequence classification tasks. We perform extensive evaluations on synthetic image and time-series data, and on data for offline handwriting recognition (HWR) and on online HWR from sensor-enhanced pens for classifying written words. Our experiments show an improved classification accuracy, faster convergence, and better generalizability due to an improved cross-modal representation. Furthermore, the more suitable generalizability leads to a better adaptability between writers for online HWR.
Author(s)
Ott, Felix
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Rügamer, David
Heublein, Lucas
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Bischl, Bernd
Mutschler, Christopher  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Journal
IEEE access  
Open Access
DOI
10.1109/ACCESS.2023.3310819
Additional link
Full text
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Contrastive learning

  • cross-modal retrieval

  • online handwriting recognition

  • optical character recognition

  • representation learning

  • sensor-enhanced pen

  • sequence-based learning

  • triplet learning

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024