• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Catnip for MedCAT: Optimizing the Input for Automated SNOMED CT Mapping of Clinical Variables
 
  • Details
  • Full
Options
2025
Journal Article
Title

Catnip for MedCAT: Optimizing the Input for Automated SNOMED CT Mapping of Clinical Variables

Abstract
Introduction: Mapping local medical data assets to international data standards such as medical ontology SNOMED CT fosters data harmonization and, thereby, global progress in medical research. Since its intense resource requirements often hinder manual SNOMED CT mapping, automated mapping tools such as MedCAT have been developed. We investigated how the formulation of study variable names (VNs) influences the efficacy and accuracy of the SNOMED CT concepts identified by MedCAT.
Methods: We extracted 763 VNs from the GEPESTIM database hosted locally in REDCap and created three VNs using different REDCap metadata items for MedCAT-based SNOMED CT mapping. A fourth VN version was created manually. The mapping was evaluated based on the number and quality of identified SNOMED CT concepts, using manual scoring to assess concept accuracy while ensuring a blind evaluation process.
Results: Increasing the expressiveness of VNs by adding more metadata items led to more SNOMED CT concepts being mapped, but also introduced mismatches, particularly when additionally included metadata contained misleading terms. The best overall mapping performance was achieved on the manually specified VNs while a basic VN version with minimal extra information from the metadata resulted in similarly good results.
Conclusion: Our study identified key challenges in using MedCAT for automatically mapping study variables to SNOMED CT concepts. To improve accuracy, we recommend refining VNs reducing misleading terms and iteratively improving VN phrasing for optimal mapping outcome. Furthermore, it appears reasonable to always conduct a final manual review of the mapping outcome especially for critical variables and for those VNs containing negations or abbreviations.
Author(s)
Gehrmann, Julia
Medizinische Fakultät
Dogan, Asme
Medizinische Fakultät
Hagelschuer, Lea
Medizinische Fakultät
Quakulinski, Lars
Medizinische Fakultät
Koy, Anne
Medizinische Fakultät
Beyan, Oya Deniz
Fraunhofer-Institut für Angewandte Informationstechnik FIT  
Journal
Studies in health technology and informatics  
Open Access
File(s)
Download (436.15 KB)
Rights
CC BY-NC 4.0: Creative Commons Attribution-NonCommercial
DOI
10.3233/SHTI251390
10.24406/publica-5653
Additional link
Full text
Language
English
Fraunhofer-Institut für Angewandte Informationstechnik FIT  
Keyword(s)
  • Automation

  • Computerized Medical Record System

  • Controlled Vocabulary

  • Natural Language Processing

  • SNOMED CT

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024