• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Evaluation of Tools and Frameworks for Machine Learning Model Serving
 
  • Details
  • Full
Options
August 20, 2025
Conference Paper
Title

Evaluation of Tools and Frameworks for Machine Learning Model Serving

Abstract
Machine learning (ML) models are ubiquitous, as ML expands into numerous application domains. Despite this growth, the software engineering of ML systems remains complex, particularly in production environments. Serving ML models for inference is a critical part, as it provides the interface between the model and the surrounding components. Today, a variety of open source tools and frameworks for model serving exists which promise ease of use and performance. However, they differ in terms of usability, flexibility, scalability, and their overall performance. In this work, we systematically evaluate several popular model serving tools and frameworks in the context of a natural language processing scenario. In detail, we analyze their features and capabilities, conduct runtime experiments, and report on our experiences from various real-world ML projects. Our evaluation results provide valuable insights and considerations for ML engineers and other practitioners seeking for effective serving environments that seamlessly integrate with the existing ML tech stack, simplifying and accelerating the process of serving ML models in production.
Author(s)
Beck, Niklas  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Stein, Benny Jörg  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Helmer, Lennard  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Wegener, Dennis  orcid-logo
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
IEEE/ACM 47th International Conference on Software Engineering: Software Engineering in Practice, ICSE-SEIP 2025. Proceedings  
Project(s)
Aufbau eines Gaia-X Knotens für große KI-Sprachmodelle und innovative Sprachapplikations-Services; Teilvorhaben: Entwicklung von Sprachmodellen, Interoperabilitäts- und Nutzungskonzepten  
Funder
Bundesministerium für Wirtschaft und Energie  
Conference
International Conference on Software Engineering - Software Engineering in Practice 2025  
DOI
10.1109/ICSE-SEIP66354.2025.00013
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • Machine learning

  • Inference engines

  • Performance evaluation

  • Solution Deployment

  • Natural Language Processing

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024