• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Recent Trends in Edge AI: Efficient Design, Training and Deployment of Machine Learning Models
 
  • Details
  • Full
Options
2025
Book Article
Title

Recent Trends in Edge AI: Efficient Design, Training and Deployment of Machine Learning Models

Abstract
With a rising demand for ubiquitous smart systems, processing and interpreting large quantities of data generated on the edge at a high velocity is becoming an increasingly important challenge. Machine learning (ML) models such as Deep Neural Networks (DNNs) are an essential tool of today's artificial intelligence due to their ability to make accurate predictions given complex tasks and environments. However, Deep Learning is computationally complex and energy intensive. This seems to contradict the characteristics of many edge devices, which have only limited memory, computational resources, and energy budget available. To overcome this challenge, an efficient ML model design is crucial that incorporates available optimization techniques from hardware, software, and methodological perspective to enable energy-efficient deployment and operation on the edge. This work comprehensively summarizes recent techniques for training, optimizing, and deploying ML models targeting edge devices. We discuss different strategies for finding deployable ML models, scalable DNN architectures, neural architecture search, and multi-objective optimization approaches, to enable feasible trade-offs considering available resources and latency. Furthermore, we give insight into DNN compression methods such as quantization and pruning. We conclude by investigating different forms of cascaded processing, from simple multi-level approaches to highly branched compute graphs and early-exit DNNs.
Author(s)
Deutel, Mark
Friedrich-Alexander-Universität Erlangen-Nürnberg
Mallah, Maen
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Wißing, Julio Emmanuel Diem
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Scheele, Stephan
Ostbayerische Technische Hochschule Regensburg
Mainwork
Charting the Intelligence Frontiers Edge AI Systems Nexus  
DOI
10.1201/9788743808862-9
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Cascaded processing

  • Energy efficient AI

  • Neural architecture search

  • Pruning

  • Quantization

  • TinyML

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024