• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. TransCeption: Enhancing medical image segmentation with an inception-like transformer design for efficient feature fusion
 
  • Details
  • Full
Options
2025
Journal Article
Title

TransCeption: Enhancing medical image segmentation with an inception-like transformer design for efficient feature fusion

Abstract
While CNN-based methods have been the cornerstone of medical image segmentation due to their promising performance and robustness, they suffer from limitations in capturing long-range dependencies. Transformer-based approaches are currently prevailing since they enlarge the receptive field to model global contextual correlations. To further extract rich representations, some extensions of U-Net employ multi-scale feature extraction and fusion modules to obtain improved performance. Inspired by this idea, we propose TransCeption for medical image segmentation, a pure transformer-based U-shaped network incorporating an inception-like module in the encoder and adopting a contextual bridge for better feature fusion. The design proposed in this work is based on three core principles. (i) The patch merging module in the encoder is redesigned to use ResInception Patch Merging (RIPM). The Multi-Branch (MB) transformer has the same number of branches as the outputs of RIPM. Combining the two modules enables the model to capture a multi-scale representation within a single stage. (ii) We apply an Intra-stage Feature Fusion (IFF) module following the MB transformer to enhance the aggregation of feature maps from all branches and particularly focus on the interaction between the different channels at all scales. (iii) In contrast to a bridge that only contains token-wise self-attention, we propose a Dual Transformer Bridge that also includes channel-wise self-attention to exploit correlations between scales at different stages from a dual perspective. Extensive experiments on multi-organ and skin lesion segmentation tasks show the superiority of TransCeption to previous work. The code is publicly available on GitHub.
Author(s)
Azad, Reza
Rheinisch-Westfälische Technische Hochschule Aachen
Jia, Yiwei
Rheinisch-Westfälische Technische Hochschule Aachen
Aghdam, Ehsan Khodapanah
Shahid Beheshti University
Cohen-Adad, Julien
Polytechnique Montréal
Merhof, Dorit
Fraunhofer-Institut für Digitale Medizin MEVIS  
Journal
Computational Visual Media  
Open Access
File(s)
Download (12.33 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.26599/CVM.2025.9450407
10.24406/publica-6537
Additional link
Full text
Language
English
Fraunhofer-Institut für Digitale Medizin MEVIS  
Keyword(s)
  • inception

  • medical image segmentation

  • multi-scale feature fusion

  • transformer

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024