• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. HyenaPixel: Global Image Context with Convolutions
 
  • Details
  • Full
Options
October 2024
Conference Paper
Title

HyenaPixel: Global Image Context with Convolutions

Abstract
In computer vision, a larger effective receptive field (ERF) is associated with better performance. While attention natively supports global context, its quadratic complexity limits its applicability to tasks that benefit from high-resolution input. In this work, we extend Hyena, a convolution-based attention replacement, from causal sequences to bidirectional data and two-dimensional image space. We scale Hyena’s convolution kernels beyond the feature map size, up to 191×191, to maximize ERF while maintaining sub-quadratic complexity in the number of pixels. We integrate our two-dimensional Hyena, HyenaPixel, and bidirectional Hyena into the MetaFormer framework. For image categorization, HyenaPixel and bidirectional Hyena achieve a competitive ImageNet-1k top-1 accuracy of 84.9% and 85.2%, respectively, with no additional training data, while outperforming other convolutional and large-kernel networks. Combining HyenaPixel with attention further improves accuracy. We attribute the success of bidirectional Hyena to learning the data-dependent geometric arrangement of pixels without a fixed neighborhood definition. Experimental results on downstream tasks suggest that HyenaPixel with large filters and a fixed neighborhood leads to better localization performance.
Author(s)
Spravil, Julian
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Houben, Sebastian
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Behnke, Sven  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
ECAI 2024, 27th European Conference on Artificial Intelligence. Proceedings  
Conference
European Conference on Artificial Intelligence 2024  
Conference on Prestigious Applications of Intelligent Systems 2024  
Open Access
DOI
10.3233/FAIA240529
Additional link
Full text
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • Complex networks

  • Convolution kernel

  • Global context

  • High resolution

  • Image space

  • Large effective

  • Neighbourhood

  • Performance

  • Quadratic complexity

  • Receptive fields

  • Two dimensional images

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024