• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints
 
  • Details
  • Full
Options
2025
Conference Paper
Title

SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints

Abstract
Open-vocabulary segmentation, powered by large visual-language models like CLIP, has expanded 2D segmentation capabilities beyond fixed classes predefined by the dataset, enabling zero-shot understanding across diverse scenes. Extending these capabilities to 3D segmentation introduces challenges, as CLIP’s image-based embeddings often lack the geometric detail necessary for 3D scene segmentation. Recent methods tend to address this by introducing additional segmentation models or replacing CLIP with variations trained on segmentation data, which lead to redundancy or loss on CLIP’s general language capabilities. To overcome this limitation, we introduce SPNeRF, a NeRF based zero-shot 3D segmentation approach that leverages geometric priors. We integrate geometric primitives derived from the 3D scene into NeRF training to produce primitive-wise CLIP features, avoiding the ambiguity of point-wise features. Additionally, we propose a primitive-based merging mechanism enhanced with affin ity scores. Without relying on additional segmentation models, our method further explores CLIP’s capability for 3D segmentation and achieves notable improvements over orig-inal LERF.
Author(s)
Hu, Weiwen
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Parodi, Niccolo
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Zepp, Marcus
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Feldmann, Ingo  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Schreer, Oliver  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Eisert, Peter  
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Mainwork
VISIGRAPP 2025, 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications. Proceedings. Vol.3: VISAPP  
Conference
International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications 2025  
International Conference on Computer Vision Theory and Applications 2025  
Open Access
DOI
10.5220/0013255100003912
Additional link
Full text
Language
English
Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut HHI  
Keyword(s)
  • 3D

  • Computer Vision

  • Neural Radiance Field

  • Point Cloud

  • Semantic Segmentation

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024