• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. GDCK: Efficient Large-Scale Graph Distillation Utilizing a Model-Free Kernelized Approach
 
  • Details
  • Full
Options
2025
Conference Paper
Title

GDCK: Efficient Large-Scale Graph Distillation Utilizing a Model-Free Kernelized Approach

Abstract
Large-scale graph distillation, including applications to social networks and literature citation graphs, has shown significant progress in recent years. Existing methods primarily rely on minimizing surrogate objectives, such as gradient or distribution discrepancies between the original and condensed graphs, or aligning training trajectories. However, these approaches are often computationally intensive, requiring nested optimization loops or the training of multiple expert models to guide student models. To overcome these issues, we propose GDCK, a novel and efficient approach for graph distillation based on neural tangent kernels (NTK). GDCK leverages NTKs with kernel ridge regression, eliminating the need to train graph neural networks and significantly reducing computation time. By applying NTKs to randomly selected sub-graphs and within individual classes, GDCK preserves critical structural information for high-performance outcomes. Additionally, it incorporates node importance, effectively compressing nodes whose neighbors exhibit diverse labels into the synthetic graph. Experiments on node classification tasks demonstrate that GDCK achieves rapid convergence in early training epochs, substantially reducing time costs while maintaining competitive classification performance. This approach offers a practical and scalable solution for graph distillation, advancing its utility in real-world scenarios. Our code is available at: https://github.com/SchenbergZY/GDCK.
Author(s)
Zhang, Yue
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Chen, Zongxiong
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Liu, Qian
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Schimmler, Sonja  
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Hauswirth, Manfred  
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Mainwork
Data Science: Foundations and Applications. 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2025. Proceedings. Part VII  
Conference
Pacific-Asia Conference on Knowledge Discovery and Data Mining 2025  
DOI
10.1007/978-981-96-8298-0_19
Language
English
Fraunhofer-Institut für Offene Kommunikationssysteme FOKUS  
Keyword(s)
  • Graph Condensation

  • Graph Neural Networks

  • Kernel Ridge Regression

  • Neural Tangent Kernels

  • Node-level Classification

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024