CC BY 4.0Würz, Hendrik MartinHendrik MartinWürzKocon, KevinKevinKoconPedretscher, BarbaraBarbaraPedretscherKlien, EvaEvaKlienEggeling, EvaEvaEggeling2023-06-292023-06-292023https://publica.fraunhofer.de/handle/publica/444955https://doi.org/10.24406/publica-156510.5194/agile-giss-4-53-202310.24406/publica-1565We present a platform to support the AI development lifecycle with focus on large data like remote sensing.We target developers who are not allowed to use existing commercial cloud platforms for legal reasons or data compliance. The flexible implementation of our platform enables a deployment on classic server infrastructures as well as on internal clouds. Our goals of scalable and resource-efficient execution, independence from specific AI frameworks and programming languages, as well as reproducibility of results are met through a workflow-based calculation combined with the tool Data Version Control. The capabilities of the platform are demonstrated by training an AI-based forest type classification.enBranche: Information TechnologyBranche: Bioeconomics and InfrastructureResearch Line: Machine learning (ML)LTA: Scalable architectures for massive data setsLTA: Machine intelligence, algorithms, and data structures (incl. semantics)Artificial intelligence (AI)Workflow managementCloud computingRemote sensingA Scalable AI Training Platform for Remote Sensing Dataconference paper