Now showing 1 - 10 of 533
  • Publication
    A global scale comparison of risk aggregation in AI assessment frameworks
    ( 2024-05-06) ; ;
    Görge, Rebekka
    ;
    Cremers, Armin B.
    ;
    AI applications bear inherent risks in various risk dimensions, such as insufficient reliability, robustness, fairness or data protection. It is well-known that trade-offs between these dimensions can arise, for example, a highly accurate AI application may reflect unfairness and bias of the real-world data, or may provide hard-to-explain outcomes because of its internal complexity. AI risk assessment frameworks aim to provide systematic approaches to risk assessment in various dimensions. The overall trustworthiness assessment is then generated by some form of risk aggregation among the risk dimensions. This paper provides a systematic overview on risk aggregation schemes used in existing AI risk assessment frameworks, focusing on the question how potential trade-offs among the risk dimensions are incorporated. To this end, we examine how the general risk notion, the application context, the extent of risk quantification, and specific instructions for evaluation may influence overall risk aggregation. We discuss our findings in the current frameworks in terms of whether they provide meaningful and practicable guidance. Lastly, we derive recommendations for the further operationalization of risk aggregation both from horizontal and vertical perspectives.
  • Publication
    On the effects of biased quantum random numbers on the initialization of artificial neural networks
    Recent advances in practical quantum computing have led to a variety of cloud-based quantum computing platforms that allow researchers to evaluate their algorithms on noisy intermediate-scale quantum devices. A common property of quantum computers is that they can exhibit instances of true randomness as opposed to pseudo-randomness obtained from classical systems. Investigating the effects of such true quantum randomness in the context of machine learning is appealing, and recent results vaguely suggest that benefits can indeed be achieved from the use of quantum random numbers. To shed some more light on this topic, we empirically study the effects of hardware-biased quantum random numbers on the initialization of artificial neural network weights in numerical experiments. We find no statistically significant difference in comparison with unbiased quantum random numbers as well as biased and unbiased random numbers from a classical pseudo-random number generator. The quantum random numbers for our experiments are obtained from real quantum hardware.
  • Publication
    AITA: AI trustworthiness assessment. AAAI spring symposium 2023
    ( 2024-01-03)
    Braunschweig, Bertrand
    ;
    Buijsman, Stefan
    ;
    Chamroukhi, Faïcel
    ;
    Heintz, Fredrik
    ;
    Khomh, Foutse
    ;
    Mattioli, Juliette
    ;
  • Publication
    Rechtliche Fairnessanforderungen an KI-Systeme und ihre technische Evaluation
    ( 2024)
    Feldkamp, Jakob
    ;
    Kappler, Quirin
    ;
    ; ;
    Weiss, Erik
    Im vorliegenden Beitrag werden rechtliche Fairness-Anforderungen an KI-Systeme anhand ausgewählter Kreditscoring-Systeme beleuchtet. Einerseits wird das geltende Recht in den Blick genommen und insbesondere die Frage einer mittelbaren Drittwirkung des Art. 3 GG im Kontext des Kreditscorings erörtert. Andererseits wird auch ein Fokus auf die zukünftige europäische KI-Verordnung (KI-VO) gelegt. Mittels einer juristisch-informatischen Analyse werden Ansätze zur Interpretation und Operationalisierung der fairnessbezogenen Anforderungen an die verwendeten Daten gem. Art. 10 KI-VO nach dem Stand der Technik diskutiert. Diese werden sodann exemplarisch anhand eines konkreten Kreditscoring-Beispiels evaluiert. Der Beitrag schließt mit Empfehlungen hinsichtlich ausgewählter rechtlicher Rahmenbedingungen für einen "fairen" KI-Einsatz.
  • Publication
    Wasserstein Dropout
    ( 2024)
    Sicking, Joachim
    ;
    ;
    Pintz, Maximilian Alexander
    ;
    ; ;
    Fischer, Asja
    Despite of its importance for safe machine learning, uncertainty quantification for neural networks is far from being solved. State-of-the-art approaches to estimate neural uncertainties are often hybrid, combining parametric models with explicit or implicit (dropout-based) ensembling. We take another pathway and propose a novel approach to uncertainty quantification for regression tasks, Wasserstein dropout, that is purely non-parametric. Technically, it captures aleatoric uncertainty by means of dropout-based sub-network distributions. This is accomplished by a new objective which minimizes the Wasserstein distance between the label distribution and the model distribution. An extensive empirical analysis shows that Wasserstein dropout outperforms state-of-the-art methods, on vanilla test data as well as under distributional shift in terms of producing more accurate and stable uncertainty estimates.
  • Publication
    Automatic scoring of Rhizoctonia crown and root rot affected sugar beet fields from orthorectified UAV images using Machine Learning
    ( 2024)
    Ispizua Yamati, Facundo Ramón
    ;
    ;
    Barreto Alcántara, Abel Andree
    ;
    Bömer, Jonas
    ;
    Laufer, Daniel
    ;
    ;
    Mahlein, Anne-Katrin
    Rhizoctonia crown and root rot (RCRR), caused by Rhizoctonia solani, can cause severe yield and quality losses in sugar beet. The most common strategy to control the disease is the development of resistant varieties. In the breeding process, field experiments with artificial inoculation are carried out to evaluate the performance of genotypes and varieties. The phenotyping process in breeding trials requires constant monitoring and scoring by skilled experts. This work is time demanding and shows bias and heterogeneity according to the experience and capacity of each individual person. Optical sensors and artificial intelligence have demonstrated a great potential to achieve higher accuracy than human raters and the possibility to standardize phenotyping applications. A workflow combining red-green-blue (RGB) and multispectral imagery coupled to an unmanned aerial vehicle (UAV), and machine learning techniques was applied to score diseased plants and plots affected by RCRR. Georeferenced annotation of UAV orthorectified images. With the annotated images, five convolutional neural networks were trained to score individual plants. The training was carried out with different image analysis strategies and data augmentation, respectively. The custom convolutional neural network trained from scratch together with a pre-trained MobileNet showed the best precision in scoring RCRR (0.73 to 0.85). The average per plot of spectral information was used to score plots, and the benefit of adding the information obtained from the score of individual plants was compared. For this purpose, machine learning models were trained together with data management strategies, and the best-performing model was chosen. A combined pipeline of Random Forest and k-Nearest neighbors have shown the best weighted precision (0.67). This research provides a reliable workflow for detecting and scoring RCRR based on aerial imagery. RCRR is often distributed heterogeneously in trial plots, therefore, considering the information from individual plants of the plots showed a significant improvement of UAV based automated monitoring routines.
  • Publication
    Short-term predictor for COVID-19 severity from a longitudinal multi-omics study for practical application in intensive care units
    ( 2024) ;
    Hahnefeld, Lisa
    ;
    Kloka, Jan Andreas
    ;
    ;
    Nürenberg-Goloub, Elena
    ;
    Zinn, Sebastian
    ;
    Vehrenschild, Maria
    ;
    Zacharowski, Kai
    ;
    Lindau, Simone
    ;
    Ulrich, Evelyn
    ;
    ; ;
    Schwäble, Joachim
    ;
    Gurke, Robert
    ;
    Dorochow, Erika
    ;
    Bennett, Alexandre
    ;
    Dauth, Stephanie
    ;
    Campe, Julia
    ;
    Knape, Tilo
    ;
    Laux, Volker
    ;
    Kannt, Aimo
    ;
    Köhm, Michaela
    ;
    ;
    Resch, Eduard
    ;
    Behrens, Frank
    Background: The COVID-19 pandemic challenged the management of technical and human resources in intensive care units (ICU) across the world. Several long-term predictors for COVID-19 disease progression have been discovered. However, predictors to support short-term planning of resources and medication that can be translated to future pandemics are still missing. A workflow was established to identify a predictor for short-term COVID-19 disease progression in the acute phase of intensive care patients to support clinical decision-making. Methods: Thirty-two patients with SARS-CoV-2 infection were recruited on admission to the ICU and clinical data collected. During their hospitalization, plasma samples were acquired from each patient on multiple occasions, excepting one patient for which only one time point was possible, and the proteome (Inflammation, Immune Response and Organ Damage panels from Olink® Target 96), metabolome and lipidome (flow injection analysis and liquid chromatography-mass spectrometry) analyzed for each sample. Patient visits were grouped according to changes in disease severity based on their respiratory and organ function, and evaluated using a combination of statistical analysis and machine learning. The resulting short-term predictor from this multi-omics approach was compared to the human assessment of disease progression. Furthermore, the potential markers were compared to the baseline levels of 50 healthy subjects with no known SARS-CoV-2 or other viral infections. Results: A total of 124 clinical parameters, 271 proteins and 782 unique metabolites and lipids were assessed. The dimensionality of the dataset was reduced, selecting 47 from the 1177 parameters available following down-selection, to build the machine learning model. Subsequently, two proteins (C-C motif chemokine 7 (CCL7) and carbonic anhydrase 14 (CA14)) and one lipid (hexosylceramide 18:2; O2/20:0) were linked to disease progression in the studied SARS-CoV-2 infections. Thus, a predictor delivering the prognosis of an upcoming worsening of the patient's condition up to five days in advance with a reasonable accuracy (79 % three days prior to event, 84 % four to five days prior to event) was found. Interestingly, the predictor's performance was complementary to the clinicians' capabilities to foresee a worsening of a patient. Conclusion: This study presents a workflow to identify omics-based biomarkers to support clinical decision-making and resource management in the ICU. This was successfully applied to develop a short-term predictor for aggravation of COVID-19 symptoms. The applied methods can be adapted for future small cohort studies.
  • Publication
    Einsatz von KI-basierten Anwendungen durch Krankenhauspersonal: Aufgabenprofile und Qualifizierungsbedarfe
    ( 2024) ;
    Albiez, Daniela
    ;
    Bures, Dominik Martin
    ;
    Hosters, Bernadette
    ;
    Jovy-Klein, Florian
    ;
    Nickel, Kilian
    ;
    Reibel, Thomas
    ;
    Schramm, Johanna
    ;
    ;
    Antons, David
    ;
    Diehl, Anke
    Künstliche Intelligenz (KI) hat für Krankenhäuser wesentlich an Bedeutung gewonnen. Um die umfangreichen Potenziale der Technologie tatsächlich nutzen zu können, sind Anpassungen bei Aufgabenprofilen sowie zielgerichtete Qualifizierungsmaßnahmen für das Krankenhauspersonal heute und in Zukunft unabdingbar. Davon sind sowohl medizinische als auch nichtmedizinische Prozesse entlang der gesamten Wertschöpfungskette im Krankenhaus betroffen. Ziel der Arbeit ist es, einen Überblick über die notwendigen Fähigkeiten im Umgang mit intelligenten Technologien im klinischen Kontext zu geben und Maßnahmen zur Qualifizierung von Mitarbeiter*innen vorzustellen. Im Rahmen des Projekts "SmartHospital.NRW" wurden im Jahr 2022 eine Literaturrecherche sowie Interviews und Workshops mit Expert*innen durchgeführt. KI-Technologien und Anwendungsfelder wurden identifiziert. Zentrale Ergebnisse umfassen veränderte und neue Aufgabenprofile, identifizierte Synergien und Abhängigkeiten zwischen den einzelnen Aufgabenprofilen sowie die Notwendigkeit eines umfassenden interdisziplinären und interprofessionellen Austauschs beim Einsatz von KI-basierten Anwendungen im Krankenhaus. Unser Beitrag zeigt, dass Krankenhäuser frühzeitig Kompetenzen im Bereich Digital Health Literacy in der Belegschaft fördern und gleichzeitig technikaffines Personal anwerben müssen. Interprofessionelle Austauschformate sowie ein begleitendes Changemanagement sind essenziell für die Nutzung von KI im Krankenhaus.
  • Publication
    Using ScrutinAI for visual inspection of DNN performance in a medical use case
    ( 2024)
    Görge, Rebekka
    ;
    ;
    Our Visual Analytics (VA) tool ScrutinAI supports human analysts to investigate interactively model performance and data sets. Model performance depends on labeling quality to a large extent. In particular in medical settings, generation of high quality labels requires in depth expert knowledge and is very costly. Often, data sets are labeled by collecting opinions of groups of experts. We use our VA tool to analyse the influence of label variations between different experts on the model performance. ScrutinAI facilitates to perform a root cause analysis that distinguishes weaknesses of deep neural network (DNN) models caused by varying or missing labeling quality from true weaknesses. We scrutinize the overall detection of intracranial hemorrhages and the more subtle differentiation between subtypes in a publicly available data set.
  • Publication
    Deep reinforcement learning in service of air traffic controllers to resolve tactical conflicts
    ( 2024)
    Papadopoulos, George
    ;
    Bastas, Alevizos
    ;
    Vouros, George A.
    ;
    Crook, Ian
    ;
    Andrienko, Natalia
    ;
    Gennady Andrienko
    ;
    Cordero, Jose Manuel
    Dense and complex air traffic requires higher levels of automation than those exhibited by tactical conflict detection and resolution (CD&R) tools that air traffic controllers (ATCOs) use today: AI tools can act on their own initiative, increasing the capacity of ATCOs to control higher volumes of traffic. However, given that the air traffic control (ATC) domain is safety critical, requires AI systems to which ATCOs are comfortable to relinquishing control, guaranteeing operational integrity and automation adoption. Two major factors towards this goal are quality of solutions and operational transparency. ResoLver, the system that this article presents, addresses these challenges using an enhanced graph convolutional reinforcement learning method operating in a multiagent setting where each agent – representing a flight – performs a CD&R task, jointly with other agents. We show that ResoLver can provide high-quality solutions with respect to stakeholders interests (air traffic controllers and airspace users), addressing also operational transparency issues, which have been validated by ATCOs in simulated real-world settings.