Publication:
Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection

cris.virtual.departmentFraunhofer-Institut für Integrierte Schaltungen IIS
cris.virtual.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
cris.virtualsource.department00dfa1aa-a5cf-4385-a10c-a81c6b2084fa
cris.virtualsource.orcid00dfa1aa-a5cf-4385-a10c-a81c6b2084fa
crisou.acronymIIS
dc.contributor.authorBhattacharya, Debarpan
dc.contributor.authorSharma, Neeraj Kumar
dc.contributor.authorDutta, Debottam
dc.contributor.authorChetupalli, Srikanth Raj
dc.contributor.authorMote, Pravin
dc.contributor.authorGanapathy, Sriram
dc.contributor.authorChandrakiran, C.
dc.contributor.authorNori, Sahiti
dc.contributor.authorSuhail, K.K.
dc.contributor.authorGonuguntla, Sadhana
dc.contributor.authorMurali, Alagesan
dc.date.accessioned2023-08-14T14:13:26Z
dc.date.available2023-08-14T14:13:26Z
dc.date.issued2023
dc.description.abstractThis paper presents the Coswara dataset, a dataset containing diverse set of respiratory sounds and rich meta-data, recorded between April-2020 and February-2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds contained nine sound categories associated with variants of breathing, cough and speech. The rich metadata contained demographic information associated with age, gender and geographic location, as well as the health information relating to the symptoms, pre-existing respiratory ailments, comorbidity and SARS-CoV-2 test status. Our study is the first of its kind to manually annotate the audio quality of the entire dataset (amounting to 65 hours) through manual listening. The paper summarizes the data collection procedure, demographic, symptoms and audio data information. A COVID-19 classifier based on bi-directional long short-term (BLSTM) architecture, is trained and evaluated on the different population sub-groups contained in the dataset to understand the bias/fairness of the model. This enabled the analysis of the impact of gender, geographic location, date of recording, and language proficiency on the COVID-19 detection performance.
dc.description.issue1
dc.description.volume10
dc.identifier.doi10.1038/s41597-023-02266-0
dc.identifier.pmid37349364
dc.identifier.scopus2-s2.0-85162783419
dc.identifier.urihttps://publica.fraunhofer.de/handle/publica/448175
dc.language.isoen
dc.relation.ispartofScientific data
dc.relation.issn#PLACEHOLDER_PARENT_METADATA_VALUE#
dc.titleCoswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection
dc.typejournal article
dcterms.bibliographicCitation.articlenumber397
dspace.entity.typePublication
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliationFraunhofer-Institut für Integrierte Schaltungen IIS
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
oairecerif.author.affiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid0000-0002-3392-8898
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid0000-0002-5779-9066
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid0000-0002-3845-7984
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.scopus-author-id57313248700
person.identifier.scopus-author-id57917222800
person.identifier.scopus-author-id57226881835
person.identifier.scopus-author-id56304173600
person.identifier.scopus-author-id57313022900
person.identifier.scopus-author-id23392746400
person.identifier.scopus-author-id35932031000
person.identifier.scopus-author-id57762425900
person.identifier.scopus-author-id57763373800
person.identifier.scopus-author-id57761727500
person.identifier.scopus-author-id35389358600
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.author.alternativeaffiliation#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.bestOA.landinghttps://doi.org/10.1038/s41597-023-02266-0
publica.bestOA.pdfhttps://www.nature.com/articles/s41597-023-02266-0.pdf
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.correspondingtrue
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.contributor.corresponding#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.date.scupdated2025-02-25
publica.description.pagecount11 S.
publica.fhg.instituteFraunhofer-Institut für Integrierte Schaltungen IIS
publica.fhg.institute-controller-groupabd8e97b-9ee5-4b96-a69c-4f8acaf3e840
publica.fhg.location#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.journal.publisher#PLACEHOLDER_PARENT_METADATA_VALUE#
publica.rights.oaOpen Access
publica.rights.oaStatusgold
publica.rights.oaUnpaywallTrue
publica.rights.timestamp2026-04-03 16:47:38.679807

Files

Collections