TY - JOUR
T1 - Accelerating Cough-Based Algorithms for Pulmonary Tuberculosis Screening
T2 - Results From the CODA TB DREAM Challenge
AU - behalf of the Cough Diagnostic Algorithm for Tuberculosis (CODA TB) DREAM Challenge Consortium
AU - Jaganath, Devan
AU - Sieberts, Solveig K.
AU - Raberahona, Mihaja
AU - Huddart, Sophie
AU - Omberg, Larsson
AU - Rakotoarivelo, Rivo
AU - Lyimo, Issa
AU - Lweno, Omar
AU - Christopher, Devasahayam J.
AU - Nhung, Nguyen Viet
AU - Worodria, William
AU - Yu, Charles
AU - Chen, Jhih Yu
AU - Chen, Sz Hau
AU - Chen, Tsai Min
AU - Huang, Chih Han
AU - Huang, Kuei Lin
AU - Mulier, Filip
AU - Rafter, Daniel
AU - Shih, Edward S.C.
AU - Tsao, Yu
AU - Wang, Hsuan Kai
AU - Wu, Chih Hsun
AU - Bachman, Christine
AU - Burkot, Stephen
AU - Dewan, Puneet
AU - Kulhare, Sourabh
AU - Small, Peter M.
AU - Yadav, Vijay
AU - Grandjean Lapierre, Simon
AU - Theron, Grant
AU - Cattamanchi, Adithya
AU - Ahuja, Gautam
AU - Balodi, Shalini
AU - Khurdiya, Diya
AU - Kutum, Rintu
AU - Rao, Aakash M.
AU - Salampuria, Ashwin
AU - Akbarian, Sina
AU - Asgarian, Sepehr
AU - Arora, Akanksha
AU - Choudhury, Shubham
AU - Raghava, Gajendra P.S.
AU - Dong, Sherry
AU - Guan, Yuanfang
AU - Nan, Yiyang
AU - Zhang, Hanrui
AU - Gupta, Aniket
AU - Li, Tenglong
AU - Singh, Rohan
N1 - Publisher Copyright:
© 2025 The Author(s). Published by Oxford University Press on behalf of Infectious Diseases Society of America.
PY - 2025/10
Y1 - 2025/10
N2 - Background Open-access data challenges can accelerate innovation in artificial intelligence-based tools. In the Cough Diagnostic Algorithm for Tuberculosis (CODA TB) DREAM Challenge, we developed and independently validated cough sound-based artificial intelligence algorithms for tuberculosis screening. Methods We included data from 2143 adults with ≥2 weeks of cough from outpatient clinics in India, Madagascar, the Philippines, South Africa, Tanzania, Uganda, and Vietnam. A standard tuberculosis evaluation was completed, and ≥3 solicited coughs were recorded using a smartphone. We invited teams to develop models using training data to classify microbiologically confirmed tuberculosis disease using (1) cough sound features only and/or (2) cough sound features with routinely available clinical data. After 4 months, they submitted the algorithms for independent test set validation. Models were ranked by area under the receiver operating characteristic curve (AUROC) and partial AUROC (pAUROC) to achieve at least 80% sensitivity and 60% specificity. Results Eleven cough models and 6 cough-plus-clinical models were submitted. AUROCs for cough models ranged from 0.69 to 0.74, and the highest performing model achieved 55.5% specificity (95% confidence interval, 47.7%-64.2%) at 80% sensitivity. The addition of clinical data improved AUROCs (range, 0.78-0.83); 5 of the 6 models reached the target pAUROC, and the highest performing model had 73.8% specificity (95% confidence interval, 60.8%-80.0%) at 80% sensitivity. The AUROC varied by country and was higher among male and human immunodeficiency virus-negative individuals. Conclusions In a short period, an open-access data challenge facilitated the development of new cough-based tuberculosis algorithms and demonstrated potential as a tuberculosis screening tool.
AB - Background Open-access data challenges can accelerate innovation in artificial intelligence-based tools. In the Cough Diagnostic Algorithm for Tuberculosis (CODA TB) DREAM Challenge, we developed and independently validated cough sound-based artificial intelligence algorithms for tuberculosis screening. Methods We included data from 2143 adults with ≥2 weeks of cough from outpatient clinics in India, Madagascar, the Philippines, South Africa, Tanzania, Uganda, and Vietnam. A standard tuberculosis evaluation was completed, and ≥3 solicited coughs were recorded using a smartphone. We invited teams to develop models using training data to classify microbiologically confirmed tuberculosis disease using (1) cough sound features only and/or (2) cough sound features with routinely available clinical data. After 4 months, they submitted the algorithms for independent test set validation. Models were ranked by area under the receiver operating characteristic curve (AUROC) and partial AUROC (pAUROC) to achieve at least 80% sensitivity and 60% specificity. Results Eleven cough models and 6 cough-plus-clinical models were submitted. AUROCs for cough models ranged from 0.69 to 0.74, and the highest performing model achieved 55.5% specificity (95% confidence interval, 47.7%-64.2%) at 80% sensitivity. The addition of clinical data improved AUROCs (range, 0.78-0.83); 5 of the 6 models reached the target pAUROC, and the highest performing model had 73.8% specificity (95% confidence interval, 60.8%-80.0%) at 80% sensitivity. The AUROC varied by country and was higher among male and human immunodeficiency virus-negative individuals. Conclusions In a short period, an open-access data challenge facilitated the development of new cough-based tuberculosis algorithms and demonstrated potential as a tuberculosis screening tool.
KW - artificial intelligence
KW - cough
KW - data challenge
KW - diagnostics
KW - tuberculosis
UR - https://www.scopus.com/pages/publications/105018079480
U2 - 10.1093/ofid/ofaf572
DO - 10.1093/ofid/ofaf572
M3 - Article
AN - SCOPUS:105018079480
SN - 2328-8957
VL - 12
JO - Open Forum Infectious Diseases
JF - Open Forum Infectious Diseases
IS - 10
M1 - ofaf572
ER -