Cardiff Metropolitan University
Manuscript -clean-24.3.22-3-zeng fei_Fei Zhao.pdf (434.5 kB)

A Deep Learning Approach to Predict Conductive Hearing Loss in Otitis Media with Effusion Using Otoscopic Images

Download (434.5 kB)
journal contribution
posted on 2022-05-27, 12:40 authored by Junbo Zeng, Weibiao Kang, Suijun Chen, Yi Lin, Wenting Deng, Yajing Wang, Guisheng Chen, Kai Ma, Fei Zhao, Yefeng Zheng, Maojin Liang, Linqi Zeng, Weijie Ye, Peng Li, Yubin Chen, Guoping Chen, Jinliang Gao, Minjian Wu, Yuejiao Su, Yiqing Zheng, Yuexin Cai


Importance  Otitis media with effusion (OME) is one of the most common causes of acquired conductive hearing loss (CHL). Persistent hearing loss is associated with poor childhood speech and language development and other adverse consequence. However, to obtain accurate and reliable hearing thresholds largely requires a high degree of cooperation from the patients.

Objective  To predict CHL from otoscopic images using deep learning (DL) techniques and a logistic regression model based on tympanic membrane features.

Design, Setting, and Participants  A retrospective diagnostic/prognostic study was conducted using 2790 otoscopic images obtained from multiple centers between January 2015 and November 2020. Participants were aged between 4 and 89 years. Of 1239 participants, there were 209 ears from children and adolescents (aged 4-18 years [16.87%]), 804 ears from adults (aged 18-60 years [64.89%]), and 226 ears from older people (aged >60 years, [18.24%]). Overall, 679 ears (54.8%) were from men. The 2790 otoscopic images were randomly assigned into a training set (2232 [80%]), and validation set (558 [20%]). The DL model was developed to predict an average air-bone gap greater than 10 dB. A logistic regression model was also developed based on otoscopic features.

Main Outcomes and Measures  The performance of the DL model in predicting CHL was measured using the area under the receiver operating curve (AUC), accuracy, and F1 score (a measure of the quality of a classifier, which is the harmonic mean of precision and recall; a higher F1 score means better performance). In addition, these evaluation parameters were compared to results obtained from the logistic regression model and predictions made by three otologists.

Results  The performance of the DL model in predicting CHL showed the AUC of 0.74, accuracy of 81%, and F1 score of 0.89. This was better than the results from the logistic regression model (ie, AUC of 0.60, accuracy of 76%, and F1 score of 0.82), and much improved on the performance of the 3 otologists; accuracy of 16%, 30%, 39%, and F1 scores of 0.09, 0.18, and 0.25, respectively. Furthermore, the DL model took 2.5 seconds to predict from 205 otoscopic images, whereas the 3 otologists spent 633 seconds, 645 seconds, and 692 seconds, respectively.

Conclusions and Relevance  The model in this diagnostic/prognostic study provided greater accuracy in prediction of CHL in ears with OME than those obtained from the logistic regression model and otologists. This indicates great potential for the use of artificial intelligence tools to facilitate CHL evaluation when CHL is unable to be measured.


Published in

JAMA Otolaryngology-Head and Neck Surgery


American Medical Association


  • AM (Accepted Manuscript)


Zeng, J., Kang, W., Chen, S., Lin, Y., Deng, W., Wang, Y., Chen, G., Ma, K., Zhao, F., Zheng, Y. and Liang, M. (2022) 'A Deep Learning Approach to Predict Conductive Hearing Loss in Patients With Otitis Media With Effusion Using Otoscopic Images', JAMA Otolaryngology–Head & Neck Surgery. DOI:

Electronic ISSN


Cardiff Met Affiliation

  • Cardiff School of Sport and Health Sciences

Cardiff Met Authors

Fei Zhao

Cardiff Met Research Centre/Group

  • Speech, Hearing and Communication

Copyright Holder

  • © The Publisher


  • en

Usage metrics

    Population Risk & Healthcare - Journal Articles



    Ref. manager