Use of Artificial Intelligence for Reducing Unnecessary Recalls at Screening Mammography: A Simulation Study

Yeon Soo KIM; Myoung-jin JANG; Su Hyun LEE; Soo-Yeon KIM; Su Min HA; Bo Ra KWON; Woo Kyung MOON; Jung Min CHANG

Return

Use of Artificial Intelligence for Reducing Unnecessary Recalls at Screening Mammography: A Simulation Study

Author: Yeon Soo KIM ¹ ; Myoung-jin JANG ; Su Hyun LEE ; Soo-Yeon KIM ; Su Min HA ; Bo Ra KWON ; Woo Kyung MOON ; Jung Min CHANG
Author Information

1. Department of Radiology, Seoul National University Hospital, Seoul, Korea
Publication Type:Original Article
From:Korean Journal of Radiology 2022;23(12):1241-1250
CountryRepublic of Korea
Language:English
Abstract: Objective:To conduct a simulation study to determine whether artificial intelligence (AI)-aided mammography reading can reduce unnecessary recalls while maintaining cancer detection ability in women recalled after mammography screening.
Materials and Methods:A retrospective reader study was performed by screening mammographies of 793 women (mean age ± standard deviation, 50 ± 9 years) recalled to obtain supplemental mammographic views regarding screening mammographydetected abnormalities between January 2016 and December 2019 at two screening centers. Initial screening mammography examinations were interpreted by three dedicated breast radiologists sequentially, case by case, with and without AI aid, in a single session. The area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and recall rate for breast cancer diagnosis were obtained and compared between the two reading modes.
Results:Fifty-four mammograms with cancer (35 invasive cancers and 19 ductal carcinomas in situ) and 739 mammograms with benign or negative findings were included. The reader-averaged AUC improved after AI aid, from 0.79 (95% confidence interval [CI], 0.74–0.85) to 0.89 (95% CI, 0.85–0.94) (p < 0.001). The reader-averaged specificities before and after AI aid were 41.9% (95% CI, 39.3%–44.5%) and 53.9% (95% CI, 50.9%–56.9%), respectively (p < 0.001). The reader-averaged sensitivity was not statistically different between AI-unaided and AI-aided readings: 89.5% (95% CI, 83.1%–95.9%) vs.92.6% (95% CI, 86.2%–99.0%) (p = 0.053), although the sensitivities of the least experienced radiologists before and after AI aid were 79.6% (43 of 54 [95% CI, 66.5%–89.4%]) and 90.7% (49 of 54 [95% CI, 79.7%–96.9%]), respectively (p = 0.031). With AI aid, the reader-averaged recall rate decreased by from 60.4% (95% CI, 57.8%–62.9%) to 49.5% (95% CI, 46.5%–52.4%) (p < 0.001).
Conclusion:AI-aided reading reduced the number of recalls and improved the diagnostic performance in our simulation using women initially recalled for supplemental mammographic views after mammography screening.