1.Pre-trained convolutional neural networks in the assessment of bone scans for metastasis
Vincent Peter C. Magboo ; Ma. Sheila A. Magboo
The Philippine Journal of Nuclear Medicine 2021;16(2):46-53
Background:
Numerous applications of artificial intelligence have been applied in radiological imaging ranging from
computer-aided diagnosis based on machine learning to deep learning using convolutional neural networks.
One of the nuclear medicine imaging tests being commonly performed today is bone scan. The use of deep
learning methods through convolutional neural networks in bone scintigrams has not been fully explored. Very
few studies have been published on its diagnostic capability of convolutional neural networks in assessing
osseous metastasis.
Objective:
The aim of our study is to assess the classification performance of the pre-trained convolutional neural
networks in the diagnosis of bone metastasis from whole body bone scintigrams of a local institutional dataset.
Methods:
Bone scintigrams from all types of cancer were retrospectively reviewed during the period 2019-2020 at the
University of Perpetual Help Medical Center in Las Pinas City, Metro Manila. The study was approved by the
Institutional Ethical Review Board and Technical Review Board of the medical center. Bone scan studies should
be mainly for metastasis screening. The pre-processing techniques consisting of image normalization, image
augmentation, data shuffling, and train-test split (testing at 30% and the rest (70%) was split 85% for training
and 15% for validation) were applied to image dataset. Three pre-trained architectures (ResNet50, VGG19,
DenseNet121) were applied to the processed dataset. Performance metrics such as accuracy, recall (sensitivity),
precision (positive predictive value), and F1-scores were obtained.
Results:
A total of 570 bone scan images with dimension 220 x 646 pixel sizes in .tif file format were included in this
study with 40% classified with bone metastasis while 60% were classified as without bone metastasis.
DenseNet121 yielded the highest performance metrics with an accuracy rate of 83%, 76% recall, 86% precision,
and 81% F1-score. ResNet50 and VGG19 had similar performance with each other across all metrics but
generally lower predictive capability as compared to DenseNet121.
Conclusion
A bone metastasis machine learning classification study using three pre-trained convolutional neural networks
was performed on a local medical center bone scan dataset via transfer learning. DenseNet121 generated the
highest performance metrics with 83% accuracy, 76% recall, 86% precision and 81% F1-score. Our simulation
experiments generated promising outcomes and potentially could lead to its deployment in the clinical practice
of nuclear medicine physicians. The use of deep learning techniques through convolutional neural networks has
the potential to improve diagnostic capability of nuclear medicine physicians using bone scans for the
assessment of metastasis.
Deep Learning
;
Machine Learning
2.Diagnostic performance of a computer-aided system for tuberculosis screening in two Philippine cities
Gabrielle P. Flores ; Reiner Lorenzo J. Tamayo ; Robert Neil F. Leong ; Christian Sergio M. Biglaen ; Kathleen Nicole T. Uy ; Renee Rose O. Maglente ; Marlex Jorome M. Nugui ; Jason V. Alacap
Acta Medica Philippina 2024;58(Early Access 2024):1-8
Background and Objectives:
The Philippines faces challenges in the screening of tuberculosis (TB), one of them being the shortage in the health workforce who are skilled and allowed to screen TB. Deep learning neural networks (DLNNs) have shown potential in the TB screening process utilizing chest radiographs (CXRs). However, local studies on AIbased TB screening are limited. This study evaluated qXR3.0 technology's diagnostic performance for TB screening in Filipino adults aged 15 and older. Specifically, we evaluated the specificity and sensitivity of qXR3.0 compared to radiologists' impressions and determined whether it meets the World Health Organization (WHO) standards.
Methods:
A prospective cohort design was used to perform a study on comparing screening and diagnostic accuracies of qXR3.0 and two radiologist gradings in accordance with the Standards for Reporting Diagnostic Accuracy (STARD). Subjects from two clinics in Metro Manila which had qXR 3.0 seeking consultation at the time of study were invited to participate to have CXRs and sputum collected. Radiologists' and qXR3.0 readings and impressions were compared with respect to the reference standard Xpert MTB/RiF assay. Diagnostic accuracy measures were calculated.
Results:
With 82 participants, qXR3.0 demonstrated 100% sensitivity and 72.7% specificity with respect to the
reference standard. There was a strong agreement between qXR3.0 and radiologists' readings as exhibited by
the 0.7895 (between qXR 3.0 and CXRs read by at least one radiologist), 0.9362 (qXR 3.0 and CXRs read by both
radiologists), and 0.9403 (qXR 3.0 and CXRs read as not suggestive of TB by at least one radiologist) concordance indices.
Conclusions
qXR3.0 demonstrated high sensitivity to identify presence of TB among patients, and meets the WHO standard of at least 70% specificity for detecting true TB infection. This shows an immense potential for the tool to supplement the shortage of radiologists for TB screening in the country. Future research directions may consider larger sample sizes to confirm these findings and explore the economic value of mainstream adoption of qXR 3.0 for TB screening.
Tuberculosis
;
Diagnostic Imaging
;
Deep Learning
3.The impact of anatomic racial variations on artificial intelligence analysis of Filipino retinal fundus photographs using an image-based deep learning model
Carlo A. Kasala ; Kaye Lani Rea B. Locaylocay ; Paolo S. Silva
Philippine Journal of Ophthalmology 2024;49(2):130-137
OBJECTIVES
This study evaluated the accuracy of an artificial intelligence (AI) model in identifying retinal lesions, validated its performance on a Filipino population dataset, and evaluated the impact of dataset diversity on AI analysis accuracy.
METHODSThis cross-sectional, analytical, institutional study analyzed standardized macula-centered fundus photos taken with the Zeiss Visucam®. The AI model’s output was compared with manual readings by trained retina specialists.
RESULTSA total of 215 eyes from 109 patients were included in the study. Human graders identified 109 eyes (50.7%) with retinal abnormalities. The AI model demonstrated an overall accuracy of 73.0% (95% CI 66.6% – 78.8%) in detecting abnormal retinas, with a sensitivity of 54.1% (95% CI 44.3% – 63.7%) and specificity of 92.5% (95% CI 85.7% – 96.7%).
CONCLUSIONThe availability and sources of AI training datasets can introduce biases into AI algorithms. In our dataset, racial differences in retinal morphology, such as differences in retinal pigmentation, affected the accuracy of AI image-based analysis. More diverse datasets and external validation on different populations are needed to mitigate these biases.
Human ; Artificial Intelligence ; Deep Learning
4.SPECT-MPI for Coronary Artery Disease: A deep learning approach
Vincent Peter C. Magboo ; Ma. Sheila A. Magboo
Acta Medica Philippina 2024;58(8):67-75
Background:
Worldwide, coronary artery disease (CAD) is a leading cause of mortality and morbidity and remains to be a top health priority in many countries. A non-invasive imaging modality for diagnosis of CAD such as single photon emission computed tomography-myocardial perfusion imaging (SPECT-MPI) is usually requested by cardiologists as it displays radiotracer distribution in the heart reflecting myocardial perfusion. The interpretation of SPECT-MPI is done visually by a nuclear medicine physician and is largely dependent on his clinical experience and showing significant inter-observer variability.
Objective:
The aim of the study is to apply a deep learning approach in the classification of SPECT-MPI for perfusion abnormalities using convolutional neural networks (CNN).
Methods:
A publicly available anonymized SPECT-MPI from a machine learning repository (https://www.kaggle.com/ selcankaplan/spect-mpi) was used in this study involving 192 patients who underwent stress-test-rest Tc99m MPI. An exploratory approach of CNN hyperparameter selection to search for optimum neural network model was utilized with particular focus on various dropouts (0.2, 0.5, 0.7), batch sizes (8, 16, 32, 64), and number of dense nodes (32, 64, 128, 256). The base CNN model was also compared with the commonly used pre-trained CNNs in medical images such as VGG16, InceptionV3, DenseNet121 and ResNet50. All simulations experiments were performed in Kaggle using TensorFlow 2.6.0., Keras 2.6.0, and Python language 3.7.10.
Results:
The best performing base CNN model with parameters consisting of 0.7 dropout, batch size 8, and 32 dense nodes generated the highest normalized Matthews Correlation Coefficient at 0.909 and obtained 93.75% accuracy, 96.00% sensitivity, 96.00% precision, and 96.00% F1-score. It also obtained higher classification performance as compared to the pre-trained architectures.
Conclusions
The results suggest that deep learning approaches through the use of CNN models can be deployed by nuclear medicine physicians in their clinical practice to further augment their decision skills in the interpretation of SPECT-MPI tests. These CNN models can also be used as a dependable and valid second opinion that can aid physicians as a decision-support tool as well as serve as teaching or learning materials for the less-experienced physicians particularly those still in their training career. These highlights the clinical utility of deep learning approaches through CNN models in the practice of nuclear cardiology.
Coronary Artery Disease
;
Deep Learning
5.A review of machine learning in tumor radiotherapy.
Junqian ZHANG ; Yuan ZHANG ; Yong YIN ; Jian ZHU ; Baosheng LI
Journal of Biomedical Engineering 2019;36(5):879-884
Radiotherapy is one of the main treatments for tumor with increasingly high request for technique precision and the equipment stability. Machine learning may bring radiotherapy simplicity, individualization and precision, and may improve the automatic level of planning and quality assurance. Based on the process of radiotherapy, this paper reviews the applications and researches on machine learning, with an emphasis on deep learning, and proposes the prospects in the following aspects: segmentation of normal tissue and tumor, planning, treatment delivery, quality assurance and prognosis prediction.
Deep Learning
;
Humans
;
Machine Learning
;
Neoplasms
;
radiotherapy
6.Progress in biomedical data analysis based on deep learning.
Suyi LI ; Shijie TANG ; Feng LI ; Jianzhuo QI ; Wenji XIONG
Journal of Biomedical Engineering 2020;37(2):349-357
Traditional biomedical data analysis technology faces enormous challenges in the context of the big data era. The application of deep learning technology in the field of biomedical analysis has ushered in tremendous development opportunities. In this paper, we reviewed the latest research progress of deep learning in the field of biomedical data analysis. Firstly, we introduced the deep learning method and its common framework. Then, focusing on the proposal of biomedical problems, data preprocessing method, model building method and training algorithm, we summarized the specific application of deep learning in biomedical data analysis in the past five years according to the chronological order, and emphasized the application of deep learning in medical assistant diagnosis. Finally, we gave the possible development direction of deep learning in the field of biomedical data analysis in the future.
Algorithms
;
Biomedical Technology
;
Data Analysis
;
Deep Learning
7.Application of deep learning in cancer prognosis prediction model.
Wen CHEN ; Xu WANG ; Huihong DUAN ; Xiaobing ZHANG ; Ting DONG ; Shengdong NIE
Journal of Biomedical Engineering 2020;37(5):918-929
In recent years, deep learning has provided a new method for cancer prognosis analysis. The literatures related to the application of deep learning in the prognosis of cancer are summarized and their advantages and disadvantages are analyzed, which can be provided for in-depth research. Based on this, this paper systematically reviewed the latest research progress of deep learning in the construction of cancer prognosis model, and made an analysis on the strengths and weaknesses of relevant methods. Firstly, the construction idea and performance evaluation index of deep learning cancer prognosis model were clarified. Secondly, the basic network structure was introduced, and the data type, data amount, and specific network structures and their merits and demerits were discussed. Then, the mainstream method of establishing deep learning cancer prognosis model was verified and the experimental results were analyzed. Finally, the challenges and future research directions in this field were summarized and expected. Compared with the previous models, the deep learning cancer prognosis model can better improve the prognosis prediction ability of cancer patients. In the future, we should continue to explore the research of deep learning in cancer recurrence rate, cancer treatment program and drug efficacy evaluation, and fully explore the application value and potential of deep learning in cancer prognosis model, so as to establish an efficient and accurate cancer prognosis model and realize the goal of precision medicine.
Deep Learning
;
Humans
;
Neoplasms
;
Precision Medicine
;
Prognosis
8.Diagnostic performance of a computer-aided system for tuberculosis screening in two Philippine cities.
Gabrielle P. FLORES ; Reiner Lorenzo J. TAMAYO ; Robert Neil F. LEONG ; Christian Sergio M. BIGLAEN ; Kathleen Nicole T. UY ; Renee Rose O. MAGLENTE ; Marlex Jorome M. NUGUID ; Jason V. ALACAP
Acta Medica Philippina 2025;59(2):33-40
BACKGROUND AND OBJECTIVES
The Philippines faces challenges in the screening of tuberculosis (TB), one of them being the shortage in the health workforce who are skilled and allowed to screen TB. Deep learning neural networks (DLNNs) have shown potential in the TB screening process utilizing chest radiographs (CXRs). However, local studies on AIbased TB screening are limited. This study evaluated qXR3.0 technology's diagnostic performance for TB screening in Filipino adults aged 15 and older. Specifically, we evaluated the specificity and sensitivity of qXR3.0 compared to radiologists' impressions and determined whether it meets the World Health Organization (WHO) standards.
METHODSA prospective cohort design was used to perform a study on comparing screening and diagnostic accuracies of qXR3.0 and two radiologist gradings in accordance with the Standards for Reporting Diagnostic Accuracy (STARD). Subjects from two clinics in Metro Manila which had qXR 3.0 seeking consultation at the time of study were invited to participate to have CXRs and sputum collected. Radiologists' and qXR3.0 readings and impressions were compared with respect to the reference standard Xpert MTB/RiF assay. Diagnostic accuracy measures were calculated.
RESULTSWith 82 participants, qXR3.0 demonstrated 100% sensitivity and 72.7% specificity with respect to the reference standard. There was a strong agreement between qXR3.0 and radiologists' readings as exhibited by the 0.7895 (between qXR 3.0 and CXRs read by at least one radiologist), 0.9362 (qXR 3.0 and CXRs read by both radiologists), and 0.9403 (qXR 3.0 and CXRs read as not suggestive of TB by at least one radiologist) concordance indices.
CONCLUSIONSqXR3.0 demonstrated high sensitivity to identify presence of TB among patients, and meets the WHO standard of at least 70% specificity for detecting true TB infection. This shows an immense potential for the tool to supplement the shortage of radiologists for TB screening in the country. Future research directions may consider larger sample sizes to confirm these findings and explore the economic value of mainstream adoption of qXR 3.0 for TB screening.
Human ; Tuberculosis ; Diagnostic Imaging ; Deep Learning
9.Histopathological Diagnosis System for Gastritis Using Deep Learning Algorithm.
Wei BA ; Shu-Hao WANG ; Can-Cheng LIU ; Yue-Feng WANG ; Huai-Yin SHI ; Zhi-Gang SONG
Chinese Medical Sciences Journal 2021;36(3):204-209
Objective To develope a deep learning algorithm for pathological classification of chronic gastritis and assess its performance using whole-slide images (WSIs). Methods We retrospectively collected 1,250 gastric biopsy specimens (1,128 gastritis, 122 normal mucosa) from PLA General Hospital. The deep learning algorithm based on DeepLab v3 (ResNet-50) architecture was trained and validated using 1,008 WSIs and 100 WSIs, respectively. The diagnostic performance of the algorithm was tested on an independent test set of 142 WSIs, with the pathologists' consensus diagnosis as the gold standard. Results The receiver operating characteristic (ROC) curves were generated for chronic superficial gastritis (CSuG), chronic active gastritis (CAcG), and chronic atrophic gastritis (CAtG) in the test set, respectively.The areas under the ROC curves (AUCs) of the algorithm for CSuG, CAcG, and CAtG were 0.882, 0.905 and 0.910, respectively. The sensitivity and specificity of the deep learning algorithm for the classification of CSuG, CAcG, and CAtG were 0.790 and 1.000 (accuracy 0.880), 0.985 and 0.829 (accuracy 0.901), 0.952 and 0.992 (accuracy 0.986), respectively. The overall predicted accuracy for three different types of gastritis was 0.867. By flagging the suspicious regions identified by the algorithm in WSI, a more transparent and interpretable diagnosis can be generated. Conclusion The deep learning algorithm achieved high accuracy for chronic gastritis classification using WSIs. By pre-highlighting the different gastritis regions, it might be used as an auxiliary diagnostic tool to improve the work efficiency of pathologists.
Algorithms
;
Deep Learning
;
Gastritis/diagnosis*
;
Humans
;
ROC Curve
;
Retrospective Studies
10.Machine and deep learning-based clinical characteristics and laboratory markers for the prediction of sarcopenia.
He ZHANG ; Mengting YIN ; Qianhui LIU ; Fei DING ; Lisha HOU ; Yiping DENG ; Tao CUI ; Yixian HAN ; Weiguang PANG ; Wenbin YE ; Jirong YUE ; Yong HE
Chinese Medical Journal 2023;136(8):967-973
BACKGROUND:
Sarcopenia is an age-related progressive skeletal muscle disorder involving the loss of muscle mass or strength and physiological function. Efficient and precise AI algorithms may play a significant role in the diagnosis of sarcopenia. In this study, we aimed to develop a machine learning model for sarcopenia diagnosis using clinical characteristics and laboratory indicators of aging cohorts.
METHODS:
We developed models of sarcopenia using the baseline data from the West China Health and Aging Trend (WCHAT) study. For external validation, we used the Xiamen Aging Trend (XMAT) cohort. We compared the support vector machine (SVM), random forest (RF), eXtreme Gradient Boosting (XGB), and Wide and Deep (W&D) models. The area under the receiver operating curve (AUC) and accuracy (ACC) were used to evaluate the diagnostic efficiency of the models.
RESULTS:
The WCHAT cohort, which included a total of 4057 participants for the training and testing datasets, and the XMAT cohort, which consisted of 553 participants for the external validation dataset, were enrolled in this study. Among the four models, W&D had the best performance (AUC = 0.916 ± 0.006, ACC = 0.882 ± 0.006), followed by SVM (AUC =0.907 ± 0.004, ACC = 0.877 ± 0.006), XGB (AUC = 0.877 ± 0.005, ACC = 0.868 ± 0.005), and RF (AUC = 0.843 ± 0.031, ACC = 0.836 ± 0.024) in the training dataset. Meanwhile, in the testing dataset, the diagnostic efficiency of the models from large to small was W&D (AUC = 0.881, ACC = 0.862), XGB (AUC = 0.858, ACC = 0.861), RF (AUC = 0.843, ACC = 0.836), and SVM (AUC = 0.829, ACC = 0.857). In the external validation dataset, the performance of W&D (AUC = 0.970, ACC = 0.911) was the best among the four models, followed by RF (AUC = 0.830, ACC = 0.769), SVM (AUC = 0.766, ACC = 0.738), and XGB (AUC = 0.722, ACC = 0.749).
CONCLUSIONS:
The W&D model not only had excellent diagnostic performance for sarcopenia but also showed good economic efficiency and timeliness. It could be widely used in primary health care institutions or developing areas with an aging population.
TRIAL REGISTRATION
Chictr.org, ChiCTR 1800018895.
Humans
;
Aged
;
Sarcopenia/diagnosis*
;
Deep Learning
;
Aging
;
Algorithms
;
Biomarkers