1.Reproducibility and Generalizability in Radiomics Modeling: Possible Strategies in Radiologic and Statistical Perspectives
Ji Eun PARK ; Seo Young PARK ; Hwa Jung KIM ; Ho Sung KIM
Korean Journal of Radiology 2019;20(7):1124-1137
Radiomics, which involves the use of high-dimensional quantitative imaging features for predictive purposes, is a powerful tool for developing and testing medical hypotheses. Radiologic and statistical challenges in radiomics include those related to the reproducibility of imaging data, control of overfitting due to high dimensionality, and the generalizability of modeling. The aims of this review article are to clarify the distinctions between radiomics features and other omics and imaging data, to describe the challenges and potential strategies in reproducibility and feature selection, and to reveal the epidemiological background of modeling, thereby facilitating and promoting more reproducible and generalizable radiomics research.
Machine Learning
2.Machine Learning: a New Opportunity for Risk Prediction
Osung KWON ; Wonjun NA ; Young Hak KIM
Korean Circulation Journal 2020;50(1):85-87
No abstract available.
Machine Learning
3.Thyroid disorder classification using machine learning models
Vincent Peter C. Magboo ; Ma. Sheila A. Magboo
The Philippine Journal of Nuclear Medicine 2022;17(2):54-61
Introduction:
Thyroid hormones are produced by the thyroid gland and are essential for regulating the basal metabolic rate.
Abnormalities in the levels of these hormones lead to two classes of thyroid diseases – hyperthyroidism and
hypothyroidism. Detection and monitoring of these two general classes of thyroid diseases require accurate
measurement and interpretation of thyroid function tests. The clinical utility of machine learning models to
predict a class of thyroid disorders has not been fully elucidated.
Objective:
The objective of this study is to develop machine learning models that classify the type of thyroid disorder on a
publicly available thyroid disease dataset extracted from a machine learning data repository.
Methods:
Several machine learning algorithms for classifying thyroid disorders were utilized after a series of
pre-processing steps applied on the dataset.
Results:
The best performing model was obtained by with XGBoost with a 99% accuracy and showing very good recall,
precision, and F1-scores for each of the three thyroid classes. Generally, all models with the exception of Naïve
Bayes did well in predicting the negative class generating over 90% in all metrics. For predicting
hypothyroidism, XGBoost, decision tree and random forest obtained the most superior performance with
metric values ranging from 96-100%. On the other end in predicting hyperthyroidism, all models have lower
classification performance as compared to the negative and hypothyroid classes Needless to say, XGBoost and
random forest did obtain good metric values ranging from 71-89% in predicting hyperthyroid class.
Conclusion
The findings of this study were encouraging and had generated useful insights in the application and
development of faster automated models with high reliability which can be of use to clinicians in the
assessment of thyroid diseases. The early and prompt clinical assessment coupled with the integration of these
machine learning models in practice can be used to determine prompt and precise diagnosis and to formulate
personalized treatment options to ensure the best quality of care to our patients.
Machine Learning
4.Pre-trained convolutional neural networks in the assessment of bone scans for metastasis
Vincent Peter C. Magboo ; Ma. Sheila A. Magboo
The Philippine Journal of Nuclear Medicine 2021;16(2):46-53
Background:
Numerous applications of artificial intelligence have been applied in radiological imaging ranging from
computer-aided diagnosis based on machine learning to deep learning using convolutional neural networks.
One of the nuclear medicine imaging tests being commonly performed today is bone scan. The use of deep
learning methods through convolutional neural networks in bone scintigrams has not been fully explored. Very
few studies have been published on its diagnostic capability of convolutional neural networks in assessing
osseous metastasis.
Objective:
The aim of our study is to assess the classification performance of the pre-trained convolutional neural
networks in the diagnosis of bone metastasis from whole body bone scintigrams of a local institutional dataset.
Methods:
Bone scintigrams from all types of cancer were retrospectively reviewed during the period 2019-2020 at the
University of Perpetual Help Medical Center in Las Pinas City, Metro Manila. The study was approved by the
Institutional Ethical Review Board and Technical Review Board of the medical center. Bone scan studies should
be mainly for metastasis screening. The pre-processing techniques consisting of image normalization, image
augmentation, data shuffling, and train-test split (testing at 30% and the rest (70%) was split 85% for training
and 15% for validation) were applied to image dataset. Three pre-trained architectures (ResNet50, VGG19,
DenseNet121) were applied to the processed dataset. Performance metrics such as accuracy, recall (sensitivity),
precision (positive predictive value), and F1-scores were obtained.
Results:
A total of 570 bone scan images with dimension 220 x 646 pixel sizes in .tif file format were included in this
study with 40% classified with bone metastasis while 60% were classified as without bone metastasis.
DenseNet121 yielded the highest performance metrics with an accuracy rate of 83%, 76% recall, 86% precision,
and 81% F1-score. ResNet50 and VGG19 had similar performance with each other across all metrics but
generally lower predictive capability as compared to DenseNet121.
Conclusion
A bone metastasis machine learning classification study using three pre-trained convolutional neural networks
was performed on a local medical center bone scan dataset via transfer learning. DenseNet121 generated the
highest performance metrics with 83% accuracy, 76% recall, 86% precision and 81% F1-score. Our simulation
experiments generated promising outcomes and potentially could lead to its deployment in the clinical practice
of nuclear medicine physicians. The use of deep learning techniques through convolutional neural networks has
the potential to improve diagnostic capability of nuclear medicine physicians using bone scans for the
assessment of metastasis.
Deep Learning
;
Machine Learning
5.Heart Alert: A heart disease prediction system using machine learning approach and optimization techniques
Justin Allen P. Denopol ; Ma. Sheila A. Magboo ; Vincent Peter C. Magboo
Philippine Journal of Health Research and Development 2022;26(3):83-92
Background:
Cardiovascular diseases belong to the top three leading causes of mortality in the Philippines with 17.8 % of the total deaths. Lifestyle-related habits such as alcohol consumption, smoking, poor diet and nutrition, high sedentary behavior, overweight, and obesity have been increasingly implicated in the high rates of heart disease among Filipinos leading to a significant burden to the country's healthcare system. The objective of this study was to predict the presence of heart disease using various machine learning algorithms (support vector machine, naïve Bayes, random forest, logistic regression, decision tree, and adaptive boosting) evaluated on an anonymized publicly available cardiovascular disease dataset.
Methodology:
Various machine learning algorithms were applied on an anonymized publicly available
cardiovascular dataset from a machine learning data repository (IEEE Dataport). A web-based application
system named Heart Alert was developed based on the best machine learning model that would predict the risk of developing heart disease. An assessment of the effects of different optimization techniques as to the imputation methods (mean, median, mode, and multiple imputation by chained equations) and as to the feature selection method (recursive feature elimination) on the classification performance of the machine learning algorithms was made. All simulation experiments were implemented via Python 3.8 and its machine learning libraries (Scikit-learn, Keras, Tensorflow, Pandas, Matplotlib, Seaborn, NumPy).
Results:
The support vector machine without imputation and feature selection obtained the highest
performance metrics (90.2% accuracy, 87.7% sensitivity, 93.6% specificity, 94.9% precision, 91.2% F1-score and an area under the receiver operating characteristic curve of 0.902 ) and was used to implement the heart disease prediction system (Heart Alert). Following very closely were random forest with mean or median imputation and logistic regression with mode imputation, all having no feature selection which also performed well.
Conclusion
The performance of the best four machine learning models suggests that for this dataset,
imputation technique for missing values may or may not be done. Likewise, recursive feature elimination for feature selection may not apply as all variables seem to be important in heart disease prediction. An early accurate diagnosis leading to prompt intervention efforts is very crucial as it improves the patient's quality of life and diminishes the risk of developing cardiac events.
Machine Learning
;
Support Vector Machine
6.Semi-supervised Long-tail Endoscopic Image Classification.
Run-Nan CAO ; Meng-Jie FANG ; Hai-Ling LI ; Jie TIAN ; Di DONG
Chinese Medical Sciences Journal 2022;37(3):171-180
Objective To explore the semi-supervised learning (SSL) algorithm for long-tail endoscopic image classification with limited annotations. Method We explored semi-supervised long-tail endoscopic image classification in HyperKvasir, the largest gastrointestinal public dataset with 23 diverse classes. Semi-supervised learning algorithm FixMatch was applied based on consistency regularization and pseudo-labeling. After splitting the training dataset and the test dataset at a ratio of 4:1, we sampled 20%, 50%, and 100% labeled training data to test the classification with limited annotations. Results The classification performance was evaluated by micro-average and macro-average evaluation metrics, with the Mathews correlation coefficient (MCC) as the overall evaluation. SSL algorithm improved the classification performance, with MCC increasing from 0.8761 to 0.8850, from 0.8983 to 0.8994, and from 0.9075 to 0.9095 with 20%, 50%, and 100% ratio of labeled training data, respectively. With a 20% ratio of labeled training data, SSL improved both the micro-average and macro-average classification performance; while for the ratio of 50% and 100%, SSL improved the micro-average performance but hurt macro-average performance. Through analyzing the confusion matrix and labeling bias in each class, we found that the pseudo-based SSL algorithm exacerbated the classifier's preference for the head class, resulting in improved performance in the head class and degenerated performance in the tail class. Conclusion SSL can improve the classification performance for semi-supervised long-tail endoscopic image classification, especially when the labeled data is extremely limited, which may benefit the building of assisted diagnosis systems for low-volume hospitals. However, the pseudo-labeling strategy may amplify the effect of class imbalance, which hurts the classification performance for the tail class.
Supervised Machine Learning
;
Algorithms
7.Population Pharmacokinetic and Pharmacodynamic Models of Propofol in Healthy Volunteers using NONMEM and Machine Learning Methods.
Yoo Mi KIM ; Sung Hong KANG ; Il Su PARK ; Gyu Jeong NOH
Journal of Korean Society of Medical Informatics 2008;14(2):147-159
OBJECTIVES: The primary objective of this study is to compare model performance of machine learning methods with that of a previous study in which a nonlinear mixed effects model was created using NONMEM(R) for the pharmacokinetic and pharmacodynamic data for propofol. The secondary objective was to evaluate if a pharmacodynamic model describing the relationship between the dose of propofol and bispectral index (BIS) outperform that describing the relationship between a pharmacokinetic model derived-predicted concentrations of propofol and BIS. METHODS: Data were collected during a study involving the infusion of propofol into healthy volunteers. Pharmacokinetic and pharmacodynamic models were constructed using artificial neural networks (ANNs), support vector machines (SVMs), and multi-method ensembles and were compared with the nonlinear mixed effects method as implemented by NONMEM(R). Model performance was assessed by goodness-of-fit statistics, paired t-tests between predicted and observed values for each model and scatterplots. RESULTS: In pharmacokinetic analysis, ensemble I, the mean of ANN and NONMEM(R) predictions, achieved minimal error and the highest correlation coefficient. SVM produced the highest error and the lowest correlation coefficient. In pharmacodynamic analysis, ANN exhibited the best performance. An ANNModel describing the relationship between the dose of propofol and BIS was not inferior to an ANN model describing the relationship between predicted concentrations of propofol derived from an ANN pharmacokinetic model and BIS. CONCLUSIONS: In pharmacokinetic analysis, ensemble combined with ANN achieved slightly better performance than NONMEM(R). The relationship between the dose of propofol and BIS can be predicted without considering pharmacokinetics of propofol.
Machine Learning
;
Propofol
;
Support Vector Machine
8.Major depressive disorder prediction using data science
Vincent Peter C. Magboo ; Ma. Sheila A. Magboo
Philippine Journal of Health Research and Development 2022;26(3):41-50
Background:
Major depressive disorder is a mood disorder that has affected many people worldwide. It is characterized by persistently low or depressed mood, anhedonia or decreased interest in pleasurable activities, feelings of guilt or worthlessness, lack of energy, poor concentration, appetite changes, psychomotor retardation or agitation, sleep disturbances, or suicidal thoughts.
Objective:
The objective of the study was to predict the presence of major depressive disorder using a variety of machine learning classification algorithms (logistic regression, Naive Bayes, support vector machine, random forest, adaptive boosting, and extreme gradient boosting) on a publicly available depression dataset.
Methodology:
After data pre-processing, several experiments were performed to assess the recursive feature elimination with cross validation as a feature selection method and synthetic minority over-sampling technique to address dataset imbalance. Several machine learning algorithms were applied on an anonymized publicly available depression dataset. Feature importance of the top performing models were also generated. All simulation experiments were implemented via Python 3.8 and its machine learning libraries (Scikit-learn, Keras, Tensorflow, Pandas, Matplotlib, Seaborn, NumPy).
Results:
The top performing model was obtained by logistic regression with excellent performance metrics (91% accuracy, 93% sensitivity, 85% specificity, 93% recall, 93% F1-score, and 0.78 Matthews correlation coefficient). Feature importance scores of the most relevant attribute were also generated for the best model.
Conclusion
The findings suggest the utility of data science techniques powered by machine learning models to make a diagnosis of major depressive disorders with acceptable results. The potential deployment of these machine learning models in clinical practice can further enhance the diagnostic acumen of health professionals. Using data analytics and machine learning, data scientists can have a better understanding of mental health illness contributing to prompt and improved diagnosis thereby leading to the institution of early intervention and medical treatments ensuring the best quality of care for our patients.
Depressive Disorder, Major
;
Machine Learning
9.A review of machine learning in tumor radiotherapy.
Junqian ZHANG ; Yuan ZHANG ; Yong YIN ; Jian ZHU ; Baosheng LI
Journal of Biomedical Engineering 2019;36(5):879-884
Radiotherapy is one of the main treatments for tumor with increasingly high request for technique precision and the equipment stability. Machine learning may bring radiotherapy simplicity, individualization and precision, and may improve the automatic level of planning and quality assurance. Based on the process of radiotherapy, this paper reviews the applications and researches on machine learning, with an emphasis on deep learning, and proposes the prospects in the following aspects: segmentation of normal tissue and tumor, planning, treatment delivery, quality assurance and prognosis prediction.
Deep Learning
;
Humans
;
Machine Learning
;
Neoplasms
;
radiotherapy
10.Application of Deep Learning System into the Development of Communication Device for Quadriplegic Patient
Jung Hwan LEE ; Taewoo KANG ; Byung Kwan CHOI ; In Ho HAN ; Byung Chul KIM ; Jung Hoon RO
Korean Journal of Neurotrauma 2019;15(2):88-94
OBJECTIVE: In general, quadriplegic patients use their voices to call the caregiver. However, severe quadriplegic patients are in a state of tracheostomy, and cannot generate a voice. These patients require other communication tools to call caregivers. Recently, monitoring of eye status using artificial intelligence (AI) has been widely used in various fields. We made eye status monitoring system using deep learning, and developed a communication system for quadriplegic patients can call the caregiver. METHODS: The communication system consists of 3 programs. The first program was developed for automatic capturing of eye images from the face using a webcam. It continuously captured and stored 15 eye images per second. Secondly, the captured eye images were evaluated for open or closed status by deep learning, which is a type of AI. Google TensorFlow was used as a machine learning tool or library for convolutional neural network. A total of 18,000 images were used to train deep learning system. Finally, the program was developed to utter a sound when the left eye was closed for 3 seconds. RESULTS: The test accuracy of eye status was 98.7%. In practice, when the quadriplegic patient looked at the webcam and closed his left eye for 3 seconds, the sound for calling a caregiver was generated. CONCLUSION: Our eye status detection software using AI is very accurate, and the calling system for the quadriplegic patient was satisfactory.
Artificial Intelligence
;
Caregivers
;
Humans
;
Learning
;
Machine Learning
;
Quadriplegia
;
Tracheostomy
;
Unsupervised Machine Learning
;
Voice