1.Machine learning in medicine: what clinicians should know.
Jordan Zheng TING SIM ; Qi Wei FONG ; Weimin HUANG ; Cher Heng TAN
Singapore medical journal 2023;64(2):91-97
With the advent of artificial intelligence (AI), machines are increasingly being used to complete complicated tasks, yielding remarkable results. Machine learning (ML) is the most relevant subset of AI in medicine, which will soon become an integral part of our everyday practice. Therefore, physicians should acquaint themselves with ML and AI, and their role as an enabler rather than a competitor. Herein, we introduce basic concepts and terms used in AI and ML, and aim to demystify commonly used AI/ML algorithms such as learning methods including neural networks/deep learning, decision tree and application domain in computer vision and natural language processing through specific examples. We discuss how machines are already being used to augment the physician's decision-making process, and postulate the potential impact of ML on medical practice and medical research based on its current capabilities and known limitations. Moreover, we discuss the feasibility of full machine autonomy in medicine.
Humans
;
Artificial Intelligence
;
Machine Learning
;
Algorithms
;
Neural Networks, Computer
;
Medicine
2.A multiscale feature extraction algorithm for dysarthric speech recognition.
Jianxing ZHAO ; Peiyun XUE ; Jing BAI ; Chenkang SHI ; Bo YUAN ; Tongtong SHI
Journal of Biomedical Engineering 2023;40(1):44-50
In this paper, we propose a multi-scale mel domain feature map extraction algorithm to solve the problem that the speech recognition rate of dysarthria is difficult to improve. We used the empirical mode decomposition method to decompose speech signals and extracted Fbank features and their first-order differences for each of the three effective components to construct a new feature map, which could capture details in the frequency domain. Secondly, due to the problems of effective feature loss and high computational complexity in the training process of single channel neural network, we proposed a speech recognition network model in this paper. Finally, training and decoding were performed on the public UA-Speech dataset. The experimental results showed that the accuracy of the speech recognition model of this method reached 92.77%. Therefore, the algorithm proposed in this paper can effectively improve the speech recognition rate of dysarthria.
Humans
;
Dysarthria/diagnosis*
;
Speech
;
Speech Perception
;
Algorithms
;
Neural Networks, Computer
3.Fetal electrocardiogram signal extraction and analysis method combining fast independent component analysis algorithm and convolutional neural network.
Yuyao YANG ; Jingyu HAO ; Shuicai WU
Journal of Biomedical Engineering 2023;40(1):51-59
Fetal electrocardiogram (ECG) signals provide important clinical information for early diagnosis and intervention of fetal abnormalities. In this paper, we propose a new method for fetal ECG signal extraction and analysis. Firstly, an improved fast independent component analysis method and singular value decomposition algorithm are combined to extract high-quality fetal ECG signals and solve the waveform missing problem. Secondly, a novel convolutional neural network model is applied to identify the QRS complex waves of fetal ECG signals and effectively solve the waveform overlap problem. Finally, high quality extraction of fetal ECG signals and intelligent recognition of fetal QRS complex waves are achieved. The method proposed in this paper was validated with the data from the PhysioNet computing in cardiology challenge 2013 database of the Complex Physiological Signals Research Resource Network. The results show that the average sensitivity and positive prediction values of the extraction algorithm are 98.21% and 99.52%, respectively, and the average sensitivity and positive prediction values of the QRS complex waves recognition algorithm are 94.14% and 95.80%, respectively, which are better than those of other research results. In conclusion, the algorithm and model proposed in this paper have some practical significance and may provide a theoretical basis for clinical medical decision making in the future.
Algorithms
;
Neural Networks, Computer
;
Electrocardiography
;
Databases, Factual
;
Fetus
4.Application of a light-weighted convolutional neural network for automatic recognition of coal workers' pneumoconiosis in the early stage.
Feng Tao CUI ; Yan WANG ; Xin Ping DING ; Yu Long YAO ; Bing LI ; Fu Hai SHEN
Chinese Journal of Industrial Hygiene and Occupational Diseases 2023;41(3):177-182
Objective: To construct and verify a light-weighted convolutional neural network (CNN), and explore its application value for screening the early stage (subcategory 0/1 and stage Ⅰ of pneumoconiosis) of coal workers' pneumoconiosis (CWP) from digital chest radiography (DR) . Methods: A total of 1225 DR images of coal workers who were examined at an Occupational Disease Prevention and Control Institute in Anhui Province from October 2018 to March 2021 were retrospectively collected. All DR images were collectively diagnosed by 3 radiologists with diagnostic qualifications and gave diagnostic results. There were 692 DR images with small opacity profusion 0/- or 0/0 and 533 DR images with small opacity profusion 0/1 to stage Ⅲ of pneumoconiosis. The original chest radiographs were preprocessed differently to generate four datasets, namely 16-bit grayscale original image set (Origin16), 8-bit grayscale original image set (Origin 8), 16-bit grayscale histogram equalized image set (HE16) and 8-bit grayscale histogram equalized image set (HE8). The light-weighted CNN, ShuffleNet, was applied to train the generated prediction model on the four datasets separately. The performance of the four models for pneumoconiosis prediction was evaluated on a test set containing 130 DR images using measures such as the receiver operating characteristic (ROC) curve, accuracy, sensitivity, specificity, and Youden index. The Kappa consistency test was used to compare the agreement between the model predictions and the physician diagnosed pneumoconiosis results. Results: Origin16 model achieved the highest ROC area under the curve (AUC=0.958), accuracy (92.3%), specificity (92.9%), and Youden index (0.8452) for predicting pneumoconiosis, with a sensitivity of 91.7%. And the highest consistency between identification and physician diagnosis was observed for Origin16 model (Kappa value was 0.845, 95%CI: 0.753-0.937, P<0.001). HE16 model had the highest sensitivity (98.3%) . Conclusion: The light-weighted CNN ShuffleNet model can efficiently identify the early stages of CWP, and its application in the early screening of CWP can effectively improve physicians' work efficiency.
Humans
;
Retrospective Studies
;
Anthracosis/diagnostic imaging*
;
Pneumoconiosis/diagnostic imaging*
;
Coal Mining
;
Neural Networks, Computer
;
Coal
5.Platelet RNA signature independently predicts ovarian cancer prognosis by deep learning neural network model.
Chun-Jie LIU ; Hua-Yi LI ; Yue GAO ; Gui-Yan XIE ; Jian-Hua CHI ; Gui-Ling LI ; Shao-Qing ZENG ; Xiao-Ming XIONG ; Jia-Hao LIU ; Lin-Li SHI ; Xiong LI ; Xiao-Dong CHENG ; Kun SONG ; Ding MA ; An-Yuan GUO ; Qing-Lei GAO
Protein & Cell 2023;14(8):618-622
6.Diagnosis of primary clear cell carcinoma of the liver based on Faster region-based convolutional neural network.
Bin LIU ; Jianfei LI ; Xue YANG ; Feng CHEN ; Yanyan ZHANG ; Hongjun LI
Chinese Medical Journal 2023;136(22):2706-2711
BACKGROUND:
Distinguishing between primary clear cell carcinoma of the liver (PCCCL) and common hepatocellular carcinoma (CHCC) through traditional inspection methods before the operation is difficult. This study aimed to establish a Faster region-based convolutional neural network (RCNN) model for the accurate differential diagnosis of PCCCL and CHCC.
METHODS:
In this study, we collected the data of 62 patients with PCCCL and 1079 patients with CHCC in Beijing YouAn Hospital from June 2012 to May 2020. A total of 109 patients with CHCC and 42 patients with PCCCL were randomly divided into the training validation set and the test set in a ratio of 4:1.The Faster RCNN was used for deep learning of patients' data in the training validation set, and established a convolutional neural network model to distinguish PCCCL and CHCC. The accuracy, average precision, and the recall of the model for diagnosing PCCCL and CHCC were used to evaluate the detection performance of the Faster RCNN algorithm.
RESULTS:
A total of 4392 images of 121 patients (1032 images of 33 patients with PCCCL and 3360 images of 88 patients with CHCC) were uesd in test set for deep learning and establishing the model, and 1072 images of 30 patients (320 images of nine patients with PCCCL and 752 images of 21 patients with CHCC) were used to test the model. The accuracy of the model for accurately diagnosing PCCCL and CHCC was 0.962 (95% confidence interval [CI]: 0.931-0.992). The average precision of the model for diagnosing PCCCL was 0.908 (95% CI: 0.823-0.993) and that for diagnosing CHCC was 0.907 (95% CI: 0.823-0.993). The recall of the model for diagnosing PCCCL was 0.951 (95% CI: 0.916-0.985) and that for diagnosing CHCC was 0.960 (95% CI: 0.854-0.962). The time to make a diagnosis using the model took an average of 4 s for each patient.
CONCLUSION
The Faster RCNN model can accurately distinguish PCCCL and CHCC. This model could be important for clinicians to make appropriate treatment plans for patients with PCCCL or CHCC.
Humans
;
Liver Neoplasms/pathology*
;
Retrospective Studies
;
Carcinoma, Hepatocellular/pathology*
;
Neural Networks, Computer
7.Establishment of comprehensive evaluation models of physical fitness of the elderly based on machine learning.
Xiao-Hua LIU ; Ruo-Ling ZHU ; Wei-Xin LIU ; Xiao-Li TIAN ; Lei WU
Acta Physiologica Sinica 2023;75(6):937-945
The present study aims to establish comprehensive evaluation models of physical fitness of the elderly based on machine learning, and provide an important basis to monitor the elderly's physique. Through stratified sampling, the elderly aged 60 years and above were selected from 10 communities in Nanchang City. The physical fitness of the elderly was measured by the comprehensive physical assessment scale based on our previous study. Fuzzy neural network (FNN), support vector machine (SVM) and random forest (RF) models for comprehensive physical evaluation of the elderly people in communities were constructed respectively. The accuracy, sensitivity and specificity of the comprehensive physical fitness evaluation models constructed by FNN, SVM and RF were above 0.85, 0.75 and 0.89, respectively, with the FNN model possessing the best prediction performance. FNN, RF and SVM models are valuable in the comprehensive evaluation and prediction of physical fitness, which can be used as tools to carry out physical evaluation of the elderly.
Aged
;
Humans
;
Physical Fitness
;
Neural Networks, Computer
;
Exercise
;
Machine Learning
8.An Artificial Neural Network Model Combined with Dietary Retinol Intake from Different Sources to Predict the Risk of Nonalcoholic Fatty Liver Disease.
Can LIU ; Shi Hui ZHOU ; Hong SU ; Wen Qin YANG ; Jiao LU
Biomedical and Environmental Sciences 2023;36(12):1123-1135
OBJECTIVE:
This study aimed to develop an artificial neural network (ANN) model combined with dietary retinol intake from different sources to predict the risk of non-alcoholic fatty liver disease (NAFLD) in American adults.
METHODS:
Data from the 2007 to 2014 National Health and Nutrition Examination Survey (NHANES) 2007-2014 were analyzed. Eligible subjects ( n = 6,613) were randomly divided into a training set ( n 1 = 4,609) and a validation set ( n 2 = 2,004) at a ratio of 7:3. The training set was used to identify predictors of NAFLD risk using logistic regression analysis. An ANN was established to predict the NAFLD risk using a training set. Receiver operating characteristic (ROC) curve analysis was performed to evaluate the accuracy of the model using the training and validation sets.
RESULTS:
Our study found that the odds ratios ( ORs) and 95% confidence intervals ( CIs) of NAFLD for the highest quartile of plant-derived dietary retinol intake (i.e., provitamin A carotenoids, such as β-carotene) ( OR = 0.75, 95% CI: 0.57 to 0.99) were inversely associated with NAFLD risk, compared to the lowest quartile of intake, after adjusting for potential confounders. The areas under the ROC curves were 0.874 and 0.883 for the training and validation sets, respectively. NAFLD occurs when its incidence probability is greater than 0.388.
CONCLUSION
The ANN model combined with plant-derived dietary retinol intake showed a significant effect on NAFLD. This could be applied to predict NAFLD risk in the American adult population when government departments formulate future health plans.
Adult
;
Humans
;
Vitamin A
;
Non-alcoholic Fatty Liver Disease/epidemiology*
;
Nutrition Surveys
;
Diet
;
Neural Networks, Computer
9.Prediction of pulp exposure risk of carious pulpitis based on deep learning.
Li WANG ; Fei WU ; Mo XIAO ; Yu-Xin CHEN ; Ligeng WU
West China Journal of Stomatology 2023;41(2):218-224
OBJECTIVES:
This study aims to predict the risk of deep caries exposure in radiographic images based on the convolutional neural network model, compare the prediction results of the network model with those of senior dentists, evaluate the performance of the model for teaching and training stomatological students and young dentists, and assist dentists to clarify treatment plans and conduct good doctor-patient communication before surgery.
METHODS:
A total of 206 cases of pulpitis caused by deep caries were selected from the Department of Stomatological Hospital of Tianjin Medical University from 2019 to 2022. According to the inclusion and exclusion criteria, 104 cases of pulpitis were exposed during the decaying preparation period and 102 cases of pulpitis were not exposed. The 206 radiographic images collected were randomly divided into three groups according to the proportion: 126 radiographic images in the training set, 40 radiographic images in the validation set, and 40 radiographic images in the test set. Three convolutional neural networks, visual geometry group network (VGG), residual network (ResNet), and dense convolutional network (DenseNet) were selected to analyze the rules of the radiographic images in the training set. The radiographic images of the validation set were used to adjust the super parameters of the network. Finally, 40 radiographic images of the test set were used to evaluate the performance of the three network models. A senior dentist specializing in dental pulp was selected to predict whether the deep caries of 40 radiographic images in the test set were exposed. The gold standard is whether the pulp is exposed after decaying the prepared hole during the clinical operation. The prediction effect of the three network models (VGG, ResNet, and DenseNet) and the senior dentist on the pulp exposure of 40 radiographic images in the test set were compared using receiver operating characteristic (ROC) curve, area under the ROC curve (AUC), accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score to select the best network model.
RESULTS:
The best network model was DenseNet model, with AUC of 0.97. The AUC values of the ResNet model, VGG model, and the senior dentist were 0.89, 0.78, and 0.87, respectively. Accuracy was not statistically different between the senior dentist (0.850) and the DenseNet model (0.850)(P>0.05). Kappa consistency test showed moderate reliability (Kappa=0.6>0.4, P<0.05).
CONCLUSIONS
Among the three convolutional neural network models, the DenseNet model has the best predictive effect on whether deep caries are exposed in imaging. The predictive effect of this model is equivalent to the level of senior dentists specializing in dental pulp.
Humans
;
Deep Learning
;
Neural Networks, Computer
;
Pulpitis/diagnostic imaging*
;
Reproducibility of Results
;
ROC Curve
;
Random Allocation
10.CT and MRI fusion based on generative adversarial network and convolutional neural networks under image enhancement.
Yunpeng LIU ; Jin LI ; Yu WANG ; Wenli CAI ; Fei CHEN ; Wenjie LIU ; Xianhao MAO ; Kaifeng GAN ; Renfang WANG ; Dechao SUN ; Hong QIU ; Bangquan LIU
Journal of Biomedical Engineering 2023;40(2):208-216
Aiming at the problems of missing important features, inconspicuous details and unclear textures in the fusion of multimodal medical images, this paper proposes a method of computed tomography (CT) image and magnetic resonance imaging (MRI) image fusion using generative adversarial network (GAN) and convolutional neural network (CNN) under image enhancement. The generator aimed at high-frequency feature images and used double discriminators to target the fusion images after inverse transform; Then high-frequency feature images were fused by trained GAN model, and low-frequency feature images were fused by CNN pre-training model based on transfer learning. Experimental results showed that, compared with the current advanced fusion algorithm, the proposed method had more abundant texture details and clearer contour edge information in subjective representation. In the evaluation of objective indicators, Q AB/F, information entropy (IE), spatial frequency (SF), structural similarity (SSIM), mutual information (MI) and visual information fidelity for fusion (VIFF) were 2.0%, 6.3%, 7.0%, 5.5%, 9.0% and 3.3% higher than the best test results, respectively. The fused image can be effectively applied to medical diagnosis to further improve the diagnostic efficiency.
Image Processing, Computer-Assisted/methods*
;
Neural Networks, Computer
;
Tomography, X-Ray Computed
;
Magnetic Resonance Imaging/methods*
;
Algorithms

Result Analysis
Print
Save
E-mail