1.Origin identification of Poria cocos based on hyperspectral imaging technology.
Xue SUN ; Deng-Ting ZHANG ; Hui WANG ; Cong ZHOU ; Jian YANG ; Dai-Yin PENG ; Xiao-Bo ZHANG
China Journal of Chinese Materia Medica 2023;48(16):4337-4346
To realize the non-destructive and rapid origin discrimination of Poria cocos in batches, this study established the P. cocos origin recognition model based on hyperspectral imaging combined with machine learning. P. cocos samples from Anhui, Fujian, Guangxi, Hubei, Hunan, Henan and Yunnan were used as the research objects. Hyperspectral data were collected in the visible and near infrared band(V-band, 410-990 nm) and shortwave infrared band(S-band, 950-2 500 nm). The original spectral data were divided into S-band, V-band and full-band. With the original data(RD) of different bands, multiplicative scatter correction(MSC), standard normal variation(SNV), S-G smoothing(SGS), first derivative(FD), second derivative(SD) and other pretreatments were carried out. Then the data were classified according to three different types of producing areas: province, county and batch. The origin identification model was established by partial least squares discriminant analysis(PLS-DA) and linear support vector machine(LinearSVC). Finally, confusion matrix was employed to evaluate the optimal model, with F1 score as the evaluation standard. The results revealed that the origin identification model established by FD combined with LinearSVC had the highest prediction accuracy in full-band range classified by province, V-band range by county and full-band range by batch, which were 99.28%, 98.55% and 97.45%, respectively, and the overall F1 scores of these three models were 99.16%, 98.59% and 97.58%, respectively, indicating excellent performance of these models. Therefore, hyperspectral imaging combined with LinearSVC can realize the non-destructive, accurate and rapid identification of P. cocos from different producing areas in batches, which is conducive to the directional research and production of P. cocos.
Hyperspectral Imaging
;
Wolfiporia
;
China
;
Least-Squares Analysis
;
Support Vector Machine
2.Detection method of early heart valve diseases based on heart sound features.
Chengfa SUN ; Xinpei WANG ; Changchun LIU
Journal of Biomedical Engineering 2023;40(6):1160-1167
Heart valve disease (HVD) is one of the common cardiovascular diseases. Heart sound is an important physiological signal for diagnosing HVDs. This paper proposed a model based on combination of basic component features and envelope autocorrelation features to detect early HVDs. Initially, heart sound signals lasting 5 minutes were denoised by empirical mode decomposition (EMD) algorithm and segmented. Then the basic component features and envelope autocorrelation features of heart sound segments were extracted to construct heart sound feature set. Then the max-relevance and min-redundancy (MRMR) algorithm was utilized to select the optimal mixed feature subset. Finally, decision tree, support vector machine (SVM) and k-nearest neighbor (KNN) classifiers were trained to detect the early HVDs from the normal heart sounds and obtained the best accuracy of 99.9% in clinical database. Normal valve, abnormal semilunar valve and abnormal atrioventricular valve heart sounds were classified and the best accuracy was 99.8%. Moreover, normal valve, single-valve abnormal and multi-valve abnormal heart sounds were classified and the best accuracy was 98.2%. In public database, this method also obtained the good overall accuracy. The result demonstrated this proposed method had important value for the clinical diagnosis of early HVDs.
Humans
;
Heart Sounds
;
Heart Valve Diseases/diagnosis*
;
Algorithms
;
Support Vector Machine
;
Signal Processing, Computer-Assisted
3.Research on eye movement data classification using support vector machine with improved whale optimization algorithm.
Yinhong SHEN ; Chang ZHANG ; Lin YANG ; Yuanyuan LI ; Xiujuan ZHENG
Journal of Biomedical Engineering 2023;40(2):335-342
When performing eye movement pattern classification for different tasks, support vector machines are greatly affected by parameters. To address this problem, we propose an algorithm based on the improved whale algorithm to optimize support vector machines to enhance the performance of eye movement data classification. According to the characteristics of eye movement data, this study first extracts 57 features related to fixation and saccade, then uses the ReliefF algorithm for feature selection. To address the problems of low convergence accuracy and easy falling into local minima of the whale algorithm, we introduce inertia weights to balance local search and global search to accelerate the convergence speed of the algorithm and also use the differential variation strategy to increase individual diversity to jump out of local optimum. In this paper, experiments are conducted on eight test functions, and the results show that the improved whale algorithm has the best convergence accuracy and convergence speed. Finally, this paper applies the optimized support vector machine model of the improved whale algorithm to the task of classifying eye movement data in autism, and the experimental results on the public dataset show that the accuracy of the eye movement data classification of this paper is greatly improved compared with that of the traditional support vector machine method. Compared with the standard whale algorithm and other optimization algorithms, the optimized model proposed in this paper has higher recognition accuracy and provides a new idea and method for eye movement pattern recognition. In the future, eye movement data can be obtained by combining it with eye trackers to assist in medical diagnosis.
Animals
;
Support Vector Machine
;
Whales
;
Eye Movements
;
Algorithms
4.An Atrial Fibrillation Classification Method Study Based on BP Neural Network and SVM.
Chenqin LIU ; Gaozang LIN ; Jingjing ZHOU ; Jilun YE ; Xu ZHANG
Chinese Journal of Medical Instrumentation 2023;47(3):258-263
Atrial fibrillation is a common arrhythmia, and its diagnosis is interfered by many factors. In order to achieve applicability in diagnosis and improve the level of automatic analysis of atrial fibrillation to the level of experts, the automatic detection of atrial fibrillation is very important. This study proposes an automatic detection algorithm for atrial fibrillation based on BP neural network (back propagation network) and support vector machine (SVM). The electrocardiogram (ECG) segments in the MIT-BIH atrial fibrillation database are divided into 10, 32, 64, and 128 heartbeats, respectively, and the Lorentz value, Shannon entropy, K-S test value and exponential moving average value are calculated. These four characteristic parameters are used as the input of SVM and BP neural network for classification and testing, and the label given by experts in the MIT-BIH atrial fibrillation database is used as the reference output. Among them, the use of atrial fibrillation in the MIT-BIH database, the first 18 cases of data are used as the training set, and the last 7 cases of data are used as the test set. The results show that the accuracy rate of 92% is obtained in the classification of 10 heartbeats, and the accuracy rate of 98% is obtained in the latter three categories. The sensitivity and specificity are both above 97.7%, which has certain applicability. Further validation and improvement in clinical ECG data will be done in next study.
Humans
;
Atrial Fibrillation/diagnosis*
;
Support Vector Machine
;
Heart Rate
;
Algorithms
;
Neural Networks, Computer
;
Electrocardiography
5.A preliminary prediction model of depression based on whole blood cell count by machine learning method.
Jing YAN ; Xin Yuan LI ; Yu Lan GENG ; Yu Fang LIANG ; Chao CHEN ; Ze Wen HAN ; Rui ZHOU
Chinese Journal of Preventive Medicine 2023;57(11):1862-1868
This study used machine learning techniques combined with routine blood cell analysis parameters to build preliminary prediction models, helping differentiate patients with depression from healthy controls, or patients with anxiety. A multicenter study was performed by collecting blood cell analysis data of Beijing Chaoyang Hospital and the First Hospital of Hebei Medical University from 2020 to 2021. Machine learning techniques, including support vector machine, decision tree, naïve Bayes, random forest and multi-layer perceptron were explored to establish a prediction model of depression. The results showed that based on the blood cell analysis results of healthy controls and depression group, the accuracy of prediction model reached as high as 0.99, F1 was 0.975. Receiver operating characteristic curve area and average accuracy were 0.985 and 0.967, respectively. Platelet parameters contributed mostly to depression prediction model. While, to random forest differential diagnosis model based on the data from depression and anxiety groups, prediction accuracy reached 0.68 and AUC 0.622. Age, platelet parameters, and average volume of red blood cells contributed the most to the model. In conclusion, the study researched on the prediction model of depression by exploring blood cell analysis parameters, revealing that machine learning models were more objective in the evaluation of mental illness.
Humans
;
Depression
;
Bayes Theorem
;
Machine Learning
;
Support Vector Machine
;
Blood Cell Count
6.A preliminary prediction model of depression based on whole blood cell count by machine learning method.
Jing YAN ; Xin Yuan LI ; Yu Lan GENG ; Yu Fang LIANG ; Chao CHEN ; Ze Wen HAN ; Rui ZHOU
Chinese Journal of Preventive Medicine 2023;57(11):1862-1868
This study used machine learning techniques combined with routine blood cell analysis parameters to build preliminary prediction models, helping differentiate patients with depression from healthy controls, or patients with anxiety. A multicenter study was performed by collecting blood cell analysis data of Beijing Chaoyang Hospital and the First Hospital of Hebei Medical University from 2020 to 2021. Machine learning techniques, including support vector machine, decision tree, naïve Bayes, random forest and multi-layer perceptron were explored to establish a prediction model of depression. The results showed that based on the blood cell analysis results of healthy controls and depression group, the accuracy of prediction model reached as high as 0.99, F1 was 0.975. Receiver operating characteristic curve area and average accuracy were 0.985 and 0.967, respectively. Platelet parameters contributed mostly to depression prediction model. While, to random forest differential diagnosis model based on the data from depression and anxiety groups, prediction accuracy reached 0.68 and AUC 0.622. Age, platelet parameters, and average volume of red blood cells contributed the most to the model. In conclusion, the study researched on the prediction model of depression by exploring blood cell analysis parameters, revealing that machine learning models were more objective in the evaluation of mental illness.
Humans
;
Depression
;
Bayes Theorem
;
Machine Learning
;
Support Vector Machine
;
Blood Cell Count
7.A pace recognition method for exoskeleton wearers based on support vector machine-hidden Markov model.
Dong HU ; Zuojun LIU ; Lingling CHEN ; Qian WANG
Journal of Biomedical Engineering 2022;39(1):84-91
In order to improve the motion fluency and coordination of lower extremity exoskeleton robots and wearers, a pace recognition method of exoskeleton wearer is proposed base on inertial sensors. Firstly, the triaxial acceleration and triaxial angular velocity signals at the thigh and calf were collected by inertial sensors. Then the signal segment of 0.5 seconds before the current time was extracted by the time window method. And the Fourier transform coefficients in the frequency domain signal were used as eigenvalues. Then the support vector machine (SVM) and hidden Markov model (HMM) were combined as a classification model, which was trained and tested for pace recognition. Finally, the pace change rule and the human-machine interaction force were combined in this model and the current pace was predicted by the model. The experimental results showed that the pace intention of the lower extremity exoskeleton wearer could be effectively identified by the method proposed in this article. And the recognition rate of the seven pace patterns could reach 92.14%. It provides a new way for the smooth control of the exoskeleton.
Algorithms
;
Exoskeleton Device
;
Humans
;
Lower Extremity
;
Motion
;
Support Vector Machine
8.A heart sound classification method based on complete ensemble empirical modal decomposition with adaptive noise permutation entropy and support vector machine.
Meijun LIU ; Quanyu WU ; Sheng DING ; Lingjiao PAN ; Xiaojie LIU
Journal of Biomedical Engineering 2022;39(2):311-319
Heart sound signal is a kind of physiological signal with nonlinear and nonstationary features. In order to improve the accuracy and efficiency of the phonocardiogram (PCG) classification, a new method was proposed by means of support vector machine (SVM) in which the complete ensemble empirical modal decomposition with adaptive noise (CEEMDAN) permutation entropy was as the eigenvector of heart sound signal. Firstly, the PCG was decomposed by CEEMDAN into a number of intrinsic mode functions (IMFs) from high to low frequency. Secondly, the IMFs were sifted according to the correlation coefficient, energy factor and signal-to-noise ratio. Then the instantaneous frequency was extracted by Hilbert transform, and its permutation entropy was constituted into eigenvector. Finally, the accuracy of the method was verified by using a hundred PCG samples selected from the 2016 PhysioNet/CinC Challenge. The results showed that the accuracy rate of the proposed method could reach up to 87%. In comparison with the traditional EMD and EEMD permutation entropy methods, the accuracy rate was increased by 18%-24%, which demonstrates the efficiency of the proposed method.
Entropy
;
Heart Sounds
;
Signal Processing, Computer-Assisted
;
Signal-To-Noise Ratio
;
Support Vector Machine
9.Pelvic Injury Discriminative Model Based on Data Mining Algorithm.
Fei-Xiang WANG ; Rui JI ; Lu-Ming ZHANG ; Peng WANG ; Tai-Ang LIU ; Lu-Jie SONG ; Mao-Wen WANG ; Zhi-Lu ZHOU ; Hong-Xia HAO ; Wen-Tao XIA
Journal of Forensic Medicine 2022;38(3):350-354
OBJECTIVES:
To reduce the dimension of characteristic information extracted from pelvic CT images by using principal component analysis (PCA) and partial least squares (PLS) methods. To establish a support vector machine (SVM) classification and identification model to identify if there is pelvic injury by the reduced dimension data and evaluate the feasibility of its application.
METHODS:
Eighty percent of 146 normal and injured pelvic CT images were randomly selected as training set for model fitting, and the remaining 20% was used as testing set to verify the accuracy of the test, respectively. Through CT image input, preprocessing, feature extraction, feature information dimension reduction, feature selection, parameter selection, model establishment and model comparison, a discriminative model of pelvic injury was established.
RESULTS:
The PLS dimension reduction method was better than the PCA method and the SVM model was better than the naive Bayesian classifier (NBC) model. The accuracy of the modeling set, leave-one-out cross validation and testing set of the SVM classification model based on 12 PLS factors was 100%, 100% and 93.33%, respectively.
CONCLUSIONS
In the evaluation of pelvic injury, the pelvic injury data mining model based on CT images reaches high accuracy, which lays a foundation for automatic and rapid identification of pelvic injuries.
Algorithms
;
Bayes Theorem
;
Data Mining
;
Least-Squares Analysis
;
Support Vector Machine
10.Rapid identification of geographic origins of Zingiberis Rhizoma by NIRS combined with chemometrics and machine learning algorithms.
Dai-Xin YU ; Sheng GUO ; Xia ZHANG ; Hui YAN ; Zhen-Yu ZHANG ; Hai-Yang LI ; Jian YANG ; Jin-Ao DUAN
China Journal of Chinese Materia Medica 2022;47(17):4583-4592
In this study, 280 batches of Zingiberis Rhizoma samples from nine producing areas were analyzed to obtain infrared spectral information based on near-infrared spectroscopy(NIRS). Pluralistic chemometrics such as principal component analysis(PCA), partial least squares-discriminant analysis(PLS-DA), orthogonal partial least squares-discriminant analysis(OPLS-DA), K-nearest neighbors(KNN), support vector machine(SVM), random forest(RF), artificial neural network(ANN), and gradient boosting(GB) were applied for tracing of origins. The results showed that the discriminative accuracy of the spectral preprocessing by standard normal variate transformation coupled with the first derivative was 93.9%, which could be used for the construction of the discrimination model. PCA and PLS-DA score plots showed that samples from Shandong, Sichuan, Yunnan, and Guizhou could be effectively distinguished, but the remaining samples were partially overlapped. As revealed by the analysis results by machine learning algorithms, the AUC values of KNN, SVM, RF, ANN, and GB algorithms were 0.96, 0.99, 0.99, 0.99, and 0.98, respectively, with overall prediction accuracies of 83.3%, 89.3%, 90.5%, 91.7%, and 89.3%. It indicated that the developed model was reliable and the machine learning algorithm combined with NIRS for origin identification was sufficiently feasible. OPLS-DA showed that Zingiberis Rhizoma from Sichuan(genuine producing areas) could be significantly distinguished from other regions, with good discriminative accuracy, suggesting that the NIRS established in this study combined with chemometrics can be used for the identification of Zingiberis Rhizoma from Sichuan. This study established a rapid and nondestructive identification and reliable data analysis method for origin identification of Zingiberis Rhizoma, which is expected to provide a new idea for the origin tracing of Chinese medicinal materials.
Algorithms
;
Chemometrics
;
China
;
Ginger
;
Least-Squares Analysis
;
Plant Extracts
;
Principal Component Analysis
;
Support Vector Machine

Result Analysis
Print
Save
E-mail