2.Research of bleeding volume and method in blood-letting acupuncture therapy based on data mining.
Xin LIU ; Chun-Sheng JIA ; Jian-Ling WANG ; Yu-Zhu DU ; Xiao-Xu ZHANG ; Jing SHI ; Xiao-Feng LI ; Yan-Hui SUN ; Shen ZHANG ; Xuan-Ping ZHANG ; Wei-Juan GANG
Chinese Acupuncture & Moxibustion 2014;34(3):257-260
Through computer-based technology and data mining method, with treatment in cases of bloodletting acupuncture therapy in collected literature as sample data, the association rule in data mining was applied. According to self-built database platform, the data was input, arranged and summarized, and eventually required data was acquired to perform the data mining of bleeding volume and method in blood-letting acupuncture therapy, which summarized its application rules and clinical values to provide better guide for clinical practice. There were 9 kinds of blood-letting tools in the literature, in which the frequency of three-edge needle was the highest, accounting for 84.4% (1239/1468). The bleeding volume was classified into six levels, in which less volume (less than 0.1 mL) had the highest frequency (401 times). According to the results of the data mining, blood-letting acupuncture therapy was widely applied in clinical practice of acupuncture, in which use of three-edge needle and less volume (less than 0.1 mL) of blood were the most common, however, there was no central tendency in general.
Acupuncture Points
;
Acupuncture Therapy
;
methods
;
statistics & numerical data
;
Bloodletting
;
methods
;
statistics & numerical data
;
Data Mining
;
Databases, Factual
;
Humans
4.Data Mining for High Dimensional Data in Drug Discovery and Development.
Kwan R LEE ; Daniel C PARK ; Xiwu LIN ; Sergio ESLAVA
Genomics & Informatics 2003;1(2):65-74
Data mining differs primarily from traditional data analysis on an important dimension, namely the scale of the data. That is the reason why not only statistical but also computer science principles are needed to extract information from large data sets. In this paper we briefly review data mining, its characteristics, typical data mining algorithms, and potential and ongoing applications of data mining at biopharmaceutical industries. The distinguishing characteristics of data mining lie in its understandability, scalability, its problem driven nature, and its analysis of retrospective or observational data in contrast to experimentally designed data. At a high level one can identify three types of problems for which data mining is useful: description, prediction and search. Brief review of data mining algorithms include decision trees and rules, nonlinear classification methods, memory-based methods, model-based clustering, and graphical dependency models. Application areas covered are discovery compound libraries, clinical trial and disease management data, genomics and proteomics, structural databases for candidate drug compounds, and other applications of pharmaceutical relevance.
Classification
;
Data Mining*
;
Dataset
;
Decision Trees
;
Disease Management
;
Drug Discovery*
;
Genomics
;
Proteomics
;
Retrospective Studies
;
Statistics as Topic
6.Analysis of medication regularity of traditional Chinese medicine prescriptions for gastropyretic excessiveness diabetes based on data mining.
Ye-Ran WANG ; Yang ZHANG ; Qi-Bing QIN ; Ping WANG ; Long TAN
China Journal of Chinese Materia Medica 2020;45(1):196-201
To analyze the medication regularity of traditional Chinese medicine(TCM) prescriptions for gastropyretic excessiveness diabetes recorded in Chinese Medicine Prescriptions Dictionary. A total of 103 eligible prescriptions were input into the system platform, and the Apriori algorithm was used to analyze their medication regularity. The 103 prescriptions for gastropyretic excessiveness diabetes were selected from the system, and 29 herb medicines were found with frequency of usage more than 8. Totally 33 commonly used herbal pairs(support degree≥10), twenty-three 3-herb core combinations(support degree≥8, confidence values≥0.5), and twenty-one 4-herb core combinations(confidence values≥0.5) were discovered after the medication regularity analysis by Apriori algorithm. The herbal medicine combinations with the highest correlation degree were discovered after the association rule analysis on the 103 prescriptions(support degree≥10, confidence values≥0.5). The four properties, five tastes, channel distributions and frequency of dose of the 103 prescriptions were also obtained after the corresponding analysis. According to the analysis and summary of the above data, the combination of Trichosanthis Radix, Anemarrhenae Rhizoma, Coptidis Rhizoma and Ophiopogonis Radix could reflect the medication regularity of TCM prescriptions for gastropyretic excessiveness diabetes to a certain degree, which is of great significance in guiding value in clinic.
Data Mining
;
Diabetes Mellitus/drug therapy*
;
Drug Prescriptions/statistics & numerical data*
;
Drugs, Chinese Herbal/administration & dosage*
;
Humans
;
Medicine, Chinese Traditional
7.Comorbidity Study on Type 2 Diabetes Mellitus Using Data Mining.
Hye Soon KIM ; A Mi SHIN ; Mi Kyung KIM ; Yoon Nyun KIM
The Korean Journal of Internal Medicine 2012;27(2):197-202
BACKGROUND/AIMS: The aim of this study was to analyze comorbidity in patients with type 2 diabetes mellitus (T2DM) by using association rule mining (ARM). METHODS: We used data from patients who visited Keimyung University Dongsan Medical Center from 1996 to 2007. Of 411,414 total patients, T2DM was present in 20,314. The Dx Analyze Tool was developed for data cleansing and data mart construction, and to reveal associations of comorbidity. RESULTS: Eighteen associations reached threshold (support, > or = 3%; confidence, > or = 5%). The highest association was found between T2DM and essential hypertension (support, 17.43%; confidence, 34.86%). Six association rules were found among three comorbid diseases. Among them, essential hypertension was an important node between T2DM and stroke (support, 4.06%; confidence, 8.12%) as well as between T2DM and dyslipidemia (support, 3.44%; confidence, 6.88%). CONCLUSIONS: Essential hypertension plays an important role in the association between T2DM and its comorbid diseases. The Dx Analyze Tool is practical for comorbidity studies that have an enormous clinical database.
Academic Medical Centers/statistics & numerical data
;
Algorithms
;
Case-Control Studies
;
Chi-Square Distribution
;
Comorbidity
;
Data Mining/*statistics & numerical data
;
Databases, Factual/statistics & numerical data
;
Diabetes Mellitus, Type 2/*epidemiology
;
Dyslipidemias/epidemiology
;
Humans
;
Hypertension/epidemiology
;
Republic of Korea/epidemiology
;
Stroke/epidemiology
8.Software development in data analysis and mining for cDNA microarray.
Bin WU ; Jianguo WANG ; Miqu WANG
Journal of Biomedical Engineering 2007;24(6):1394-1397
Data analysis and mining is a key issue to microarray technology and is usually implemented through software development. This paper summarizes the state-of-art software development in cDNA microarray data analysis and mining. The updated software developments are discussed in three stages: data inquisition from cDNA microarray tests, statistical treatment of cDNA data and data mining from gene network.
Cluster Analysis
;
Data Interpretation, Statistical
;
Data Mining
;
methods
;
Gene Expression Profiling
;
statistics & numerical data
;
Humans
;
Oligonucleotide Array Sequence Analysis
;
methods
;
Software Design
10.Construction and thinking of data element standard directory of traditional Chinese medicine clinical pharmacy information.
Xiao-Xia WANG ; Zhong-Zheng JIN ; Gui-Ming GUO ; Hua-Qiang ZHAI ; Shi-Yuan JIN
China Journal of Chinese Materia Medica 2014;39(9):1724-1727
The aim of this study was to develop the data element standard directory of traditional Chinese medicine (TCM) clinical pharmacy information, to provide application standards and models of TCM clinical pharmacy for the electronic medical record (EMR). The developed line of work is as follows: initially establish research through four forms: literature analysis, questionnaires, discussion groups, expert advice. The research range from the Chinese herbal medicine research, herbal origin, harvesting, processing, identification of traits, physical and chemical identification, modern research, character, taste, Indications, clinical application, processing, dispensing medicine, Chinese medicine specifications, usage, dosage, caution, efficacy indications to small packaging applications, drug research, management and other related issues, including traditional Chinese medicine theory, application and hospital management information; according to the general and part 16 content of the national "Health Information Data Element Standards", and the basic method of extracting data element to study and develop the data element of TCM clinical pharmacy information from the defining content. Correspondingly propose the ideas and methods of construction of the "Data Element Standard Directory of TCM Clinical Pharmacy Information", sort out medicine clinical information data element standard catalog, divided into basic categories, clinical application class, management class three parts, and set norms and standards of identifying data elements, definitions, allowable value of traditional Chinese medicine clinical information, and discuss the sources and standards of information collection, leaving the interface, standardized and scientific terminology, docking with the existing standards, maintenance and management program and oter issues.
China
;
Data Mining
;
methods
;
statistics & numerical data
;
Database Management Systems
;
standards
;
statistics & numerical data
;
Electronic Health Records
;
standards
;
statistics & numerical data
;
Evidence-Based Medicine
;
methods
;
statistics & numerical data
;
Humans
;
Information Dissemination
;
methods
;
Medicine, Chinese Traditional
;
methods
;
Phytotherapy
;
methods
;
statistics & numerical data