Application of Disciplinary Background Knowledge in Medical Text Feature Extraction
10.3969/j.issn.1673-6036.2017.04.012
- VernacularTitle:学科背景知识在医学文本特征抽取中的应用
- Author:
Yingguang ZHAO
;
Shaoping FAN
;
Xinying AN
- Keywords:
Text mining;
TF-IDF;
Feature extraction;
Knowledge discovery
- From:
Journal of Medical Informatics
2017;38(4):50-54,81
- CountryChina
- Language:Chinese
-
Abstract:
The paper analyzes the conditions of research on the current scientific literature text feature extraction methods,applies the TF-IDF method based on background knowledge in the medical text feature extraction,and conducts experimental comparison in four medical fields.The result indicates that this method can obviously improve the extraction effect when there are few vocabularies to be extracted,and is obviously superior to the IDF based TF-IDF method in the aspects of filtration of commonly-used words in the text set and identification of important feature words.