Words frequency statistics-based automatic controlled indexing of medical news
10.3969/j.issn.1671-3982.2014.08.002
- VernacularTitle:基于词频统计法的医学新闻自动受控标引
- Author:
Jingli ZHANG
;
Xiaoyang HE
;
Ting DING
- Publication Type:Journal Article
- Keywords:
Words frequency statistics;
Automatic indexing;
Subject heading indexing;
Controlled indexing;
MeSH
- From:Chinese Journal of Medical Library and Information Science
2014;(8):7-10
- CountryChina
- Language:Chinese
-
Abstract:
After the necessity of using medical news information and the advances in its automatic indexing were analyzed, a novel automatic controlled indexing method of medical news text was put forward. The method intro-duced translated MeSH vocabulary as the main indexing words, merging Chinese commonly used word segmentation dictionary, then calculated word frequency for document text which added split token and sorted it, choose top 5 high-frequency words in MeSH vocabulary indexed document after deleting high-frequency words not in MeSH vo-cabulary.