Correction System of a Mis-recognized Medical Vocabulary of Speech-based Electronic Medical Record.
- Author:
Hwa Jeong SEO
1
;
Ju Han KIM
Author Information
1. Seoul National University Biomedical Informatics, Seoul National University College of Medicine, Korea. hjseo@snu.ac.kr
- Publication Type:Original Article
- Keywords:
EMR(Electronica Medical Record);
Speech Recognition;
Data Entry;
NLP(Natural Language Processing)
- MeSH:
Electronic Health Records*;
Vocabulary*
- From:Journal of Korean Society of Medical Informatics
2002;8(4):11-20
- CountryRepublic of Korea
- Language:Korean
-
Abstract:
Speech recognition as an input tool for electronic medical record enables efficient data entry at the point of care. We evaluated the speech recognition accuracy of IBM ViaVoiceTM for doctor-patient dialogues and for pronounced medical vocabularies. The recognition accuracy for doctor-patient dialogues was 95.4%, while that for pronounced medical vocabularies was 55.1%. In order to put speech-based electronic medical record to practical use, mis-recognized vocabulary must be significantly corrected. This paper describes a correction system for mis-recognized medical vocabulary for speech recognition-enabled electronic medical record. The correction system is composed of an extraction and a correction steps. In the extraction step, hamming distance between a parsed substring and the nearest medical vocabulary in the vocabulary database greater than 50% of the length of the substring was used to determine if the substring is a possible mis-recognized medical vocabulary. In the correction step, possible mis-recognized medical vocabularies are scored such that when both the code and location of a syllable is the same with those of a medical vocabulary found in our database, +5 is given and when the code is the same but the location is not, +1 is given. The medical vocabulary with the highest score in the database is used as the correction for the mis-recognized one. When 33 patient-doctor dialogues with 33 medical vocabularies were tested for three times by six testees (i.e., 33 x 6 x 3 = 594 sentences), 94% of the mis-recognized words were correctly detected and repaired. Poor recognition performance for hard medical vocabularies can be markedly improved by the mis-recognized medical vocabulary correction system.