1.A Study for the Acoustic Characteristics of PB Word List according to Frequency.
Moo Kyun PARK ; Jang Hyeog LEE ; Hyuck Sung KWON ; Gi Jung IM ; Jeong Soo WOO ; Heung Man LEE ; Soon Jae HWANG ; Sung Won CHAE
Korean Journal of Otolaryngology - Head and Neck Surgery 2007;50(6):480-485
BACKGROUND AND OBJECTIVES: The Korean phonetically balanced (PB) word list (Ham's Korean PB Word List) is currently the most commonly used word list for speech discrimination test. The aim of this study was to investigate the frequency characteristics of the Ham's Korean PB word list and to compare frequency characteristics of the Ham's Korean PB word list with that of the 21st Century Sejong words, which represent characteristics of the contemporary spoken Korean language. SUBJECTS AND METHOD: The 300 syllables most commonly used were collected from the words in 21st Century Sejong project. The 21st Century Sejong words were weighted according to the frequency of usage. The 21st Century Sejong words and words from the Ham's Korean PB word list were analyzed using an acoustic analysis program that has the MATLAB function. RESULTS: Syllables with frequencies above 2,000 Hz comprise 34% of the 21st Century Sejong words and 16% of the Ham's Korean PB word list. The Ham's Korean PB word list does not represent the high frequency characteristics of the contemporary spoken Korean language. The 21st Century Sejong words have more 'eu', 'i' and open syllables than the Ham's Korean PB word list. CONCLUSION: The Ham's Korean PB word list does not reflect the high frequency characteristics of the contemporary spoken Korean language included in the 21st Century Sejong words. Therefore, a new PB word list that incorporates a greater number of syllables with high frequency characteristics is required.
Acoustics*
;
Audiometry, Speech
;
Speech Acoustics
;
Speech Discrimination Tests
2.Comparison of Speech Rate and Long-Term Average Speech Spectrum between Korean Clear Speech and Conversational Speech
Jeeun YOO ; Hongyeop OH ; Seungyeop JEONG ; In Ki JIN
Journal of Audiology & Otology 2019;23(4):187-192
BACKGROUND AND OBJECTIVES: Clear speech is an effective communication strategy used in difficult listening situations that draws on techniques such as accurate articulation, a slow speech rate, and the inclusion of pauses. Although too slow speech and improperly amplified spectral information can deteriorate overall speech intelligibility, certain amplitude of increments of the mid-frequency bands (1 to 3 dB) and around 50% slower speech rates of clear speech, when compared to those in conversational speech, were reported as factors that can improve speech intelligibility positively. The purpose of this study was to identify whether amplitude increments of mid-frequency areas and slower speech rates were evident in Korean clear speech as they were in English clear speech. SUBJECTS AND METHODS: To compare the acoustic characteristics of the two methods of speech production, the voices of 60 participants were recorded during conversational speech and then again during clear speech using a standardized sentence material. RESULTS: The speech rate and long-term average speech spectrum (LTASS) were analyzed and compared. Speech rates for clear speech were slower than those for conversational speech. Increased amplitudes in the mid-frequency bands were evident for the LTASS of clear speech. CONCLUSIONS: The observed differences in the acoustic characteristics between the two types of speech production suggest that Korean clear speech can be an effective communication strategy to improve speech intelligibility.
Acoustics
;
Rehabilitation
;
Speech Acoustics
;
Speech Intelligibility
;
Speech Perception
;
Voice
3.Acoustic and Electromyographic Characteristics of Fluent Alaryngeal Speech.
Ki Hwan HONG ; Woo Cheul JUNG ; Hee Wan YOUN ; Hyun Ki KIM
Korean Journal of Otolaryngology - Head and Neck Surgery 1997;40(6):879-887
Alaryngeal speech(esophageal, neoglottal, shunt, and electrolaryngeal speech) differ from normal laryngeal speech primarily with regard to the sound or source of voicing. Theoretically, esophageal, tracheoesophageal, neoglottal and electrolaryngeal speech have difficulty in accomplishing the voiceless consonants. But perceptual studies often reveal that there is a clear production of voiceless consonants resulting good articulation scores in skilled alaryngeal speakers except electrolaryngeal speech. The purpose of the present study was to relate the three-way distinction of Korean voiceless stops in manner of articulation with normal speakers and skilled alaryngeal speakers in terms of the voicing distinction in consonants. Acoustic analysis were performed to investigate the acoustic characteristics of alaryngeal speech compared to the normal speech with special reference to the voiceless distinction. Electromyographic studies were performed to clarify the adjustment of neck muscle during normal and alaryngeal speech.
Acoustics*
;
Neck Muscles
;
Speech, Alaryngeal*
4.Study of three-dimensional speech chart by time-frequency analysis in Chinese Mandarin Monosyllabic word phonemes.
Hao CHEN ; Yu-hong QIAN ; Xiao-qin ZHOU ; Meng-he GUO
Journal of Southern Medical University 2010;30(8):1805-1809
OBJECTIVETo investigate the physical characteristic of Chinese Mandarin Monosyllabic word phonemes by the time-frequency analysis.
METHODSManufactured and study the three-dimensional speech chart which was on the basic of information of the time-frequency analysis in 40 monosyllabic words with the SigmaPlot 9.0 and Matlab 7.0 software.
RESULTSThe mean value of basic frequency in 40 monosyllabic words was between 220 to 320 Hz area, and the mean value of four groups has significant difference by t-test. It is indicated by time-frequency analysis that the characteristics vary according to the tone of monosyllabic words. With the change of time, the various frequencies and its intensity of tone first remain basically unchanged, the curve is horizontal type; the different frequencies of tone second gradually tend to high-frequency direction, the graph is upward-type, additionally, the intensity of some tone drops little; the divers frequencies of tone third firstly deflect to low-frequency areas, then increasingly deviate to high-frequency direction of bias after maintaining a period of time, the figure is v-type, intensity changes are concave-shaped; The various frequencies of tone fourth deviate from high-frequency to low-frequency areas, and its intensity drops sharply, the graph is decreased type.
CONCLUSIONThrough time-frequency analysis and three-dimensional speech chart observation, we consider that the physical characteristic of Chinese Mandarin Monosyllabic word phonemes mainly reflects the frequencies and intensity change with time.
Female ; Humans ; Language ; Phonetics ; Speech ; physiology ; Speech Acoustics
5.A Study for the Characteristic Frequency of Korean Disyllabic Word List.
Hyuck Sung KWON ; Moo Kyun PARK ; Chang Jae CHOI ; Hyung Jin JUN ; Soon Jae HWANG ; Sung Won CHAE
Korean Journal of Otolaryngology - Head and Neck Surgery 2008;51(12):1099-1103
BACKGROUND AND OBJECTIVES: Speech audiometry is a useful test for obtaining practical information which could not be gotten from the pure-tone audiometer test. Nowadays, Hahm's Korean disyllabic word lists are the most commonly used word lists for the speech reception threshold test in Korea. The disyllabic word lists should be composed of spondaic words (equally stressed words). The aim of this study is to evaluate the characteristic frequency of Hahm's disyllabic word lists. SUBJECTS AND METHOD: Two of the Hahm's disyllabic word lists were evaluated. The word lists were analyzed using an acoustic analysis program that has the MATLAB function. RESULTS: The percentage of disyllabic words in the Hahm's disyllabic word lists that had the spondaic words pattern were 33%. The characteristic frequencies of these words were equally stressed above 1,000 Hz. Other disyllabic words showed an unequally stressed pattern. The percentage of words that had the first syllable stressed pattern was 31% and the percentage of second syllable stressed pattern was 36%. CONCLUSION: Some words, which are commonly used in the Hahm's disyllabic words lists, had an unequally stressed pattern. A new Korean dissyllabic word list needs to use more balanced words that have the characteristic frequency of each word.
Acoustics
;
Audiometry, Speech
;
Korea
;
Speech Reception Threshold Test
6.A comparative study of pathological voice based on traditional acoustic characteristics and nonlinear features.
Deying GAN ; Weiping HU ; Bingxin ZHAO
Journal of Biomedical Engineering 2014;31(5):1149-1154
By analyzing the mechanism of pronunciation, traditional acoustic parameters, including fundamental frequency, Mel frequency cepstral coefficients (MFCC), linear prediction cepstrum coefficient (LPCC), frequency perturbation, amplitude perturbation, and nonlinear characteristic parameters, including entropy (sample entropy, fuzzy entropy, multi-scale entropy), box-counting dimension, intercept and Hurst, are extracted as feature vectors for identification of pathological voice. Seventy-eight normal voice samples and 73 pathological voice samples for /a/, and 78 normal samples and 80 pathological samples for /i/ are recognized based on support vector machine (SVM). The results showed that compared with traditional acoustic parameters, nonlinear characteristic parameters could be well used to distinguish between healthy and pathological voices, and the recognition rates for /a/ were all higher than those for /i/ except for multi-scale entropy. That is why the /a/ sound data is used widely in related research at home and abroad for obtaining better identification of pathological voices. Adopting multi-scale entropy for /i/ could obtain higher recognition rate than /a/ between healthy and pathological samples, which may provide some useful inspiration for evaluating vocal compensatory function.
Acoustics
;
Humans
;
Sound
;
Speech Acoustics
;
Voice Disorders
;
Voice Quality
7.Design of standard voice sample text for subjective auditory perceptual evaluation of voice disorders.
Jin-rang LI ; Yan-yan SUN ; Wen XU
Chinese Journal of Otorhinolaryngology Head and Neck Surgery 2010;45(9):719-722
OBJECTIVETo design a speech voice sample text with all phonemes in Mandarin for subjective auditory perceptual evaluation of voice disorders.
METHODSThe principles for design of a speech voice sample text are: The short text should include the 21 initials and 39 finals, this may cover all the phonemes in Mandarin. Also, the short text should have some meanings.
RESULTSA short text was made out. It had 155 Chinese words, and included 21 initials and 38 finals (the final, ê, was not included because it was rarely used in Mandarin). Also, the text covered 17 light tones and one "Erhua". The constituent ratios of the initials and finals presented in this short text were statistically similar as those in Mandarin according to the method of similarity of the sample and population (r = 0.742, P < 0.001 and r = 0.844, P < 0.001, respectively). The constituent ratios of the tones presented in this short text were statistically not similar as those in Mandarin (r = 0.731, P > 0.05).
CONCLUSIONSA speech voice sample text with all phonemes in Mandarin was made out. The constituent ratios of the initials and finals presented in this short text are similar as those in Mandarin. Its value for subjective auditory perceptual evaluation of voice disorders need further study.
Humans ; Language ; Reference Standards ; Speech Acoustics ; Speech Perception ; Speech Production Measurement ; Voice Disorders ; diagnosis ; Voice Quality
8.A review on the applications of acoustic analysis in diagnosing disease.
Journal of Biomedical Engineering 2007;24(6):1419-1422
Acoustic analysis is one of the important branches of biometric recognition technology widely used now. The mainly aim of the technology is to recognize the identity of person and judge the content of speech or diagnose the illness automatically according to the features extracted from the speaker's waveforms. All these features are related with the characteristics of speaker's physiological, pathological and psychological action. Speaker recognition study has its 50-year old history already, but acoustic analysis in diagnosing disease has been founded since 1970s. This paper introduces the main concept and research background of this diagnosing system generally and discusses the problems generated during processing. At last the prospect for the applications of acoustic analysis is forecasted.
Humans
;
Pattern Recognition, Physiological
;
Signal Detection, Psychological
;
Speech
;
physiology
;
Speech Acoustics
;
Speech Disorders
;
diagnosis
;
physiopathology
9.Gender difference in speech intelligibility using speech intelligibility tests and acoustic analyses.
The Journal of Advanced Prosthodontics 2010;2(3):71-76
PURPOSE: The purpose of this study was to compare men with women in terms of speech intelligibility, to investigate the validity of objective acoustic parameters related with speech intelligibility, and to try to set up the standard data for the future study in various field in prosthodontics. MATERIAL AND METHODS: Twenty men and women were served as subjects in the present study. After recording of sample sounds, speech intelligibility tests by three speech pathologists and acoustic analyses were performed. Comparison of the speech intelligibility test scores and acoustic parameters such as fundamental frequency, fundamental frequency range, formant frequency, formant ranges, vowel working space area, and vowel dispersion were done between men and women. In addition, the correlations between the speech intelligibility values and acoustic variables were analyzed. RESULTS: Women showed significantly higher speech intelligibility scores than men and there were significant difference between men and women in most of acoustic parameters used in the present study. However, the correlations between the speech intelligibility scores and acoustic parameters were low. CONCLUSION: Speech intelligibility test and acoustic parameters used in the present study were effective in differentiating male voice from female voice and their values might be used in the future studies related patients involved with maxillofacial prosthodontics. However, further studies are needed on the correlation between speech intelligibility tests and objective acoustic parameters.
Acoustics
;
Female
;
Humans
;
Male
;
Phonetics
;
Prosthodontics
;
Speech Intelligibility
;
Voice
10.Definition and Clinical Implication of Temporal Fine Structure
Young Soo CHANG ; Il Joon MOON
Korean Journal of Otolaryngology - Head and Neck Surgery 2018;61(1):1-8
Perception of sound is associated with the use of multiple acoustic cues in the human auditory system. These acoustic cues can be classified by their temporal and spectral properties. Temporal information can be considered as a slowly varying envelope (ENV) superimposed on a more rapid temporal fine structure (TFS). Various sound-processing techniques have been developed to assess the role of acoustic information carried by temporal ENV and TFS information. Previous studies demonstrated that ENV cues are associated with speech recognition in quiet, while TFS cues have a role to melody/pitch perception and listening to speech in a competing background noise. Besides, a number of studies have shown that the relative insensitivity of hearing-impaired subjects to TFS information. Although current technologies used in cochlear implants are not efficient in delivering the TFS cues, new speech processing strategies have been proposed to deliver TFS information. We herein review the cutting edge of TFS related studies and discuss the clinical implication of TFS.
Acoustics
;
Cochlear Implants
;
Cues
;
Hearing Loss
;
Humans
;
Noise
;
Speech Perception