Analysis of acoustic parameters for consonant voicing classification in clean and telephone speech |
| |
Authors: | Lee Suk-Myung, Choi Jeung-Yoon |
| |
Institution: | Yonsei University, 134 Shinchon-dong, Seodaemun-gu, 120-749, Seoul, Republic of Korea. pooh390@dsp.yonsei.ac.kr |
| |
Abstract: | This paper describes acoustic cues for classifying consonant voicing in a distinctive feature-based speech recognition system. Initial acoustic cues are selected by studying consonant production mechanisms. Spectral representations, band-limited energies, and correlation values are examined, along with Mel-frequency cepstral coefficient (MFCC) features. Analysis of variance is performed to assess the relative significance of the features. Overall, classification rates of 82.2%, 80.6%, and 78.4% are obtained on the TIMIT database for stops, fricatives, and affricates, respectively. Combining the acoustic parameters with MFCCs improves performance in all cases. In addition, performance on NTIMIT telephone-channel speech shows that the acoustic parameters are more robust than MFCCs. |
| |
Keywords: | |
This article is indexed in PubMed and other databases. |
|