pp. 2809-2819
S&M3007 Research Paper of Special Issue https://doi.org/10.18494/SAM3921 Published: July 28, 2022 Recognition System for Cantonese Speakers in Different Noisy Environments Based on Estimate–Maximize Algorithm [PDF] Yu Fan, Chin-Ta Chen, and Cheng-Fu Yang (Received March 29, 2022; Accepted May 24, 2022) Keywords: speaker identification, estimate–maximize algorithm, Gaussian mixture model, Mel-frequency cepstrum coefficient, maximum likelihood estimation
Highly accurate personal identification systems are required in many different recognition situations. In this study, the Mel-frequency cepstrum coefficient was used to extract the features of speakers. The aim of this study was to identify different speeches in different noisy environments. A maximum likelihood estimation method based on noise probability was proposed to enhance the recognition effects of the Gaussian mixture model of different speeches from mixed noise speech signals. Experimental results indicated that the method had high recognition results under various noise conditions. The recognition results of the proposed method in different noise environments were superior to those of a method using only one type of noise for modeling. Experimental results obtained from some unspecified speakers showed that three different languages (Mandarin, English, and Cantonese) were effectively identified.
Corresponding author: Yu Fan, Cheng-Fu YangThis work is licensed under a Creative Commons Attribution 4.0 International License. Cite this article Yu Fan, Chin-Ta Chen, and Cheng-Fu Yang, Recognition System for Cantonese Speakers in Different Noisy Environments Based on Estimate–Maximize Algorithm, Sens. Mater., Vol. 34, No. 7, 2022, p. 2809-2819. |