Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Text independent speaker identification with finite multivariate generalised Gaussian mixture model and k–means algorithm

Text independent speaker identification with finite multivariate generalised Gaussian mixture... In this paper, we propose text independent speaker identification with a finite multivariate generalised Gaussian Mixture Model (GMM) with a k–means algorithm. Each speaker's speech spectra are characterised with a mixture of generalised Gaussian distribution that includes Gaussian and Laplacian distribution as a particular case. Speech analysis is done with the Mel Frequency Cepstral Coefficients (MFCC) extracted from the front end process. Using the EM algorithm and k–means algorithm the model parameters the numbers of acoustic classes associated with each speech spectra are determined. The performance of the proposed algorithm is studied through experimental evaluation and observed that this algorithm outperforms the existing speaker identification algorithm with GMM. It is also observed that this algorithm performs efficiently even with a heterogeneous population with small (less than 2 seconds) utterances. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png International Journal of Signal and Imaging Systems Engineering Inderscience Publishers

Text independent speaker identification with finite multivariate generalised Gaussian mixture model and k–means algorithm

Loading next page...
 
/lp/inderscience-publishers/text-independent-speaker-identification-with-finite-multivariate-YS6AoNfN2H

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

Publisher
Inderscience Publishers
Copyright
Copyright © Inderscience Enterprises Ltd. All rights reserved
ISSN
1748-0698
eISSN
1748-0701
DOI
10.1504/IJSISE.2013.053419
Publisher site
See Article on Publisher Site

Abstract

In this paper, we propose text independent speaker identification with a finite multivariate generalised Gaussian Mixture Model (GMM) with a k–means algorithm. Each speaker's speech spectra are characterised with a mixture of generalised Gaussian distribution that includes Gaussian and Laplacian distribution as a particular case. Speech analysis is done with the Mel Frequency Cepstral Coefficients (MFCC) extracted from the front end process. Using the EM algorithm and k–means algorithm the model parameters the numbers of acoustic classes associated with each speech spectra are determined. The performance of the proposed algorithm is studied through experimental evaluation and observed that this algorithm outperforms the existing speaker identification algorithm with GMM. It is also observed that this algorithm performs efficiently even with a heterogeneous population with small (less than 2 seconds) utterances.

Journal

International Journal of Signal and Imaging Systems EngineeringInderscience Publishers

Published: Jan 1, 2013

There are no references for this article.