International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
|
Volume 3 - Issue 1 |
Published: July 2012 |
Authors: S. Selva Nidhyananthan, R. Shantha Selva Kumari, G. Jaffino |
![]() |
S. Selva Nidhyananthan, R. Shantha Selva Kumari, G. Jaffino . Improving Speaker Identification Performance by Combining Vocal Tract Features. International Journal of Applied Information Systems. 3, 1 (July 2012), 27-33. DOI=http:/ijais12-450433
@article{ http:/ijais12-450433, author = { S. Selva Nidhyananthan,R. Shantha Selva Kumari,G. Jaffino }, title = { Improving Speaker Identification Performance by Combining Vocal Tract Features }, journal = { International Journal of Applied Information Systems }, year = { 2012 }, volume = { 3 }, number = { 1 }, pages = { 27-33 }, doi = { http:/ijais12-450433 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2012 %A S. Selva Nidhyananthan %A R. Shantha Selva Kumari %A G. Jaffino %T Improving Speaker Identification Performance by Combining Vocal Tract Features%T %J International Journal of Applied Information Systems %V 3 %N 1 %P 27-33 %R http:/ijais12-450433 %I Foundation of Computer Science (FCS), NY, USA
This paper proposes fusion and addition techniques of vocal tract features such as Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Mel Frequency Cepstral Coefficients (DMFCC) in speaker identification. Feature extraction plays an important role as a front end processing block in Speaker Identification (SI) process. Mel frequency features are used to extract the spectral characteristics of the speech such as formant frequency and the bandwidth of formant frequency. This feature estimation method leads to robust recognition performance. The Dynamic Mel frequency features are used to extract the dynamic behavior of the human vocal tract using pitch frequency. This work is focused to increase the identification accuracy with databases containing short length speech signal. Experimental evaluation is carried out on TIMIT database with 630 speakers using Gaussian Mixture Model (GMM).