Text-Independent Speaker Identification Using an Improved Hidden Markov Model

Abdullah, Sayed Jaafer; Supervisor - Izzeldin Mohamed Osman

Co-Supervisor - Mohamed Elhafiz Mustafa Musa

Please use this identifier to cite or link to this item: https://repository.sustech.edu/handle/123456789/1610

Title:	Text-Independent Speaker Identification Using an Improved Hidden Markov Model
Authors:	Abdullah, Sayed Jaafer; Supervisor - Izzeldin Mohamed Osman; Co-Supervisor - Mohamed Elhafiz Mustafa Musa
Keywords:	computer programs
Issue Date:	1-Jan-2012
Publisher:	Sudan University of Science and Technology
Citation:	Abdullah, Sayed Jaafer. Text-Independent Speaker Identification Using an Improved Hidden Markov Model / Sayed Jaafer Abdullah; Izzeldin Mohamed Osman. - Khartoum: Sudan University of Science and Technology, Computer Science, 2012. 126 p.: ill.; 28 cm. - Ph.D.
Abstract:	In this thesis, we attempted to build a speaker identification system. The purpose of the thesis is to improve the performance of a speaker identification system based on a hidden Markov model (HMM) classifier. We proposed an approach in which the number of LPCC or MFCC coefficients can be increased, the codebook size can also be increased, and the HMM classifier can be trained and tested with multiple codebooks. The implementation of the system is divided into three stages: feature extraction, vector quantization, and training and classification. In the feature extraction stage, we studied LPCC and MFCC, by which the spectral features of the speech signal can be estimated, and showed how these features can be computed. In the vector quantization stage, we generated a distinct codebook of 256 clusters for each speaker model. In the training and classification stage, we studied the algorithms and implementation of the HMM, and showed how it can be trained to estimate the model parameters using the Baum-Welch algorithm and how it can be tested using the forward algorithm. To evaluate the HMM classifier, we extracted a sample from the Switchboard dataset. The sample consists of recordings of 40 speakers of American English. Experimental results show an average identification rate of 97.5%. The results also show that the identification rate of LPCCs is better than that of MFCCs. We found that the system can achieve a stable identification rate with a codebook of size >=256. We compared LDA and MLP classifiers with the HMM classifier, and the HMM classifier achieved superior results. These results indicate that the proposed approach can improve the performance of a speaker identification system.
Description:	Thesis
URI:	http://repository.sustech.edu/handle/123456789/1610
Appears in Collections:	PhD theses : Computer Science and Information Technology
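
The vector quantization and forward-algorithm scoring stages named in the abstract can be illustrated with a minimal Python sketch. It is not the thesis implementation: the function names (`quantize`, `forward_log_likelihood`, `identify`), the toy codebooks, and the toy HMM parameters are all assumptions made for illustration, and Baum-Welch training is omitted, so the model parameters are simply given. Each speaker is assumed to have its own codebook and a discrete HMM; the identified speaker is the one whose model assigns the highest log-likelihood to the quantized frames.

```python
import math

def quantize(frames, codebook):
    """Map each feature vector to the index of its nearest codeword
    (squared Euclidean distance)."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return [min(range(len(codebook)), key=lambda k: dist2(f, codebook[k]))
            for f in frames]

def forward_log_likelihood(obs, pi, A, B):
    """Scaled forward algorithm for a discrete HMM.

    pi[i]   : initial probability of state i
    A[i][j] : transition probability from state i to state j
    B[i][k] : probability of emitting symbol k in state i
    Returns log P(obs | model).
    """
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                     for j in range(n)]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # the model cannot produce this sequence
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        log_lik += math.log(scale)
    return log_lik

def identify(frames, speakers):
    """Score the frames against every speaker model and return the best one.

    speakers maps a name to a (codebook, pi, A, B) tuple (hypothetical layout).
    """
    scores = {}
    for name, (codebook, pi, A, B) in speakers.items():
        symbols = quantize(frames, codebook)
        scores[name] = forward_log_likelihood(symbols, pi, A, B)
    return max(scores, key=scores.get), scores

# Toy data: one-dimensional "features", two codewords, two-state HMMs.
toy_codebook = [[0.0], [1.0]]
toy_speakers = {
    "speaker_a": (toy_codebook, [0.5, 0.5],
                  [[0.9, 0.1], [0.1, 0.9]],
                  [[0.9, 0.1], [0.8, 0.2]]),
    "speaker_b": (toy_codebook, [0.5, 0.5],
                  [[0.9, 0.1], [0.1, 0.9]],
                  [[0.1, 0.9], [0.2, 0.8]]),
}
best, all_scores = identify([[0.1], [0.2], [0.9], [0.0]], toy_speakers)
```

In a real system the frames would be LPCC or MFCC vectors and the codebook would hold 256 or more codewords, as the abstract reports; the scaling step matters there because unscaled forward probabilities underflow on long utterances.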

Files in This Item:

File	Description	Size	Format
Text-Independent Speaker ... .pdf Restricted Access	Research	2.05 MB	Adobe PDF
