Please use this identifier to cite or link to this item: https://repository.sustech.edu/handle/123456789/7418
Title: SOUND RECOGNITION FOR SELECTED VERSES OF THE HOLY QURAN
Other Titles: التعرف الصوتي علي آيات مختارة من القرآن الكريم
Authors: ELAMIN, ASHRAF MOHAMMED ABDALLAH
Supervisor - ISMAIL EL-AZHARY
Keywords: Voice Recognition
Computer Science
‫‪HMMs‬‬
MFCC coefficients
Issue Date: 1-May-2012
Publisher: Sudan University of Science and Technology
Citation: Elamen,ASHRAF MOHAMMED ABDALLAH.SOUND RECOGNITION FOR SELECTED VERSES OF THE HOLY QURAN/ASHRAF MOHAMMED ABDALLAH ELAMIN;ISMAIL EL-AZHARY.-Khartoum:Sudan University of Science and Technology,College of Computer Science and Information Technology,2012. .-252p. :ill. ;28cm.-M.sc.
Abstract: In this study of sound recognition for selected verses of the Holy Quran, five verses selected from the Holy Quran. This done as a small number of verses contains all the Arabic phonemes. One hundred recitations of each verse were prepared in wave files. Verses had been recited by famous certified readers of the Holy Quran. A file created from each recite by extracting only the first word of the verse. A wave file of noise is set by the researcher's voice, for testing purpose. All wave files recorded at Sampling rate =22.050 kHz , PCM signed 16 bit mono. Three types of coefficients are extracted from each wave file to represent features of speech. They are Mel Frequency Cepstrum Coefficients MFCC, Power Spectral Density PSD and Reflection Coefficients RC. Also three techniques of speech recognition are used. They are Hidden Markov Models, Dynamic time warping and Artificial Neural Networks. Test were done at two levels. The first stage was applied at the full verse level. All the three techniques mentioned were used. HMMs and ANNs trained by the first 30 samples and test done by all the 100 sample of each verse. The second stage was applied in the same way, but the used samples were of the first word of the verse. HMMs technique only was selected to be used in recognition. That due to high recognition rates scored at the first stage. This stage repeated in the same way , but used the first 50 samples in training instead of the first 30 samples. Mainly, HMMs scored high rates of recognition to coefficients used(MFCC, PSD and RC). Low recognition rates with high confusion scored by ANNs and DTW at IIIverse level. For all coefficients used, high scores of recognition rates with low confusion rate concentrated in HMMs with MFCC. MFCC scored higher recognition rates than PSD and RC.
Description: Thesis
URI: http://repository.sustech.edu/handle/123456789/7418
Appears in Collections:PhD theses : Computer Science and Information Technology

Files in This Item:
File Description SizeFormat 
SOUND RECOGNITION FOR ... .pdfTitle81.73 kBAdobe PDFView/Open
Researsh.pdf
  Restricted Access
Researsh1.66 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.