A Proposed Automatic Speech Recognition model for the Sudanese Dialect

Mansour, Ayman Abdelaziz Elhassan; Supervisor, - Wafaa Faisal Mukhtar

SUST Home
→
Theses and Dissertations
→
College of Computer Science and Information Technology
→
Masters Dissertations : Computer Science and Information Technology
→
View Item

dc.contributor.author	Mansour, Ayman Abdelaziz Elhassan
dc.contributor.author	Supervisor, - Wafaa Faisal Mukhtar
dc.date.accessioned	2020-12-01T10:58:39Z
dc.date.available	2020-12-01T10:58:39Z
dc.date.issued	2020-09-10
dc.identifier.citation	Mansour, Ayman Abdelaziz Elhassan . A Proposed Automatic Speech Recognition model for the Sudanese Dialect / Ayman Abdelaziz Elhassan Mansour ; Wafaa Faisal Mukhtar .- Khartoum: Sudan University of Science and Technology, college of Computer science and information technology, 2020 .- 78p. :ill. ;28cm .- M.Sc.	en_US
dc.identifier.uri	http://repository.sustech.edu/handle/123456789/25521
dc.description	Thesis	en_US
dc.description.abstract	Nowadays, speech recognition plays a major role in designing a natural voice interface for communication between human and their modern digital life equipment. It is presenting an easy way to cross the language barrier between monolingual individuals. But the obvious problem with this field is the lack of wide support for several universal languages and their dialects; while most of the daily interaction is done using them. This research comes to ensure the viability of designing the Automatic speech recognition model for the Sudanese Dialect. The researcher focused on building a dataset by collecting represented resources and perform pre-processing to construct the dataset. The Automatic speech recognition model was built by training the model to recognize each character of the Sudanese Dialect. The model's architecture followed the end-to-end speech recognition approach. Each building block of the model was formed using Convolution Neural Networks rather than Recurrent Neural Networks, the usual choice of the speech-related task, and the training was done using the Connectionist Temporal Classification learning algorithm. In this research, a Sudanese dialect dataset was built overcoming the lack of annotated data and reached an average label error rate of 73.67%. The proposed model will enable the use of the collected dataset in any Natural Language Processing future research targeting the Sudanese Dialect. The designed model, with its performance, provided some insights about the current recognition task. The model can reach a much better label error rate by deploying any improvement such as a language model. The applications for this research are vastly available from designing archives for the Sudanese content with its text format to develop real-time speech recognizer.	en_US
dc.description.sponsorship	Sudan University of Science and Technology	en_US
dc.language.iso	en	en_US
dc.publisher	Sudan University of Science and Technology	en_US
dc.subject	Computer Science	en_US
dc.subject	Sudanese Dialect	en_US
dc.subject	Automatic Speech Recognition	en_US
dc.title	A Proposed Automatic Speech Recognition model for the Sudanese Dialect	en_US
dc.title.alternative	نموذج مقترح للتعرف الآلى على الأصوات فى اللهجة السودانية	en_US
dc.type	Thesis	en_US