SUST Repository

Multiple Imputation, Regression Imputation and Expectation Maximization Methods for Handling Missing Data

Show simple item record

dc.contributor.author Mohammed, Montasir Abbas Ahmed
dc.contributor.author Supervisor, -Adil M. Y. Waniss
dc.contributor.author Co-Supervisor, -Khalid R. K. Genawi
dc.date.accessioned 2022-11-21T07:47:05Z
dc.date.available 2022-11-21T07:47:05Z
dc.date.issued 2019-07-12
dc.identifier.citation Mohammed, Montasir Abbas Ahmed .Multiple Imputation, Regression Imputation and Expectation Maximization Methods for Handling Missing Data \ Montasir Abbas Ahmed Mohammed ; Adil M. Y. Waniss .- Khartoum:Sudan University of Science & Technology,College of Science,2019.-146p.:ill.;28cm.-Ph.D. en_US
dc.identifier.uri http://repository.sustech.edu/handle/123456789/27843
dc.description Thesis en_US
dc.description.abstract Missing data are widespread, and pose problems for many statistical procedures. We all should be using methods that treat missing data properly, rather than deleting data or using single imputation. Importantly, researcher should pay attention by using most appropriate analysis of his data, in order to arrive to conclusions that have more accurate parameters. To achieve this objective, an appropriate method of handling treating missing data must be chosen before starting the analysis. This research aims to a comparative study to the Multiple Imputation (MI) method of estimation against two other methods; the Regression Imputation of estimation and the Expectation-maximization (EM) algorithm of estimation, for estimating missing data. The study is based in application on data randomly generated, some of them were missed by different percentages (5%, 10%, 15%, 20% and 30%). It also uses SPSS Program as statistical package to help in estimating and analyzing the data. Data randomized was tested using little's test on which this data was divided into missing completely at random and missing not completely at random. The study proved that based on descriptive statistics, there is considerable differences between means and variances of estimated missing values, and to test the statistical significance of differences, the study used ANOVA test, and the vi consequently results proved that there is no significant difference between means. The study also found that (98%) of the correlations were not significant based on the correlation matrix. finally, the study compared the estimated missing values after calculating the mean absolute error (MAE), based on the results, the study concluded that the Expectation-maximization (EM) method of estimation is better than the other two methods in producing more efficient estimates. This study recommends to give attention should be paid to the missing data in the design and performance of the studies and in the analysis of the resulting data. And the application of the sophisticated statistical analysis techniques should only be performed after the maximal efforts have been employed to reduce missing data as they can cause bias and lead to invalid conclusions. It also recommends using the Expectation-maximization (EM) method of estimation because its estimates are the most efficient. en_US
dc.description.sponsorship Sudan University of Science & Technology en_US
dc.language.iso en en_US
dc.publisher Sudan University of Science & Technology en_US
dc.subject Seismic en_US
dc.subject Statistics en_US
dc.subject Regression Imputation en_US
dc.subject Expectation Maximization Methods en_US
dc.subject Handling Missing Data en_US
dc.title Multiple Imputation, Regression Imputation and Expectation Maximization Methods for Handling Missing Data en_US
dc.title.alternative طرق الإدخال المتعدد، إدخال الإنحدار وتعظيم التوقع لتقدير البيانات المفقودة en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Share

Search SUST


Browse

My Account