SUST Repository

Imbalanced data classification Enhancement Using SMOTE and NearMiss sampling Techniques

Show simple item record

dc.contributor.author Babikir, MayaminTilalAbdelrahim
dc.contributor.author Supervisor, -Wafaa Faisal
dc.date.accessioned 2022-08-28T10:09:59Z
dc.date.available 2022-08-28T10:09:59Z
dc.date.issued 2022-07-27
dc.identifier.citation Babikir, MayaminTilalAbdelrahim . Imbalanced data classification Enhancement Using SMOTE and NearMiss sampling Techniques \ MayaminTilalAbdelrahimBabikir ; Wafaa Faisal .- Khartoum:Sudan University of Science & Technology,College of Computer Science and Information Technology,2022.-47.p.:ill.;28cm.-M.Sc. en_US
dc.identifier.uri http://repository.sustech.edu/handle/123456789/27430
dc.description Thesis en_US
dc.description.abstract An approach to construction of classifiers from imbalanced datasets is described. The dataset is imbalanced if the classification categories are not approximately equally represented,often real-world data sets are predominately composed of "normal" examples with only a small percentage of "abnormal" or "interesting" examples. It is also the case that the cost of misclassifying an abnormal (interesting) example as a normal example is often much higher than the cost of the reverse error. Under-sampling of the majority (normal) class has been proposed as a good means of increasing the sensitivity of a classifier to the minority class. This research shows that a combination of method of over-sampling the minority (abnormal) class and under-sampling the majority (normal) class can achieve better classifier performance. The methodology involves acquisition the dataset form UCI repository and applying SVM and Random Forest classifier, applying SMOTE method and evaluating classification accuracy before and after balancing. en_US
dc.description.sponsorship Sudan University of Science & Technology en_US
dc.language.iso en en_US
dc.publisher Sudan University of Science & Technology en_US
dc.subject Information Technology Entitled: en_US
dc.subject Computer Science and Information Technology en_US
dc.subject Imbalanced data classification Enhancement en_US
dc.subject SMOTE and NearMiss sampling Techniques en_US
dc.title Imbalanced data classification Enhancement Using SMOTE and NearMiss sampling Techniques en_US
dc.title.alternative تحسين دقة تصنيف البيانات غير المتوازنة باستخدام تقنيتي en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Share

Search SUST


Browse

My Account