Please use this identifier to cite or link to this item: https://repository.sustech.edu/handle/123456789/10579
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAbd alkreem, Musab Abd algader
dc.contributor.authorSupervisor - Mohamed Elhafiz Mustafa Musa
dc.date.accessioned2015-02-16T11:40:30Z
dc.date.available2015-02-16T11:40:30Z
dc.date.issued2014-09-22
dc.identifier.citationAbd alkreem, Musab Abd algader. Classification Algorithms Comparison- Case Study: Cancer Patients(SEER Data Set)/ Musab Abd algader Abd alkreem ؛Mohamed Elhafiz Mustafa WAHBI .-Khartoum : sudan university of science and technology,computer science,2014.-53p:ill;28cm.-M.Sc.en_US
dc.identifier.urihttp://repository.sustech.edu/handle/123456789/10579
dc.descriptionThesisen_US
dc.description.abstractData mining is the automatic search of huge data to discover patterns and trends that go beyond simple analysis. Data mining is also known as Knowledge Discovery in Data (KDD). This study investigates the discovery of the survival rate or survivability of a certain disease is possible by extracting the knowledge from the data related to that disease. To do such investigate a large data set needed one of these data sources is SEER[1] (Surveillance Epidemiology and End Results), which is a unique, reliable and essential resource for investigating the different aspects of cancer. In this study we have investigated three data mining techniques Multilayer Perceptron (MLP), K-nearest neighbor and the C4.5 decision trees the goal is to find the best accuracy to predict 5 years survivability of breast cancer. SEER database (period of 1973-2009 with 657,712 records) were used, starting from previous study we determined common variables use, after preprocessed there are 18 variables and 180,302 records. Weka was used to train and test the three techniques. The result show that the best technique is C4.5 accuracy is %95.6 and the second technique is K-nearest neighbor with accuracy %95.4 and the worst is MLP with accuracy %95.3.en_US
dc.description.sponsorshipSudan University of Science and Technologyen_US
dc.language.isoen_USen_US
dc.publisherSudan University of Science and Technologyen_US
dc.subjectClassification Algorithmsen_US
dc.subjectCancer Patientsen_US
dc.subjectSEER Data Seten_US
dc.subjectData miningen_US
dc.subjectKDDen_US
dc.titleClassification Algorithms Comparisonen_US
dc.title.alternativeمقارنه خوارزميات التصنيفen_US
dc.typeThesisen_US
Appears in Collections:Masters Dissertations : Computer Science and Information Technology

Files in This Item:
File Description SizeFormat 
Classification Algorithms Comparison ... .pdfTitle57.64 kBAdobe PDFView/Open
Research.pdfResearch487.28 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.