Please use this identifier to cite or link to this item: https://repository.sustech.edu/handle/123456789/1506
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMohamed, Gamal Saad
dc.contributor.authorSupervisor - Awad EL-Kareem Mohammed Yousof
dc.date.accessioned2013-09-11T11:55:38Z
dc.date.available2013-09-11T11:55:38Z
dc.date.issued2012-09-01
dc.identifier.citationMohamed,Gamal Saad.Detecting Similarity Among Multiple Data Sources/ Gamal Saad Mohamed;Awad EL-Kareem Yousof.-khartoum:Sudan University of Science & Technology,computer science,2012.-92p:ill;28cm.-Ph.D.en_US
dc.identifier.urihttp://repository.sustech.edu/handle/123456789/1506
dc.descriptionThesisen_US
dc.description.abstractEfficient techniques to detect similar data in many data sources has become one of the most important and challenging issues in many areas such as Data Base, Bioinformatics and Data Mining.In this research, a three phase framework for similarity detection is proposed: In the first phase: Data Sources were collected from the web, depending on how it relates to a predetermined domain. The base source is the source of the data available, which describes the domain. In the second phase: the sources obtained are filtered to select data sources with a greater probability of containing data describing the domain by examining the degree of similarity between the base source, and each source from the sources obtained "External Sources". Whereas the selection is only for the external sources which its simi_degree value is less than, or equal to the average of the simi_degree values of all sources. In the third phase: Content similarity is examined between the base source, and all the selected external sources in phase 1, by using the proposed "Probability Measure" that gives a value on the basis of which it is determined whether the content of external sources is similar to the content of the base resource. Experimental result shows that the researcher's similarity framework can achieve better quality result than the conventional approaches.en_US
dc.description.sponsorshipSudan University of Science and Technologyen_US
dc.language.isoenen_US
dc.publisherSudan University of Science and Technologyen_US
dc.subjectData managementen_US
dc.titleDetecting Similarity Among Multiple Data Sources For Categorized DATAen_US
dc.typeThesisen_US
Appears in Collections:PhD theses : Computer Science and Information Technology

Files in This Item:
File Description SizeFormat 
Detecting Similarity among ... .pdfTitle43.8 kBAdobe PDFView/Open
Abstract.pdfAbstract74.02 kBAdobe PDFView/Open
chapter 1.pdf
  Restricted Access
chapter 32 kBAdobe PDFView/Open Request a copy
chapter 2.pdf
  Restricted Access
chapter 103.9 kBAdobe PDFView/Open Request a copy
chapter 3.pdf
  Restricted Access
chapter 47.32 kBAdobe PDFView/Open Request a copy
chapter 4.pdf
  Restricted Access
chapter 100 kBAdobe PDFView/Open Request a copy
chapter5.pdf
  Restricted Access
chapter 213.15 kBAdobe PDFView/Open Request a copy
chapter 6.pdf
  Restricted Access
chapter 16.96 kBAdobe PDFView/Open Request a copy
appendix.pdfappendix61.35 kBAdobe PDFView/Open
refrence.pdfrefrence15.81 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.