dc.contributor.author |
Omer, Hajer Yousif |
|
dc.contributor.author |
Supervisor, -Albaraa Abuobieda Mohammed |
|
dc.date.accessioned |
2018-02-22T08:30:55Z |
|
dc.date.available |
2018-02-22T08:30:55Z |
|
dc.date.issued |
2017-10-03 |
|
dc.identifier.citation |
Omer, Hajer Yousif.A new IR Method for Mixed Document Scoring Adjustment Using Multilingual Document\Hajer Yousif Omer ;Albaraa Abuobieda Mohammed.-Khartoum:Sudan University of Science & Technology, College of Computer Science and Information Technology ,2017.-66p.:ill.;28cm.-M.Sc |
en_US |
dc.identifier.uri |
http://repository.sustech.edu/handle/123456789/20392 |
|
dc.description |
Thesis |
en_US |
dc.description.abstract |
Non-English-speaking users, such as Arabic speakers, are not able to express terminology in their native languages. It leads to mixing languages together. Therefore, such non English speaking users may express their queries in a mixed form between two languages, mostly English and the native language, in order to precisely present their concepts to search engines. This also makes the web essentially cross lingual and/or multilingual. These effects specifically in the non-English scientific documents, which contain some, English terms to express terminology. It shown that in non-English documents, those in Arabic for example, terms often accompanied by their translations resulting in the co-occurrence of the same term in two different languages. When searching using mixed query the result is that this documents dominate the top retrieved documents. However, the approach is to add another level to adjust the score using the co-occurrence to minimize it. After implementing the proposed method, it show improvement in the ranking. In this thesis, we have designed a new method to set the matching level in retrieving information from multilingual documents. The method tested on a set of documents (watan-2004 of a textual nature). The results in this research show that the proposed new method has improved the accuracy of retrieval after comparing it to a search engine that does not have this feature.
|
en_US |
dc.description.sponsorship |
Sudan University of Science & Technology |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Sudan University of Science and Technology |
en_US |
dc.subject |
Document |
en_US |
dc.subject |
Scoring Adjustment |
en_US |
dc.subject |
Multilingual Document |
en_US |
dc.title |
A new IR Method for Mixed Document Scoring Adjustment Using Multilingual Document |
en_US |
dc.title.alternative |
طريقة جديدة لضبط مستوى التطابق في استرجاع المعلومات من الوثائق متعددة اللغات |
en_US |
dc.type |
Thesis |
en_US |