SUST Repository

A new IR Method for Mixed Document Scoring Adjustment Using Multilingual Document

Show simple item record

dc.contributor.author Omer, Hajer Yousif
dc.contributor.author Supervisor, -Albaraa Abuobieda Mohammed
dc.date.accessioned 2018-02-22T08:30:55Z
dc.date.available 2018-02-22T08:30:55Z
dc.date.issued 2017-10-03
dc.identifier.citation Omer, Hajer Yousif.A new IR Method for Mixed Document Scoring Adjustment Using Multilingual Document\Hajer Yousif Omer ;Albaraa Abuobieda Mohammed.-Khartoum:Sudan University of Science & Technology, College of Computer Science and Information Technology ,2017.-66p.:ill.;28cm.-M.Sc en_US
dc.identifier.uri http://repository.sustech.edu/handle/123456789/20392
dc.description Thesis en_US
dc.description.abstract Non-English-speaking users, such as Arabic speakers, are not able to express terminology in their native languages. It leads to mixing languages together. Therefore, such non English speaking users may express their queries in a mixed form between two languages, mostly English and the native language, in order to precisely present their concepts to search engines. This also makes the web essentially cross lingual and/or multilingual. These effects specifically in the non-English scientific documents, which contain some, English terms to express terminology. It shown that in non-English documents, those in Arabic for example, terms often accompanied by their translations resulting in the co-occurrence of the same term in two different languages. When searching using mixed query the result is that this documents dominate the top retrieved documents. However, the approach is to add another level to adjust the score using the co-occurrence to minimize it. After implementing the proposed method, it show improvement in the ranking. In this thesis, we have designed a new method to set the matching level in retrieving information from multilingual documents. The method tested on a set of documents (watan-2004 of a textual nature). The results in this research show that the proposed new method has improved the accuracy of retrieval after comparing it to a search engine that does not have this feature.   en_US
dc.description.sponsorship Sudan University of Science & Technology en_US
dc.language.iso en en_US
dc.publisher Sudan University of Science and Technology en_US
dc.subject Document en_US
dc.subject Scoring Adjustment en_US
dc.subject Multilingual Document en_US
dc.title A new IR Method for Mixed Document Scoring Adjustment Using Multilingual Document en_US
dc.title.alternative طريقة جديدة لضبط مستوى التطابق في استرجاع المعلومات من الوثائق متعددة اللغات en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Share

Search SUST


Browse

My Account