Volume 12, Number 1, September 2016 - DOI: http://dx.doi.org/10.21700/ijcis.2016.273

IJCIS

Computing and Information Sciences is a peer-reviewed journal committed to the timely publication of original research, survey, and tutorial contributions on the analysis and development of computing and information science. The journal is designed mainly to serve researchers and developers who work with information and computing. Papers that combine theoretical analysis with carefully designed computational experiments are particularly welcome. The journal is published 2-3 times per year and is distributed to librarians, universities, research centers, and researchers in computing, mathematics, and information science. The journal maintains strict refereeing procedures through its editorial policies in order to publish only papers of the highest quality. Refereeing is done by anonymous reviewers. Reviews often take four to six months to obtain, occasionally longer, and the publication process takes several additional months.

DOI: http://dx.doi.org/10.21700/ijcis.2016.108

Arabic Word Sense Disambiguation Using Wikipedia

Marwah Alian* - email: marwah2001@yahoo.com 
Arafat Awajan
Akram Al-Kouz

Department of Computer Science, Princess Sumaya University for Technology, Amman, Jordan

*Corresponding author

Received: 30 July 2016
Revised: 10 August 2016
Accepted: 25 August 2016
Published: 29 September 2016

Abstract: In this research we introduce a new approach to Arabic word sense disambiguation that uses Wikipedia as the lexical resource. The nearest sense for an ambiguous word is selected using a Vector Space Model (VSM) and the cosine similarity between the word's context and the senses retrieved from Wikipedia. Three experiments were conducted to evaluate the proposed approach. Two experiments use the first retrieved sentence for each sense from Wikipedia but differ in the Vector Space Model they use, while the third experiment uses the first paragraph retrieved for each sense. The experiments show that using the first retrieved paragraph is better than using the first retrieved sentence, and that a Tf-Idf VSM is better than a raw frequency VSM.

Keywords: Arabic Word Disambiguation; Disambiguation Resource; Vector Space Model; Arabic WordNet; Arabic Wikipedia.
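
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the whitespace tokenizer, and the smoothed idf formula are assumptions made for this example, and the short English texts stand in for sense text that would be retrieved from Arabic Wikipedia. The sketch builds either raw frequency or Tf-Idf vectors for the context and each candidate sense, then picks the sense with the highest cosine similarity to the context.

```python
import math
from collections import Counter


def tokenize(text):
    # Whitespace tokenization only (assumption). A real Arabic pipeline
    # would likely also normalize, remove stop words, and stem.
    return text.lower().split()


def raw_vector(tokens):
    # Raw frequency VSM: each term is weighted by its count.
    return dict(Counter(tokens))


def tfidf_vectors(docs_tokens):
    # Tf-Idf VSM over the small collection {context, sense texts}.
    # The smoothed idf, log((1 + n) / (1 + df)) + 1, is an assumption.
    n = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))
    vectors = []
    for tokens in docs_tokens:
        tf = Counter(tokens)
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def disambiguate(context, senses, weighting="tfidf"):
    # senses maps a sense label to its text (first sentence or paragraph).
    labels = list(senses)
    docs = [tokenize(context)] + [tokenize(senses[label]) for label in labels]
    if weighting == "tfidf":
        vectors = tfidf_vectors(docs)
    else:
        vectors = [raw_vector(d) for d in docs]
    context_vector = vectors[0]
    scores = {
        label: cosine(context_vector, vec)
        for label, vec in zip(labels, vectors[1:])
    }
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    # Toy stand-in data, not taken from the paper.
    toy_senses = {
        "finance": "a bank is a financial institution that accepts money deposits",
        "river": "a bank is the land alongside a river",
    }
    best, scores = disambiguate("he deposited money in his bank account", toy_senses)
    print(best, scores)
```

Passing `weighting="raw"` switches to the raw frequency model, which makes it easy to compare the two weightings on the same context, and passing longer sense texts mirrors the sentence versus paragraph comparison.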





    Contacts

    Editor-in-Chief
    Prof. Jihad Mohamad Alja'am 
    Email: journal.editor.ijcis@gmail.com

    The Journal Secretary
    Eng. Dana Bandok
    Ontario, Canada 
    Email: sec.ijcis@gmail.com 
