Skip to main content

Relation extraction from arabic wikipedia

Research Authors
Gehad Zakria, Mamdouh Farouk, Khaled Fathy, Malak N Makar
Research Date
Research Department
Research Journal
Indian Journal of Science and Technology
Research Member
Research Vol
Volume 12, Issue 46
Research Website
https://scholar.google.com/scholar?oi=bibs&cluster=16981496231598000900&btnI=1&hl=en
Research Year
2019
Research_Pages
01-06
Research Abstract

Objectives/Methods

This study aims to extract relations between entities from Arabic text. RelationExtraction is one of the most important tasks in text mining. Relation extraction is considered as a main step for many applications such as extracting triples from the text, Question Answering and Ontology building. However, extracting relations from the Arabic text is a difficult task compared to English due to lack of annotated Arabic corpora. This paper proposes a method for extracting relations from Arabic text based on ArabicWikipedia articles characteristics. The propose system extracts sentences that contain principle entity, secondary entity and relation from Wikipedia article, then we use WordNet and DBpedia to build the training set. Finally Naive Bayes Classifier is used to train and test the datasets.

Finding

There are few works to extract relations from Arabic text. These works depend on classification, clustering and rule based.

Application/improvement

The experiments show the effectiveness of the proposed approach which achieves high precision with 89% for classifying 19 type of semantic relations.