keyboard_arrow_up
Arabic Dataset for Automatic Keyphrase Extraction

Authors

Mohammed Al Logmani1 and Husni Al Muhtaseb2, 1Saudi Aramco, Saudi Arabia and 2King Fahd University for Petroleum & Minerals, Saudi Arabia

Abstract

We propose a dataset in Arabic language for automatic keyphrase extraction algorithms. Our Arabic dataset contains 400 documents along with their keyphrases. The dataset covers eighteen different categories. An evaluation using a state-of-the-art algorithm demonstrates the accuracy of our dataset is similar to that of English datasets.

Keywords

Keyphrase extraction, Arabic, dataset

Full Text  Volume 7, Number 1