International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 110 | Views: 267

Research Paper | Computer Science and Information Technology | Sudan | Volume 6 Issue 8, August 2017 | Popularity: 6.3 / 10


     

Evaluation Corpus for Restricted-Domain Question-Answering Systems for the Holy Quran

Bothaina Hamoud, Eric Atwell


Abstract: This paper presents the compilation of a corpus of question-answer pairs for the holy Quran. The corpus has been manually collected from a wide range of sources, and designed to represent the Quran Arabic-English Question and Answer Corpus (QAEQ & AC). QAEQ & AC is a written, bilingual corpus, which comprises Arabic and English text. First, question-answer pairs have been collected from several trusted expert sources. Then the data were merged and cleaned using Microsoft Excel. After that data were converted to the format that suitable for mining tools, where we have created a comma-separated value (CSV) file format. The corpus obtained consists of more than 1500 question-answer pairs which is nearly 50.000 words, divided over Arabic and English languages. It includes different question types such as what, when, why, etc. , and different answer length. We anticipate that the current and subsequent versions of our corpus will be a valuable evaluation resource for computational linguists investigating Quran question and answer, it might be used as a gold standard in researches, that dealing with natural language processing, information retrieval, artificial intelligence. The corpus can be subjected to an annotation to derive linguistic information such as morphological, syntactic, semantic, and lexical information.


Keywords: QAEQ&AC, Quran, corpus, data, question-answer pairs, dataset


Edition: Volume 6 Issue 8, August 2017


Pages: 1133 - 1138



Make Sure to Disable the Pop-Up Blocker of Web Browser




Text copied to Clipboard!
Bothaina Hamoud, Eric Atwell, "Evaluation Corpus for Restricted-Domain Question-Answering Systems for the Holy Quran", International Journal of Science and Research (IJSR), Volume 6 Issue 8, August 2017, pp. 1133-1138, https://www.ijsr.net/getabstract.php?paperid=4081701, DOI: https://www.doi.org/10.21275/4081701



Similar Articles

Downloads: 0

Survey Paper, Computer Science and Information Technology, China, Volume 10 Issue 12, December 2021

Pages: 299 - 304

A Survey of Clustering Algorithms for Streaming

Denis Patrick Bell, Yang Chunting

Share this Article

Downloads: 0

Research Paper, Computer Science and Information Technology, Kenya, Volume 10 Issue 6, June 2021

Pages: 1621 - 1628

Enterprise Resource Planning Integration and Performance of Safaricom Public Limited Company Kenya

Wambui Caroline, Tumuti Joshua

Share this Article

Downloads: 0

Case Studies, Computer Science and Information Technology, India, Volume 11 Issue 6, June 2022

Pages: 1356 - 1365

The Life-Saving Mission for COVID-19 Vaccination on Google Cloud (GC) Ecosystem

Ramamurthy Valavandan, Kumaraswamy Reddy, Prasanth Parayatham, Ubaiyadulla Sherif, Pallav Kohli, Vikram Sharma, Pragathi S, Vijay R, Surasa Mukherjee, Nitin Ambekar, Dinesh Sai Teja Neeli, Santosh Baran, Vijender Singh, Saurabh Uniyal, Praveen B, Musheer Ahmed N

Share this Article

Downloads: 0

Analysis Study Research Paper, Computer Science and Information Technology, India, Volume 11 Issue 12, December 2022

Pages: 278 - 282

Data Analysis of the Multimission Satellite Product Generation Pattern for Defining the Archival Policy in the Data Centre

C. Pradeep, Gaurav Gupta, Murali Krishna, G. Prasad

Share this Article

Downloads: 0

Research Paper, Computer Science and Information Technology, India, Volume 12 Issue 3, March 2023

Pages: 900 - 903

Web Mining

Sunkara Nagasivaanjaneya Reddy, R. Nagarjuna Yadav, Alka Choksi

Share this Article
Top