Research Paper | Computer Science and Information Technology | Sudan | Volume 6 Issue 8, August 2017
Evaluation Corpus for Restricted-Domain Question-Answering Systems for the Holy Quran
Bothaina Hamoud, Eric Atwell
Abstract: This paper presents the compilation of a corpus of question-answer pairs for the Holy Quran, the Quran Arabic-English Question and Answer Corpus (QAEQ & AC). QAEQ & AC is a written, bilingual corpus comprising Arabic and English text, collected manually from a wide range of sources. First, question-answer pairs were collected from several trusted expert sources. The data were then merged and cleaned using Microsoft Excel and converted to a comma-separated values (CSV) file, a format suitable for mining tools. The resulting corpus consists of more than 1500 question-answer pairs, nearly 50,000 words, divided between Arabic and English. It includes different question types, such as what, when, and why, and answers of different lengths. We anticipate that the current and subsequent versions of the corpus will be a valuable evaluation resource for computational linguists investigating Quran question answering; it might be used as a gold standard in research on natural language processing, information retrieval, and artificial intelligence. The corpus can also be annotated to derive morphological, syntactic, semantic, and lexical information.
Keywords: QAEQ&AC, Quran, corpus, data, question-answer pairs, dataset
Edition: Volume 6 Issue 8, August 2017
Pages: 1133 - 1138
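The abstract describes merging question-answer pairs from several sources, cleaning them, and exporting a CSV file. The authors did this in Microsoft Excel; the following is only a minimal Python sketch of comparable steps, not their procedure. The column names (`question`, `answer`, `language`), the duplicate rule, and the list of question words are assumptions made for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical column names; the published corpus may use different ones.
FIELDS = ["question", "answer", "language"]


def merge_and_clean(*sources):
    """Merge lists of QA records, normalising whitespace and
    dropping incomplete or duplicate pairs (assumed cleaning rules)."""
    seen = set()
    merged = []
    for source in sources:
        for record in source:
            # Collapse runs of whitespace and trim both ends.
            row = {f: " ".join(str(record.get(f, "")).split()) for f in FIELDS}
            if not row["question"] or not row["answer"]:
                continue  # skip pairs with a missing question or answer
            key = (row["question"].casefold(), row["answer"].casefold(), row["language"])
            if key in seen:
                continue  # skip case-insensitive duplicates
            seen.add(key)
            merged.append(row)
    return merged


def to_csv(rows):
    """Serialise the cleaned rows as CSV text with a header line."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def question_type_counts(rows, types=("what", "when", "why", "who", "where", "how")):
    """Count English questions by their first word; anything else is 'other'."""
    counts = Counter()
    for row in rows:
        if row["language"] != "en":
            continue
        first = row["question"].split()[0].lower()
        counts[first if first in types else "other"] += 1
    return counts
```

Classifying by the first word is a rough heuristic that only applies to the English side of such a corpus; Arabic questions would need their own list of interrogatives.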