Welcome to the Quranic Arabic Corpus, an annotated linguistic resource which shows the Arabic grammar, syntax and morphology for each word in the Holy Quran. The corpus provides three levels of analysis: morphological annotation, a syntactic treebank and a semantic ontology.
The Quran is a significant religious text written in Quranic Arabic,
and is followed by believers of the Islamic faith. The Quran contains
6,236 numbered verses (ayāt) and is divided
into 114 chapters.
An example verse from the Quran:
(21:30) Have those who disbelieved not considered that the heavens and the earth were a joined entity, and We separated them and made from water every living thing? Then will they not believe?
- Version 0.4 Released - new and updated linguistic features in this version of the corpus
- Word by Word Quran - maps out the syntax of the entire Quran, with analysis and translation
- Quranic Grammar - traditional Arabic grammar (إعراب) illustrated using dependency graphs
How you can get involved
This project contributes to the research of the Quran by applying natural language computing technology to analyze the Arabic text of each verse. The word by word grammar is very accurate, but ensuring complete accuracy is not possible without your help. If you come across a word and you feel that a better analysis could be provided, you can suggest a correction online by clicking on an Arabic word.
World map of users of the Quranic Arabic
Corpus, provided by Google Analytics.
Countries with the highest number of users are shaded in darker green.
The map above shows worldwide interest in the Quranic Arabic Corpus. Every day, the website is used by over 2,500 people from 165 different countries. Help us review the information on this website so that together we can build the most accurate linguistic resource for Quranic Arabic.
The Quranic Arabic Dependency Treebank (QADT)
The Quranic treebank is an effort to map out the entire grammar of the Quran by linking Arabic words through dependencies. The linguistic structure of verses is represented using mathematical graph theory. The annotated corpus provides a novel visualization of Quranic syntax using dependency graphs.
The Ontology of Quranic Concepts
The Quranic Ontology uses knowledge representation to define the key concepts in the Quran, and shows the relationships between these concepts using predicate logic. Named entities in verses, such as the names of historic people and places mentioned in the Quran, are linked to concepts in the ontology.
- Message Board - discuss the Arabic language and grammatical analysis of the Quran
- Frequently Asked Questions - more information about the research project