The Semantic Quran dataset is a multilingual RDF representation of translations of the Quran. The dataset was created by integrating data from two different semi-structured sources. The dataset were aligned to an ontology designed to represent multilingual data from sources with a hierarchical structure. The resulting RDF data encompasses 43 different languages which belong to the most under represented languages in Linked Data, including Arabic, Amharic and Amazigh. We designed the dataset to be easily usable in natural-language processing applications with the goal of facilitating the development of knowledge extraction tools for these languages. In particular, the Semantic Quran is compatible with the Natural-Language Interchange Format and contains explicit morpho-syntactic information on the utilized terms.

Data and Resources

Additional Info

Field Value
Author Mohamed Sherif
Version 1.0
Last Updated April 27, 2017, 14:17 (UTC)
Created December 4, 2012, 21:41 (UTC)
Version 1
links:dbpedia 7718
links:wiktionary 18655
shortname SemanticQuran
triples 15741399
comments powered by Disqus
comments powered by Disqus