A Draft Version of the Linked Human Imprintome

This dataset is an updated collection of all known and predicted imprinted genes in the human genome, termed the imprintome. Our research group has examined, compiled, structured and linked data from around 240 unique genes to serve as a shared resource for genome and epigenome interrogation studies. A detailed biological description of imprinted genes, with their roles and features, is outside the scope of this work; instead, we aim to provide a reasonable, valid application of Semantic Web and Linked Data approaches to a structured dataset on the human imprintome. We have therefore focused on presenting the big picture of the human imprintome as a large-scale draft map of linked data that supports browsing and mining imprinted genes, as a step towards further domain-expert understanding of their complex nature. We plan to use data management operations, especially those enabled by S3QL (http://s3ql.info), which can also be formalized in SPARQL (Deus et al., 2011). We are working to mature the core S3DB application and other applications that use S3DB's API to interoperate with the S3DB data service (http://link.s3db.org/owl) before adopting the final model and exposing the data via a SPARQL endpoint (approximately 130,000 triples in N3 format). Currently, we provide two datasets of human genes in RDF format, which together comprise the Linked Human Imprintome v1.0 (a draft version): one with 120 imprinted genes and a second with 128 predicted-to-be-imprinted genes. Work on the Linked Human Imprintome is still in progress and will involve additional data-centric tools and applications, including evaluation of initial data mining tasks and the use of mashpoint frameworks for publishing data.
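As a rough illustration of how such a SPARQL endpoint might be queried once it is available, the sketch below builds a GET request URL in the style of the SPARQL 1.1 Protocol, where the query text is passed in the `query` parameter. The endpoint URL, the `ex:` namespace and the `ex:imprintingStatus` predicate are all hypothetical placeholders, not part of the published dataset; the real vocabulary may differ. The request is only constructed, not sent.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Hypothetical endpoint URL: the dataset description does not give the real one.
ENDPOINT = "http://example.org/imprintome/sparql"

# Hypothetical vocabulary: the dataset's actual predicates may differ.
QUERY = """
PREFIX ex: <http://example.org/imprintome#>
SELECT ?gene ?status
WHERE {
  ?gene ex:imprintingStatus ?status .
}
LIMIT 10
"""


def build_request_url(endpoint: str, query: str) -> str:
    """Return a GET URL that carries the SPARQL query in the 'query' parameter."""
    return endpoint + "?" + urlencode({"query": query.strip()})


if __name__ == "__main__":
    # Print the URL that a client would fetch; no network call is made here.
    print(build_request_url(ENDPOINT, QUERY))
```

Keeping URL construction separate from the network call makes it easy to inspect the encoded query before pointing a client at a real endpoint.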

Data and Resources

Additional Information

Field Value
Source http://www.uece.br/mestradonutricao/index.php/certificacao
Author Diana Magalhaes de Oliveira
Version v1.0
