Neurocommons text mining pilot

About

> The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by following this link to the mailing list. The tar.gz format is a tarred and gzipped file containing triples given in the N-Triples syntax.

Data exposed: extracted from Temis software applied to 7% of Medline records

Size of dump and data set: 24 MB

Notes: released without contract
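
The tar.gz downloads described above hold triples in N-Triples syntax, so they can be read with a standard library alone. The following is a minimal sketch, not the project's own tooling: the function name `iter_triples` and the simplified regular expression are assumptions made for illustration, and the pattern does not cover the full N-Triples grammar (for strict parsing, a dedicated RDF library would be a better fit).

```python
import io
import re
import tarfile

# Simplified N-Triples line pattern (an assumption, not the full grammar):
# subject is an IRI or blank node, predicate is an IRI, and the object is
# whatever remains before the terminating " ."
TRIPLE_RE = re.compile(r'^\s*(<[^>]*>|_:\S+)\s+(<[^>]*>)\s+(.+?)\s*\.\s*$')


def iter_triples(fileobj):
    """Yield (subject, predicate, object) strings from a tar.gz of N-Triples files."""
    with tarfile.open(fileobj=fileobj, mode="r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            extracted = archive.extractfile(member)
            for raw in extracted:
                line = raw.decode("utf-8", errors="replace").strip()
                if not line or line.startswith("#"):
                    continue  # skip blank lines and comments
                match = TRIPLE_RE.match(line)
                if match:
                    yield match.groups()
```

To use it, open one of the downloaded archives in binary mode (for example `open("dump.tar.gz", "rb")`, where the filename is hypothetical) and pass the file object to `iter_triples`. Lines that do not match the simplified pattern are silently skipped.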

Openness

The data is composed of other datasets, most of which are open.

Data and Resources

Additional Information

Field Value
Source http://sw.neurocommons.org/2007/text-mining.html
Author <URI>
Last updated October 10, 2013, 23:04 (Etc/UTC)
Created October 27, 2009, 19:02 (Etc/UTC)