Provenance Reconstruction 2: Human Generated News

The ground truth for the second dataset was created using the sources mentioned in news articles from WikiNews. The link between news articles and their sources is modeled using the prov:hadPrimarySource relation. The raw data consists of the entire HTML of the WikiNews articles, without the sources, and a list of URIs “human_sources.txt”. In other words, the goal of this task is to match the source URIs from this list to the correct WikiNews article.

Data and Resources

Additional Info

Field Value
Author Paul Groth, Tom De Nies, Robin Verborgh, Sarah Magliacane
Last Updated June 13, 2014, 11:19 (UTC)
Created June 13, 2014, 11:17 (UTC)
year 2014
comments powered by Disqus
comments powered by Disqus