Wikipedia Clickstream

This project provides datasets containing counts of (referer, resource) pairs extracted from Wikipedia's request logs. A referer is an HTTP header field that identifies the address of the webpage that linked to the requested resource. The data shows how people arrive at a Wikipedia article and which links they click from it. In other words, it defines a weighted network of articles, where each edge weight is the number of times people navigated from one page to another. As an example, the figure below shows incoming and outgoing traffic for the "London" article on English Wikipedia during January 2015.

[Figure: incoming and outgoing traffic for the "London" article on English Wikipedia, January 2015]
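
To make the weighted-network view concrete, here is a minimal sketch that turns (referer, resource, count) rows into a directed graph and reads off an article's incoming and outgoing traffic. The row layout, the function names, the "external_search" referer label, and all counts are assumptions invented for illustration; they are not taken from the actual data files, whose format is described in the official documentation.

```python
from collections import defaultdict

# Hypothetical sample rows: (referer, resource, count).
# All labels and numbers are invented for illustration only.
SAMPLE_ROWS = [
    ("external_search", "London", 100),
    ("England", "London", 30),
    ("London", "River_Thames", 20),
    ("London", "England", 10),
    ("England", "London", 5),  # repeated pair: counts are summed
]


def build_graph(rows):
    """Return a nested mapping {referer: {resource: total_count}}."""
    graph = defaultdict(lambda: defaultdict(int))
    for referer, resource, count in rows:
        graph[referer][resource] += count
    return graph


def incoming(graph, article):
    """Edge weights of all edges that point at `article`."""
    return {
        referer: targets[article]
        for referer, targets in graph.items()
        if article in targets
    }


def outgoing(graph, article):
    """Edge weights of all edges that leave `article`."""
    return dict(graph.get(article, {}))


if __name__ == "__main__":
    graph = build_graph(SAMPLE_ROWS)
    print("incoming:", incoming(graph, "London"))
    print("outgoing:", outgoing(graph, "London"))
```

A nested dictionary keeps the sketch dependency-free; for larger files a graph library or a dataframe grouped by (referer, resource) would likely be a better fit.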

Official Documentation

Can be found here

Data and Resources

Additional Info

Author: Ellery Wulczyn
Maintainer: Ellery Wulczyn
Last Updated: April 7, 2016, 18:42 (Etc/UTC)
Created: February 6, 2015, 00:06 (Etc/UTC)