A collection of datasets about Wikipedia and other projects run by the Wikimedia Foundation. The collection is open to contributions by researchers not affiliated with the Foundation.
Our overall data policy is to release into the public domain all datasets that don't require attribution and to license datasets that include textual/media contributions from Wikimedians under the appropriate open license, most commonly a CC BY 3.0 license.
Datasets
3 datasets found.
-
A complete anonymized dump of 11M article ratings collected over one year (July 2011 to July 2012) from the English Wikipedia.
-
This experiment looks at the effects of linking to the revision history of Wikipedia articles with a prominent "last modified" timestamp. Currently, the only way for readers to discover...
-
Data on user preferences set by active Wikipedia editors. Active editors are defined as registered users with at least 5 edits per month in a given project. The dumps were generated on...