A collection of datasets about Wikipedia and other projects run by the Wikimedia Foundation. The collection is open to contributions by researchers not affiliated with the Foundation.
Our overall data policy is to release into the public domain all datasets that don't require attribution and to license datasets that include textual/media contributions from Wikimedians under the appropriate open license, most commonly a CC BY 3.0 license.
1 datasets found.
This is a non-random dataset containing the edit histories of about 47,000 editors. This can be used for machine learning purposes and the outcome variable is the number of edits six...