- Nome
- Tim McNamara
- Membro desde
- Jul 25, 2011
- 30 Conjuntos de dados
- 130 Edições
Conjuntos de dados
-
A database of all photographs and related data from or about space from 1961 collected by NASA. It includes data from the International Space Station, NASA's Earth Observatory and flight...
-
Ngrams and code from Dr. Peter Norvig's chapter for Beautiful Data (2009), edited by Segaran and Hammerbacher. Data files are derived from the Google Web Trillion Word Corpus, as...
-
A free database of chemicals, covering around 26 million compounds integrated from 400 data sources. The database APIs are free for academic users, or for non-academic non-commercial...
-
From the dataset prologue: This is a set of Prolog assertions for facts about countries, kindly made available by Ronen Feldman of Bar-Ilan University. It was extracted by Ronen Feldman...
-
Attribution is requested by the owner: You may freely download this data for your own exclusive research purposes. You should publish computer recognition results achieved on this data,...
-
Open access repository of conference poster submissions.
-
8.5 gigabytes of faces for training facial recognition software.
-
"free and open access to biodiversity data" GBIF is a global organisation working for open science with biodiversity data. It seeks to make it easy for data gatherers, technical...
-
GEO-Portal provides access to remote sensing data. It is a vast database that focuses on providing data specific for public goods, notably Disasters Health...
-
The Global Risk Data Platform is a repository of s maps, raw data and a WMS service for disaster related information. It aggregates data from several third party sources. From its...
-
Data for and rendered map of training levels in the European Union.
-
About Detailed morality statistics from 37 countries. From the website: The Human Mortality Database (HMD) was created to provide detailed mortality and population data to researchers,...
-
The Internet Storm Center has a large number of security related tools. For example, it includes a service which checks file MD5 hashes against a database of of tens millions of confirmed...
-
From the website: The Multimission Archive at [the Space Telescope Science Institute] is a NASA funded project to support and provide to the astronomical community a variety of...
-
Microsoft has developed services on the basis of ngrams from all of Bing's en_US corpus. The raw public data available include two files with the top 100k words from this corpus. The...
-
M-Lab is an industry/academy partnership for assessing the capabilities of broadband around the world. From its website: Measurement Lab (M-Lab) is an open, distributed server platform...
-
A large (305GB) database of images for training facial recognition software. From the website: It contains 337 subjects, captured under 15 view points and 19 illumination conditions in...
-
A catalogue with over 500 datasets relating to NASA's missions.
-
The databank is a metadata catalogue provided by Landcare Research for data collected for the Nationial Vegistation Survey.
-
The library is a collection of machine-readable texts and metadata, especially relating to New Zealand and the Asia/Pacific Region. From the website: [The library] provides several...
-
RCV1 is a dataset of 810,000 documents (2.5GB uncompressed), which is available by request from the NIST. Those documents are distributed by CD. For derivative data that is publicly...
-
This is a publicly available, tokenized version of the Reuters RCV1 corpus by David D Lewis et al. The creator requests attribution.
-
UN OCHA (Office for the Coordination of Humanitarian Affairs) provides a very comprehensive directory for major disasters all around the world named ReliefWeb. The system is not very...
-
A set of documents from Reuters' 1986 newswire which have been classified. This dataset is appropriate for testing natural language processing and information retrieval algorithms....
-
From the website SCface is a database of static images of human faces. Images were taken in uncontrolled indoor environment using five video surveillance cameras of various qualities....
-
The unit generally provides reports and mapping tools to support emergency response. These resources are from its data page. From its website: The mission of the Humanitarian Information...
-
This data comes from Kaggle.com, a machine learning competition site. This is a prediction problem in which there are 50000 datapoints (vectors), each datapoint being itself a sparse...
-
Data relating to USA's official development assistance and disaster response activities. The data from usaid.gov/data is sourced from US Official Development Assistance Database...
-
The Ushahidi community provides access to the data from its deployments.
-
The Water Quality Information System (WQIS) holds data relating to New Zealand. The Water Quality Information System (WQIS) contains data and metadata on a range of common water quality...