34 tietoaineistoa avainsanalla size-large:
-
Title: Scientific Literature Digital Library Description CiteSeer is a scientific literature digital library and search engine that focuses primarily on the literature in computer and...
-
Description The International HapMap Project is a partnership of scientists and funding agencies from Canada, China, Japan, Nigeria, the United Kingdom and the United States to develop a...
-
Description From main page: Accident & Incident Reports Preliminary Data Final Data More » Accident & Incident Data Aviation Data & Statistics...
-
Description Library of Congress Authority data harvested in December 2006. From the readme: Using a custom agent, we were able to harvest 6.95 million authority records using the...
-
The Galaxy Zoo files contain almost a quarter of a million galaxies which have been imaged with a camera attached to a robotic telescope (the Sloan Digital Sky Survey, no less). In order...
-
-
Description From home page: "RefDIC is an open-access database of quantitative mRNA/Protein profiles specifically for immune cells." From http://refdic.rcai.riken.jp/document.cgi:...
-
Released in 2008 and funded by Hewlett Foundation. From front page: ... This database makes searchable the copyright renewal records received by the US Copyright Office between 1950 and...
-
Description Not a producer of data but focused on extracting and aggregating data from other sources. Openness: OPEN License: no explicit license used but all underlying data...
-
Description: Taken from http://bulk.resource.org/: Welcome to bulk.resource.org, a service of public.resource.org. This system contains UNSUPPORTED, AS-IS copies of selected U.S....
-
Description From the main page: GenBank® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences (Nucleic Acids Research, 2008...
-
Description A large collection of USA time series data taken from a variety of sources, primarily state and central government in the USA. From the about page...
-
Description Data created by traveline and used by (among others) transportdirect. From http://www.pti.org.uk/repository.htm: The third snapshot of the traveline data was taken in October...
-
Description From front page: OpenGuides™ is a network of free, community-maintained wiki guidebooks to places around the world. Anyone is free to contribute, whether it's by writing new...
-
Description Large global surveys of 'values' taking place every five years since 1990 described on its website as "The world's most comprehensive investigation of Political and...
-
From about page: The Open Directory Project is the largest, most comprehensive human-edited directory of the Web. It is constructed and maintained by a vast, global community of...
-
Description As of August 2008 over 52 thousand structures available for download. From home page: The RCSB PDB provides a variety of tools and resources for studying the structures of...
-
From home page: The MONDIAL database has been compiled from geographical Web data sources listed below: CIA World Factbook, a predecessor of Global Statistics which...
-
Description From front page: The main objective of FLOSSMETRICS is to construct, publish and analyse a large scale database with information and metrics about libre software development...
-
From excellent (txt) README (!) http://www.archimedespalimpsest.net/0_ReadMe.txt: 1 Rights and Conditions of Use The Archimedes Palimpsest data is released with license for use under...
-
Description "Free access to data produced by the Office for National Statistics, government departments and devolved administrations." Datasets Census: links to lists of all...
-
Description Lots of sheet music. While quite a bit has source files much only seems to be in pdf. Openness: OPEN License: not specified but strongly appears to be open plus most...
-
Description From the front page: ceprDATA.org provides consistent, user-friendly versions of the Survey of Income and Program Participation (SIPP), Current Population Survey (CPS), and...
-
Description From website: The Statistical Abstract of the United States, published since 1878, is the authoritative and comprehensive summary of statistics on the social, political, and...
-
Description US government profiles of countries and territories around the world. Information on geography, people, government, transportation, economy, communications, etc. Openness:...
-
Data exposed: ontology focused on bibliography data of publications from DBLP with additions that include affiliations, universities, and publishers Size of dump and data set: 11M...
-
The data includes all publicly traded firms that had a market capitalization greater than $5 million on January 1, 2010 for non-US firms and all publicly traded companies for US firms....
-
Description "Freebase is an open database of the world’s information. It is built by the community and for the community—free for anyone to query, contribute to, built applications on top...
-
This data set contains 10000054 ratings and 95580 tags applied to 10681 movies by 71567 users of the online movie recommender service MovieLens. Users were selected at random for...
-
Detailed information on almost 3 million U.S. patents granted between January 1963 and December 1999, all citations made to these patents between 1975 and 1999 (over 16 million), and a...
-
Large film/movie database claiming: 425,000+ titles 1,700,000 + filmographies of cast and crew members Films from 1891 to Present Foreign and...
-
Data exposed: corporate ownership Size of dump and data set: 1.8 million triples Notes: also found in the of SPARQL Endpoints
-
The dataset consists of one hundred collegiate Facebook friendship networks. The dataset also includes information on gender, high-school, dorm, academic major, and some other attributes....
-
FAOSTAT provides time-series and cross sectional data relating to food and agriculture for some 200 countries. Openness: ? No explicit license No bulk download...