API AccessAccess dataset files directly from scripts, code, or AI agents.
Browse dataset files
Access dataset files directly from scripts, code, or AI agents.
Dataset Files
Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.
/core/crunchcrawl/
https://datahub.io/core/crunchcrawl/_r/-/README.md
https://datahub.io/core/crunchcrawl/_r/-/cliargs.php
https://datahub.io/core/crunchcrawl/_r/-/companydata.csv
https://datahub.io/core/crunchcrawl/_r/-/companydata.txt
https://datahub.io/core/crunchcrawl/_r/-/companyurls.txt
https://datahub.io/core/crunchcrawl/_r/-/companyurls_test.txt
https://datahub.io/core/crunchcrawl/_r/-/datapackage.json
https://datahub.io/core/crunchcrawl/_r/-/gathercompanies.php
https://datahub.io/core/crunchcrawl/_r/-/gathercompanyurls.php
https://datahub.io/core/crunchcrawl/_r/-/parallelcurl.php
https://datahub.io/core/crunchcrawl/_r/-/places2k.txt
https://datahub.io/core/crunchcrawl/_r/-/scratchpad.py
https://datahub.io/core/crunchcrawl/_r/-/zcta5.txt
https://datahub.io/core/crunchcrawl/_r/-/zips_by_amount.csv
https://datahub.io/core/crunchcrawl/_r/-/zips_by_numbers.csv
Key Files
Start with these files — they give you everything you need to understand and access the dataset.
datapackage.json— metadata & schema
https://datahub.io/core/crunchcrawl/_r/-/datapackage.json
README.md— documentation
https://datahub.io/core/crunchcrawl/_r/-/README.md
Typical Usage
- 1. Fetch datapackage.json to inspect schema and resources
- 2. Download data resources listed in datapackage.json
- 3. Read README.md for full context
Data Previews
companydata
Loading data...
Schema
| name | type |
|---|---|
| name | string |
| founded_year | integer |
| country_code | string |
| state_code | string |
| zip_code | string |
| city | string |
| address1 | string |
| address2 | string |
| raised_amount | number |
zips-by-amount
Loading data...
Schema
| name | type |
|---|---|
| lat | number |
| lon | number |
| value | number |
| tooltip | string |
zips-by-numbers
Loading data...
Schema
| name | type |
|---|---|
| lat | number |
| lon | number |
| value | number |
| tooltip | string |
Data Files
| File | Description | Size | Last modified | Download |
|---|---|---|---|---|
companydata | 1.98 MB | 27 days ago | companydata | |
zips-by-amount | 177 kB | 27 days ago | zips-by-amount | |
zips-by-numbers | 412 kB | 27 days ago | zips-by-numbers |
| Files | Size | Format | Created | Updated | License | Source |
|---|---|---|---|---|---|---|
| 3 | 2.57 MB | csv | 1 day ago | Creative Commons Attribution 4.0 International | Crunchbase |
Crunchcrawl
This module lets you index and download the company information held in Crunchbase.
*Before using, double-check http://www.crunchbase.com/robots.txt and the API conditions to ensure you're obeying the terms-of-service*
It contains various scripts to index and pull down the latest data about the company, as well as a snaphot of the data as it was on Monday August 23rd 2010. This data is CC-BY, see http://www.crunchbase.com/help/licensing-policy for more information.
By Pete Warden <pete@petewarden.com>, freely reusable, see http://petewarden.typepad.com for more