crunchcrawl

693

0

Updated:5 months ago

Files:3

Size:2.57 MB

Formats:csv

License:CC-BY-4.0

Crunchcrawl This module lets you index and download the company information held in Crunchbase. Before using, double-check http://www.crunchbase.com/robots.txt and the API conditions to ensure you'r...

Explore with AI Report an Issue

API Access

Access dataset files directly from scripts, code, or AI agents.

Browse dataset files

Dataset Files

Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.

/core/crunchcrawl/

https://datahub.io/core/crunchcrawl/_r/-/README.md

├ cliargs.php

https://datahub.io/core/crunchcrawl/_r/-/cliargs.php

├ companydata.csv

https://datahub.io/core/crunchcrawl/_r/-/companydata.csv

├ companydata.txt

https://datahub.io/core/crunchcrawl/_r/-/companydata.txt

├ companyurls.txt

https://datahub.io/core/crunchcrawl/_r/-/companyurls.txt

├ companyurls_test.txt

https://datahub.io/core/crunchcrawl/_r/-/companyurls_test.txt

├ datapackage.json

https://datahub.io/core/crunchcrawl/_r/-/datapackage.json

├ gathercompanies.php

https://datahub.io/core/crunchcrawl/_r/-/gathercompanies.php

├ gathercompanyurls.php

https://datahub.io/core/crunchcrawl/_r/-/gathercompanyurls.php

├ parallelcurl.php

https://datahub.io/core/crunchcrawl/_r/-/parallelcurl.php

├ places2k.txt

https://datahub.io/core/crunchcrawl/_r/-/places2k.txt

├ scratchpad.py

https://datahub.io/core/crunchcrawl/_r/-/scratchpad.py

https://datahub.io/core/crunchcrawl/_r/-/zcta5.txt

├ zips_by_amount.csv

https://datahub.io/core/crunchcrawl/_r/-/zips_by_amount.csv

└ zips_by_numbers.csv

https://datahub.io/core/crunchcrawl/_r/-/zips_by_numbers.csv

Key Files

Start with these files — they give you everything you need to understand and access the dataset.

datapackage.json— metadata & schema

https://datahub.io/core/crunchcrawl/_r/-/datapackage.json

README.md— documentation

https://datahub.io/core/crunchcrawl/_r/-/README.md

Typical Usage

1. Fetch datapackage.json to inspect schema and resources
2. Download data resources listed in datapackage.json
3. Read README.md for full context

Data Files

Explore with AI

companydata

Download

About

Last updated: 9 February 2026
Total rows: ...
Format: CSV
File size: 1.98 MB
Source: Crunchbase
License: Creative Commons Attribution 4.0 International

zips-by-amount

Download

About

Last updated: 9 February 2026
Total rows: ...
Format: CSV
File size: 177 kB
Source: Crunchbase
License: Creative Commons Attribution 4.0 International

zips-by-numbers

Download

About

Last updated: 9 February 2026
Total rows: ...
Format: CSV
File size: 412 kB
Source: Crunchbase
License: Creative Commons Attribution 4.0 International

About this dataset

Crunchcrawl


This module lets you index and download the company information held in Crunchbase.
*Before using, double-check http://www.crunchbase.com/robots.txt and the API conditions to ensure you're obeying the terms-of-service*

It contains various scripts to index and pull down the latest data about the company, as well as a snaphot of the data as it was on Monday August 23rd 2010. This data is CC-BY, see http://www.crunchbase.com/help/licensing-policy for more information.

By Pete Warden <pete@petewarden.com>, freely reusable, see http://petewarden.typepad.com for more