Updated

Task: Wrangle GCAT Space Objects Dataset

Every artificial object ever launched into space — from Sputnik in 1957 to the present — catalogued by Jonathan McDowell. Includes launch date, country, object type (payload, rocket body, debris, component), orbit class, and current status.

API Access

Access dataset files directly from scripts, code, or AI agents.

Browse dataset files
Dataset Files

Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.

/technology/gcat-artificial-space-objects/
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/AGENTS.md
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/README.md
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/TASK.md
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/data/objects_per_year.csv
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/data/satcat.csv
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/data/satcat_raw.tsv
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/datapackage.json
Key Files

Start with these files — they give you everything you need to understand and access the dataset.

datapackage.jsonmetadata & schema
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/datapackage.json
README.mddocumentation
https://datahub.io/technology/gcat-artificial-space-objects/_r/-/README.md
Typical Usage
  1. 1. Fetch datapackage.json to inspect schema and resources
  2. 2. Download data resources listed in datapackage.json
  3. 3. Read README.md for full context

Data Views

Data Previews

Satellite Catalog (satcat)

Loading data...

Schema

nametypeformatdescriptionconstraintstitle
jcatstringJonathan's Catalog ID — unique GCAT identifier. Prefix letter indicates catalog (S = standard satcat).JCAT ID
satcatstringUS Space Force / NORAD catalog number. Not set for all objects.NORAD Catalog Number
namestringOfficial or common name of the space object.Object Name
launch_datestringdefaultDate of launch (ISO 8601, YYYY-MM-DD). Partial dates (year only) appear when exact date is uncertain.Launch Date
launch_yearnumberFour-digit launch year, extracted from launch_date. Suitable for aggregation.Launch Year
object_typestringSimplified object classification. Values: Payload, Rocket Body, Debris, Component, Suborbital Payload, Unknown. Derived from the GCAT SatType byte 1.{ "enum": [ "Payload", "Rocket Body", "Debris", "Component", "Suborbital Payload", "Unknown" ] }Object Type
statestringISO country code of the owning nation or organization (e.g. US, SU, CN, RU, FR). Historical codes like SU (Soviet Union) are preserved.State
ownerstringAbbreviated name of the owning organization or agency.Owner
statusstringCurrent or final orbital status. Values: In Orbit, Decayed, Deorbited, Beyond Earth Orbit, Exploded.Status
orbit_classstringOperational orbit category code (e.g. LEO/I, GEO/S, MEO, HEO). See https://planet4589.org/space/gcat/web/intro/orbits.html for definitions.Orbit Class
perigee_kmnumberPerigee altitude above Earth's surface in kilometres, at last known orbital epoch.Perigee (km)
apogee_kmnumberApogee altitude above Earth's surface in kilometres, at last known orbital epoch.Apogee (km)
inclination_degnumberOrbital inclination in degrees, at last known orbital epoch.Inclination (degrees)

Objects Launched per Year by Type

Loading data...

Schema

nametypedescriptiontitle
yearnumberYear
PayloadnumberNumber of payloads launched.Payloads
Rocket BodynumberNumber of rocket bodies (launch vehicle stages) tracked.Rocket Bodies
DebrisnumberNumber of fragmentation debris pieces tracked.Debris
ComponentnumberNumber of payload components tracked.Components

Data Files

FileDescriptionSizeLast modifiedDownload
satcat
Standard catalog of all artificial space objects. One row per phase; most objects have a single phase. Covers all objects ever tracked in Earth orbit and beyond.6.41 MBabout 2 months ago
satcat
objects-per-year
Pre-aggregated count of objects launched per year, broken down by object type. Used for the bar chart view.1.46 kBabout 2 months ago
objects-per-year
FilesSizeFormatCreatedUpdatedLicenseSource
26.41 MBcsvabout 2 months agoOpen Data Commons Public Domain Dedication and LicenseGCAT — Jonathan McDowell's General Catalog of Artificial Space Objects

Task: Wrangle GCAT Space Objects Dataset

Objective

Download and clean data from Jonathan McDowell's GCAT (General Catalog of Artificial Space Objects) and publish it as a structured dataset.

Source

What to do

  1. Download the main catalog from https://planet4589.org/space/gcat/data/cat/satcat.tsv

    • This is a TSV with a fixed-width header section (lines starting with #) followed by tab-separated data
    • Strip the header comment block, keep the column headers, save as data/satcat.csv
  2. Inspect and document the schema — key fields include:

    • JCAT — Jonathan's catalog number (primary key)
    • Satname — satellite name
    • Country — country of origin
    • LDate — launch date
    • Dest — destination orbit type (e.g. LEO, GEO, MEO, HEO, SSO, Lunar, etc.)
    • Status — operational status (A=active, D=decayed, etc.)
    • Type — object type (P=payload, R=rocket body, D=debris)
    • Mass — mass in kg (where known)
  3. Clean the data:

    • Convert to proper CSV with consistent quoting
    • Ensure dates are in ISO format (YYYY-MM-DD) where possible
    • Document any fields that have their own codebooks
  4. Update datapackage.json:

    • Set "status": "structured"
    • Add the CSV as a resource with schema (field names and types)
    • Add a views entry for a bar chart of launches by year (group by launch year, count objects)
  5. Write README.md — describe the dataset, the source, key fields, and example use cases

File structure when done

gcat-artificial-space-objects/
  datapackage.json    # updated with resource + schema
  README.md           # dataset description
  data/
    satcat.csv        # cleaned catalog
  AGENTS.md
  TASK.md
  .datahubignore

Notes

  • The GCAT site has many sub-catalogs (launch vehicles, launch sites, etc.) — focus only on satcat for now
  • McDowell updates GCAT regularly; note the download date in README.md
  • Do NOT push to DataHub — leave that for the human operator
  • Status field codes: A=active, AR=active reentry, D=decayed, DU=decayed unknown date, E=expelled from solar system, L=landed, M=maneuvering, N=new (not tracked), R=reentered, S=sample return, U=uncontrolled reentry