Extended U.S. Economic Indicators: GDP and Unemployment Data (1920–Present)
Files | Size | Format | Created | Updated | License | Source |
---|---|---|---|---|---|---|
4 | 27.5 kB | csv | Creative Commons Attribution 4.0 | U.S. Bureau of Economic Analysis (BEA) via FRED |
This data package contains extended datasets for U.S. Gross Domestic Product (GDP) and Unemployment Rates, combining official modern data with historical annual data compiled from archival sources. The datasets provide a continuous time series starting from 1920.
Data Files
File | Description | Size | Last modified | Download |
---|---|---|---|---|
gdp_combined_1920_2023_annual | Extended real GDP dataset combining historical annual data (1920–1946) from U.S. Census Bureau with modern annual aggregates (1947–2023) from BEA via FRED. | 2.8 kB | gdp_combined_1920_2023_annual | |
gdp_since_1920_1947_annual_us_census_bureau | Annual real GDP data for 1920–1947 manually compiled from U.S. Census Bureau’s historical statistics. | 1.13 kB | gdp_since_1920_1947_annual_us_census_bureau | |
unrate_1920_1948 | Extended unemployment rate dataset with annual data (1920–1947) from U.S. Census Bureau and monthly data (1948–present) from BLS via FRED. | 2.67 kB | unrate_1920_1948 | |
unrate_since_1948_mly_bls | Monthly unemployment rate data from January 1948 to the present, sourced from BLS via FRED. | 20.9 kB | unrate_since_1948_mly_bls |
Data Previews
Combined Extended GDP Data (1920–2023 Annual)
Schema
name | type | description |
---|---|---|
Year | integer | The calendar year (1920–2023). |
GDP | number | Real GDP in billions of chained dollars. |
Source | string | Data source: 'Census Historical' or 'BEA/FRED'. |
Historical GDP Data (1920–1947 Annual from Census Bureau)
Schema
name | type | description |
---|---|---|
Year | integer | The calendar year (1920–1947). |
GDP | number | Real GDP in billions of chained dollars. |
Source | string | Data source: 'Census Historical'. |
Extended Unemployment Rate Data (1920–Present)
Schema
name | type | description |
---|---|---|
Date | string | Date in YYYY format for annual data or YYYY-MM for monthly data. |
Unemployment Rate | number | Unemployment rate as a percentage. |
Source | string | Data source: 'Census Historical' or 'BLS/FRED'. |
Monthly Unemployment Rate Data (1948–Present from BLS)
Schema
name | type | description |
---|---|---|
Date | string | Date in YYYY-MM format. |
Unemployment Rate | number | Unemployment rate as a percentage. |
Extended U.S. Economic Indicators: GDP and Unemployment Data (1920–Present)
This repository contains extended datasets for U.S. Gross Domestic Product (GDP) and Unemployment Rates, combining official modern data with historical annual data compiled from archival sources. The goal is to provide a continuous time series starting from 1920, filling gaps where official quarterly or monthly data is unavailable prior to the late 1940s.
The datasets are unique in their manual extension using historical statistics, making them suitable for long-term economic analysis, research, or educational purposes.
Data Sources
Gross Domestic Product (GDP)
- Modern Data: Quarterly, seasonally adjusted GDP data from January 1947 to the present, sourced from the U.S. Bureau of Economic Analysis (BEA) via the Federal Reserve Bank of St. Louis (FRED).
- Historical Extension: Annual real GDP data for 1920–1946 manually compiled from the U.S. Census Bureau’s Bicentennial Edition: Historical Statistics of the United States, Colonial Times to 1970. Official GDP data prior to 1947 is not available from BEA or FRED.
- Archive Link: U.S. Census Bureau Historical Statistics
Unemployment Rate
- Modern Data: Monthly unemployment rate data from January 1948 to the present, sourced from the U.S. Bureau of Labor Statistics (BLS) via the Federal Reserve Bank of St. Louis (FRED).
- Historical Extension: Annual unemployment data for 1920–1947 manually compiled from the U.S. Census Bureau’s Bicentennial Edition: Historical Statistics of the United States, Colonial Times to 1970.
- Archive Link: U.S. Census Bureau Historical Statistics
Dataset Files
GDP Files
- gdp_combined_1920_2023_annual.csv: The extended dataset combining historical annual data (1920–1946) with annual aggregates from modern sources (1947–2023). Columns include:
Year
: The calendar year.GDP
: Real GDP in billions of chained dollars (or appropriate unit; adjust based on your data normalization).Source
: Indicates whether data is from "Census Historical" or "BEA/FRED".
- gdp_since_1920_1947_annual_us_census_bureau.csv: Historical annual GDP data specifically from the U.S. Census Bureau (1920–1947). This is the raw extension data used in the combined file.
Unemployment Files
- unrate_1920_1948.csv: The extended dataset with annual unemployment rates (1920–1947) and monthly data starting from 1948. Columns include:
Date
: Year or YYYY-MM format.Unemployment Rate
: Percentage of the labor force unemployed.Source
: Indicates whether data is from "Census Historical" or "BLS/FRED".
- unrate_since_1948_mly_bls.csv: Monthly unemployment rate data from BLS/FRED (1948–present). This is the raw modern data used in the extended file.
Data Compilation Process
- Historical Data: Manually extracted from tables in the U.S. Census Bureau’s historical compendium. For GDP, this involved aggregating or estimating annual figures where quarterly data was absent. For unemployment, annual averages were used.
- Modern Data: Downloaded from FRED API or website.
- Merging: Historical and modern data were aligned by year, with any necessary adjustments for consistency (e.g., chaining methods for GDP). The combined files provide a seamless time series.
- Notes on Limitations:
- Pre-1947 data is annual only; no quarterly or monthly breakdowns available.
- Data may have methodological differences between historical and modern sources (e.g., definitions of unemployment).
- Ensure to check for updates from official sources, as this dataset is a snapshot up to 2023.
Usage
- Loading in Python (Example):
import pandas as pd # Load combined GDP data gdp_df = pd.read_csv('gdp_combined_1920_2023_annual.csv') print(gdp_df.head()) # Load extended unemployment data unrate_df = pd.read_csv('unrate_1920_1948.csv') print(unrate_df.head())
- These datasets can be used for time-series analysis, econometric modeling, or visualization in tools like Excel, R, or Python.
License
This dataset is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. You are free to share and adapt the data, provided you give appropriate credit to the original sources (BEA, BLS, FRED, and U.S. Census Bureau) and this repository.
Acknowledgments
- Data compilation inspired by the need for long-term economic historical context.
- Thanks to the U.S. government agencies for providing public data.
If you use this dataset in your work, please cite it as: "Extended U.S. GDP and Unemployment Data (1920–Present), [Your GitHub Username/Repo], 2025."
For questions or contributions, open an issue or pull request in this repository.