Blog

Blog

Exploring the world of open data: updates, insights, and innovations to empower data-driven solutions.


February 14, 2025

Global Data Solutions: Curated Datasets for Informed Business Decisions

Discover DataHub.io’s tailored data solutions - comprehensive country, postal code, and logistics datasets engineered for transparent, actionable insights that drive business success.


February 13, 2025

Celebrating Data Charm: 5 Collections to Fall in Love with

Finding your perfect match, a DataHub.io collection that fits your needs.


February 12, 2025

VIX Index Data: How to Use Market Volatility for Smarter Trading & Portfolio Protection

What is the CBOE Volatility Index (VIX)? A practical guide to the VIX Index and its applications in trading, featuring a daily updated dataset in CSV and JSON formats.


February 10, 2025

Love Data Week 2025: A Global Celebration of Data’s Impact

Exploring how data ethics and management are shaping the future, one dataset at a time.


February 5, 2025

One database, every postal code worldwide: simplify your search

Ensure data accuracy, reduce delays, and minimize costs with our monthly refreshed database.


February 4, 2025

The Evolution of DNA Sequencing Costs: Insights from 2001 to 2022

Discover how genome sequencing dropped from USD 100 million to just USD 200.


January 31, 2025

30+ Years of History – The most comprehensive Serie A dataset available

Football open data: structured stats for journalists, analysts & managers – updated daily.


January 30, 2025

DataHub: The Platform That Makes Data Work for You

DataHub makes data easy to find, structure, and publish. Built by Datopian, creators of CKAN, Frictionless Data, PortalJS, and more, DataHub provides high-quality, curated datasets for industries that need accuracy, structure, and seamless integration.


January 30, 2025

Resolving Data Inconsistencies in Our Pharma Spending Dataset

Discover how we identified and corrected discrepancies in our pharma spending dataset—particularly with Greece's 2021 PC_GDP—by refining our data filtering processes and script logic.


January 24, 2025

Advanced statistics for English Premier League analysis

Over three decades of free datasets for journalists, investors, and managers in the football industry.


January 23, 2025

Football Data: our open-source collection of worldwide statistics

25 GitHub projects and 5 DataHub datasets to analyze players, stadiums, and competitions.


January 22, 2025

Pharmaceutical Spending Trends: 50 Years of Insights from 50 Nations

OECD Data Insights for Smarter Decisions, Investments, and Policy Innovations.


December 30, 2024

Introducing DataHub.io’s New Global Data Solutions

Discover how DataHub.io’s latest releases—Global Geo Data and the Worldwide Postal Code Database—empower organizations to innovate faster and more efficiently.


December 24, 2024

Technical Deep Dive: Our Global Postal Code Dataset & Roadmap

Discover the power of accurate, comprehensive postal code data with our Global Postal Code Dataset & Roadmap. Learn how we provide high-resolution coverage across 30+ countries, expand to 100+ countries by 2025, and ensure data quality with advanced validation pipelines. Perfect for logistics, geospatial analytics, and market expansion, our dataset offers bulk downloads, API access, and GIS compatibility. Explore our innovative crowdfunding model and practical use cases to transform your data-driven strategies.


December 23, 2024

Discover the Commodities Collection: Metals, Energy, Agriculture & More

Explore the new Commodities Collection on DataHub.io, featuring datasets on precious metals, energy resources, agricultural products, and livestock. Perfect for traders, researchers, and enthusiasts seeking market insights and trends.


December 13, 2024

Optimizing the Clinical Trials (US) Repository: Data Storage and Git LFS Solutions

The clinical-trials-us repository provides a critical resource, offering official U.S. clinical trial outcomes from the FDA. This data is vital for researchers, medical professionals, and policymakers. However, as the repository continues to grow, a key issue has surfaced regarding the best way to manage large datasets—specifically, the 2.3 GB of XML files sourced from ClinicalTrials.gov.


December 12, 2024

Celebrating 100 Stars on GitHub: R2 Bucket Uploader - Simplifying Cloudflare R2 File Uploads

We're excited to announce that our open-source library, R2 Bucket Uploader, has just reached 100 stars on GitHub! 🎉 This milestone is a testament to the value it provides to the community, and we wanted to take a moment to highlight its key features and how it simplifies integrating with Cloudflare R2 storage.


December 9, 2024

Kicking Off: Enhancing Football Datasets on Datahub.io

Discover the latest updates to Datahub.io's football datasets repository, including improved data processing workflows, expanded datasets, and automated updates.


November 7, 2024

Empowering Logistics with a Global Postal Code Data Solution

In the logistics industry, having accurate, up-to-date postal code data is crucial for smooth operations, especially when navigating complex international shipping requirements. In our recent project with a Fortune 500 logistics enterprise, we delivered a comprehensive postal code dataset solution designed to meet this need on a global scale.


October 21, 2024

Country List Dataset: Latest Update, Easy Access on DataHub.io, and Upcoming NPM Release

The Country List dataset is one of the most essential core datasets we maintain at DataHub. It provides a simple, up-to-date list of countries with their official English names and 2 digit codes (ISO 3166-1) in a developer-friendly CSV format.


October 1, 2024

Updating the Country Codes Open Dataset: A Major Overhaul

We are excited to share the latest updates on our open dataset, country-codes. Over time, this dataset had become outdated, and the underlying codebase required a significant overhaul. To ensure it remains reliable and up-to-date, we embarked on a comprehensive update, restructuring the codebase and implementing new processes for regular maintenance.


September 27, 2024

What is next: Enhancing Dataset Discovery and Providing Core Data for the World

In the world of data, accessibility and quality are crucial. As we move into the next phase of DataHub.io, our goal is to make it the go-to place for finding essential and popular datasets. Alongside that, we're building a seamless experience for data publishers to upload and showcase their datasets. Over time, we envision this evolving into a vibrant data marketplace.


2024-09-25

Introducing a new endpoint for fetching raw data files from DataHub Cloud datasets


2024-07-10

Learn how to publish your Obsidian vault with DataHub Cloud


2024-05-20

Learn how to configure basic SEO fields and navigation bar in your DataHub Cloud sites


2024-05-17

Learn how to publish a dataset with DataHub Cloud


2024-05-03

Learn how to style your DataHub Cloud sites with custom CSS


2024-03-05

DataHub Cloud Launch on Open Data Day: Build elegant data-driven sites with markdown & deploy in seconds


2023-12-12

Unveiling MarkdownDB's Latest Features: Export to JSON, task extraction, and computed fields 🚀


2023-10-11

Announcing MarkdownDB: an open source tool to create an SQL API to your markdown files! 🚀

MarkdownDB - an open source library to transform markdown content into sql-queryable data. Build rich markdown-powered sites easily and reliably. New dedicated website at markdowndb.com


2023-05-30

Create a catalog of anything using Markdown files in Obsidian


2023-05-29

Quarto: A tool to publish Jupyter notebooks as static websites


2023-04-18

Exporting Wikidata with SPARQL and ChatGPT


2023-04-01

Tutorial: Publishing data rich documents on DataHub


2023-02-13

We have some important updates re Datahub.io!


2022-03-14

Generate an interactive webpage from CSV data and markdown


2021-06-22

A Short Case Study Involving Table Schema Frictionless Specs at the European Union


2021-02-19

A Vision for the next generation of the DataHub (v3)

An overview of the next generation of the DataHub. We want to make it incredibly easy, fast and reliable to share your data in a useable way.\n


2020-05-08

COVID-19 and Compartmental Models in Epidemiology


2020-03-17

Open Data Day 2020 and COVID-19 data


2020-03-08

Comparotron: A simple way to visualize and share comparisons


2018-09-10

New Machine Learning Datasets


2018-09-05

Automatically updated core datasets on DataHub


2018-08-31

Sports data on DataHub


2018-08-23

Attribute Relation File Format (ARFF)


2018-07-18

How to use multiple DataHub accounts


2018-07-16

World Bank Indicators on DataHub


2018-07-10

Automated KPIs collection and visualization of the funnels


2018-06-11

Revamped awesome collections: data sets that are grouped by subject


2018-05-25

Machine learning datasets


2018-05-23

Auto-publish your datasets using Travis-CI


2018-05-15

JavaScript SDK for data deployment


2018-05-14

How to initialize a data package using data tool


2018-04-19

Validate your Data Package descriptor online


2018-04-11

Q1 2018 Review


2018-03-26

New Features and Improvements


2018-01-29

Improved Reporting and Debugging of Data Publishing


2018-01-24

Data Validation in the DataHub


2018-01-23

Which country spends the most on pharmaceutical drugs?


2017-12-13

Introducing private datasets on the DataHub


2017-12-01

Data desktop app - alpha release with drag and drop data publishing support


2017-11-16

How to use Data Packages from R


2017-11-14

Import online data files directly with scheduling


2017-11-03

Core Data: Essential Datasets for Data Wranglers and Data Scientists


2017-10-31

See events and activity related to datasets or publishers


2017-10-19

Datasets in zip format


2017-10-18

Previews for large datasets


2017-10-17

Vega views upgrade - now using v3


2017-10-16

Excel Files on the DataHub: Automated Previews and Data Extraction


2017-10-11

Data Package v1 Specifications. What has Changed and how to Upgrade


© 2024 All rights reservedBuilt with DataHub Cloud

Built with LogoDataHub Cloud