JavaScript SDK for data deployment

May 15th, 2018       Anuar Ustayev

Here we explain how you can use the JavaScript SDK for data deployment. If you need a detailed step-by-step tutorial, please go to this article:

Read more

How to initialize a data package using data tool

May 14th, 2018       Anuar Ustayev

In this article we explain how easy it is to add a datapackage.json file to your data. You need to have the data tool installed - download...

Read more

Validate your Data Package descriptor online

April 19th, 2018       Dmitry German

To help users create Data Packages, we have implemented a descriptor validation tool:


Read more

Q1 2018 Review

April 11th, 2018       Anuar Ustayev       Rufus Pollock       Adam Kariv

We’re sharing an update on all the progress we made in the first quarter of 2018. We massively improved our data command line tool, sped up data deployment 5-100x and introduced...

Read more

New Features and Improvements

March 26th, 2018       Dmitry German

Good day, dear data miners, scientists and statisticians!

During the last month we focused on polishing the existing product - the DataHub platform and the data-cli tool....

Read more

Improved Reporting and Debugging of Data Publishing

January 29th, 2018       Anuar Ustayev

We’ve integrated our pipelines system with the website to display more insights to our users. Any dataset you publish on DataHub can be in one of three states: processing,...

Read more

Data Validation in the DataHub

January 24th, 2018       Rufus Pollock       Anuar Ustayev

Users can now use the DataHub to validate their tabular data, for example checking that dates really are dates or that a column of daily revenue is always positive.

Data validation is also...

Read more

Which country spends the most on pharmaceutical drugs?

January 23rd, 2018       Meiran Zhiyenbayev

Several graphs illustrate pharmaceutical drug spending from the list of OECD countries. The data is clean and available in several formats such as CSV, JSON,...

Read more

Introducing private datasets on the DataHub

December 13th, 2017       Anuar Ustayev       Rufus Pollock

Today we are releasing support for private datasets on the DataHub. Private datasets are exactly that: private, visible and accessible only to their owners.

This feature...

Read more

Data desktop app - alpha release with drag and drop data publishing support

December 1st, 2017       Anuar Ustayev

We are pleased to announce the launch of our new desktop application for DataHub users. The app brings drag and drop publishing of data. In addition, users can preview and validate their data prior...

Read more

How to use Data Packages from R

November 16th, 2017       Meiran Zhiyenbayev       Anuar Ustayev

This tutorial demonstrates how to use Data Packages from R. We assume that you already know about Data Packages and its...

Read more

Import online data files directly with scheduling

November 14th, 2017       Anuar Ustayev       Rufus Pollock

Users can now import online data files directly into the DataHub using the data command line tool – and set up scheduled re-imports at the same time.

We’re very excited about...

Read more

Core Data: Essential Datasets for Data Wranglers and Data Scientists

November 3rd, 2017       Rufus Pollock       Meiran Zhiyenbayev       Anuar Ustayev

The “Core Data” project provides essential data for the data wrangling and data science community. Its online home is on the DataHub:

Read more

See events and activity related to datasets or publishers

October 31st, 2017       Anuar Ustayev

You can now see publisher- and dataset-related events. As we track processes happening in our system, users have the ability to discover which publishers have been active or which datasets are updated...

Read more

Datasets in zip format

October 19th, 2017       Anuar Ustayev

We are now generating compressed versions of datasets so users can download a dataset as a single file. You can find it in the “Data Files” table on the showcase page. For example, you can have a...

Read more

Previews for large datasets

October 18th, 2017       Anuar Ustayev

We are now generating preview versions of large datasets so your web browser does not crash when loading a large amount of data. The preview versions consist of the first 5k rows of datasets (if a dataset...

Read more

Vega views upgrade - now using v3

October 17th, 2017       Anuar Ustayev

As you know, publishers can create various views using Vega visualizations in DataHub (learn more about views here). We have just upgraded our platform to use Vega...

Read more

Excel Files on the DataHub: Automated Previews and Data Extraction

October 16th, 2017       Anuar Ustayev

In this tutorial, we will explain how to push Excel data to the DataHub. When an Excel file is pushed, we can extract data from selected sheets for previewing and downloading in alternative...

Read more

Data Package v1 Specifications. What has Changed and how to Upgrade

October 11th, 2017       Meiran Zhiyenbayev

This post walks you through the major changes in the Data Package v1 specs compared to pre-v1. It covers changes in the full suite of Data Package specifications including Data Resources and Table...

Read more

How much space are you using?

October 4th, 2017       Anuar Ustayev

We’ve just added functionality that shows some basic information on how many datasets you have and how much space you are using.

You can see this information by logging in and visiting your...

Read more
