Diagnosed Diabetes Prevalence 2004-2013

JohnSnowLabs

Files Size Format Created Updated License Source
2 23MB csv zip 2 weeks ago johnsnowlabs Centers for Disease Control and Prevention
Download

Data Files

File Description Size Last changed Download Other formats
diagnosed-diabetes-prevalence-2004-2013-csv [csv] 1MB diagnosed-diabetes-prevalence-2004-2013-csv [csv] diagnosed-diabetes-prevalence-2004-2013-csv [json] (1MB)
diagnosed-diabetes-prevalence-2004-2013_zip [zip] Compressed versions of dataset. Includes normalized CSV and JSON data with original data and datapackage.json. 2MB diagnosed-diabetes-prevalence-2004-2013_zip [zip]

diagnosed-diabetes-prevalence-2004-2013-csv  

This is a preview version. There might be more data in the original version.

Field information

Field Name Order Type (Format) Description
State 1 string Name of a State.
FIPS_Codes 2 integer The FIPS county code is a five-digit Federal Information Processing Standard (FIPS) code (FIPS 6-4) which uniquely identifies counties and county equivalents in the United States, certain U.S. possessions, and certain freely associated states.
County 3 string A county is a geographical region of a country used for administrative or other purposes.
Number_2004 4 integer Total Number of Diabetic patients.
Percent_2004 5 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2004 6 number Lower Confidence Limit
Upper_Confidence_Limit_2004 7 number Upper Confidence Limit
Age_Adjusted_Percent_2004 8 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2004 9 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2004 10 number Age Adjusted Upper Confidence Limit
Number_2005 11 integer Total Number of Diabetic patients.
Percent_2005 12 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2005 13 number Lower Confidence Limit
Upper_Confidence_Limit_2005 14 number Upper Confidence Limit
Age_Adjusted_Percent_2005 15 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2005 16 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2005 17 number Age Adjusted Upper Confidence Limit
Number_2006 18 integer Total Number of Diabetic patients.
Percent_2006 19 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2006 20 number Lower Confidence Limit
Upper_Confidence_Limit_2006 21 number Upper Confidence Limit
Age_Adjusted_Percent_2006 22 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2006 23 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2006 24 number Age Adjusted Upper Confidence Limit
Number_2007 25 number Total Number of Diabetic patients.
Percent_2007 26 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2007 27 number Lower Confidence Limit
Upper_Confidence_Limit_2007 28 number Upper Confidence Limit
Age_Adjusted_Percent_2007 29 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2007 30 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2007 31 number Confidence
Number_2008 32 number Total Number of Diabetic patients.
Percent_2008 33 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2008 34 number Lower Confidence Limit
Upper_Confidence_Limit_2008 35 number Upper Confidence Limit
Age_Adjusted_Percent_2008 36 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2008 37 number Age Adjusted Upper Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2008 38 number Age Adjusted Lower Confidence Limit
Number_2009 39 integer Total Number of Diabetic patients.
Percent_2009 40 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2009 41 number Lower Confidence Limit
Upper_Confidence_Limit_2009 42 number Upper Confidence Limit
Age_Adjusted_Percent_2009 43 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2009 44 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2009 45 number Age Adjusted Upper Confidence Limit
Number_2010 46 integer Total Number of Diabetic patients.
Percent_2010 47 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2010 48 number Lower Confidence Limit
Upper_Confidence_Limit_2010 49 number Upper Confidence Limit
Age_Adjusted_Percent_2010 50 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2010 51 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2010 52 number Age Adjusted Upper Confidence Limit
Number_2011 53 integer Total Number of Diabetic patients.
Percent_2011 54 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2011 55 number Lower Confidence Limit
Upper_Confidence_Limit_2011 56 number Upper Confidence Limit
Age_Adjusted_Percent_2011 57 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2011 58 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2011 59 number Age Adjusted Upper Confidence Limit
Number_2012 60 integer Total Number of Diabetic patients.
Percent_2012 61 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2012 62 number Lower Confidence Limit
Upper_Confidence_Limit_2012 63 number Upper Confidence Limit
Age_Adjusted_Percent_2012 64 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2012 65 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2012 66 number Age Adjusted Upper Confidence Limit
Number_2013 67 integer Total Number of Diabetic patients.
Percent_2013 68 number Total percentage of Diabetic patients in %.
Lower_Confidence_Limit_2013 69 number Lower Confidence Limit
Upper_Confidence_Limit_2013 70 number Upper Confidence Limit
Age_Adjusted_Percent_2013 71 number Age Adjusted Percent
Age_Adjusted_Lower_Confidence_Limit_2013 72 number Age Adjusted Lower Confidence Limit
Age_Adjusted_Upper_Confidence_Limit_2013 73 number Age Adjusted Upper Confidence Limit

diagnosed-diabetes-prevalence-2004-2013_zip  

This is a preview version. There might be more data in the original version.

Read me

Import into your tool

In order to use Data Package in R follow instructions below:

install.packages("devtools")
library(devtools)
install_github("hadley/readr")
install_github("ropenscilabs/jsonvalidate")
install_github("ropenscilabs/datapkg")

#Load client
library(datapkg)

#Get Data Package
datapackage <- datapkg_read("https://pkgstore.datahub.io/JohnSnowLabs/diagnosed-diabetes-prevalence-2004-2013/latest")

#Package info
print(datapackage)

#Open actual data in RStudio Viewer
View(datapackage$data$"diagnosed-diabetes-prevalence-2004-2013-csv")
View(datapackage$data$"diagnosed-diabetes-prevalence-2004-2013_zip")

Tested with Python 3.5.2

To generate Pandas data frames based on JSON Table Schema descriptors we have to install jsontableschema-pandas plugin. To load resources from a data package as Pandas data frames use datapackage.push_datapackage function. Storage works as a container for Pandas data frames.

In order to work with Data Packages in Pandas you need to install our packages:

$ pip install datapackage
$ pip install jsontableschema-pandas

To get Data Package run following code:

import datapackage

data_url = "https://pkgstore.datahub.io/JohnSnowLabs/diagnosed-diabetes-prevalence-2004-2013/latest/datapackage.json"

# to load Data Package into storage
storage = datapackage.push_datapackage(data_url, 'pandas')

# to see datasets in this package
storage.buckets

# you can access datasets inside storage, e.g. the first one:
storage[storage.buckets[0]]

In order to work with Data Packages in Python you need to install our packages:

$ pip install datapackage

To get Data Package into your Python environment, run following code:

import datapackage

dp = datapackage.DataPackage('https://pkgstore.datahub.io/JohnSnowLabs/diagnosed-diabetes-prevalence-2004-2013/latest/datapackage.json')

# see metadata
print(dp.descriptor)

# get list of csv files
csvList = [dp.resources[x].descriptor['name'] for x in range(0,len(dp.resources))]
print(csvList) # ["resource name", ...]

# access csv file by the index starting 0
print(dp.resources[0].data)

To use this dataset in JavaScript, please, follow instructions below:

Install data.js module using npm:

  $ npm install data.js

Once the package is installed, use code snippet below:

  const {Dataset} = require('data.js')

  const path = 'https://pkgstore.datahub.io/JohnSnowLabs/diagnosed-diabetes-prevalence-2004-2013/latest/datapackage.json'

  const dataset = Dataset.load(path)

  // get a data file in this dataset
  const file = dataset.resources[0]
  const data = file.stream()

In order to work with Data Packages in SQL you need to install our packages:

$ pip install datapackage
$ pip install jsontableschema-sql
$ pip install sqlalchemy

To import Data Package to your SQLite Database, run following code:

import datapackage
from sqlalchemy import create_engine

data_url = 'https://pkgstore.datahub.io/JohnSnowLabs/diagnosed-diabetes-prevalence-2004-2013/latest/datapackage.json'
engine = create_engine('sqlite:///:memory:')

# to load Data Package into storage
storage = datapackage.push_datapackage(data_url, 'sql', engine=engine)

# to see datasets in this package
storage.buckets

# to execute sql command (assuming data is in "data" folder, name of resource is data and file name is data.csv)
storage._Storage__connection.execute('select * from data__data___data limit 1;').fetchall()

# description of the table columns
storage.describe('data__data___data')
Datapackage.json