Files Size Format Created Updated License Source
1 326MB csv 1 month ago
sources/selected-crimes-local-authorities-2012-2015-* Collection of data about Israeli Police events by local authorities and collection of selected crimes. Data source: https://www.odata.org.il/dataset/maazarim1 Contains data for years 2012-2015. Overview Contains 4 XLS (see in sources read more
Download

Data Files

File Description Size Last changed Download Other formats
selected_crimes_2012_2015 [csv] 78MB selected_crimes_2012_2015 [csv] selected_crimes_2012_2015 [json] (249MB)

selected_crimes_2012_2015  

This is a preview version. There might be more data in the original version.

Field information

Field Name Order Type (Format) Description
resource 1 string
place_name 2 string
stat_region_code 3 string
stat_group 4 string
stat_offence 5 string
year_quarter 6 string
value 7 number
id 8 integer

Read me

sources/selected-crimes-local-authorities-2012-2015-*

Collection of data about Israeli Police events by local authorities and collection of selected crimes.

Data source: https://www.odata.org.il/dataset/maazarim1

Contains data for years 2012-2015.

Overview

Contains 4 XLS (see in sources directory), files are identified by their suffix:

  • 1b-small
  • 2a-large
  • 2a-small
  • 2b-large

Some of these files have multiple sheets.

Not all sheets are processed, see the commented-out sheets in plus_plus.source-spec.yaml

The final data is normalized to a single table with resource column identifying the source XLS file / sheet number.

Import into your tool

In order to use Data Package in R follow instructions below:

install.packages("devtools")
library(devtools)
install_github("hadley/readr")
install_github("ropenscilabs/jsonvalidate")
install_github("ropenscilabs/datapkg")

#Load client
library(datapkg)

#Get Data Package
datapackage <- datapkg_read("https://pkgstore.datahub.io/05e414f28d31ed208ca48dac7ababec6/selected_crimes_2012_2015/latest")

#Package info
print(datapackage)

#Open actual data in RStudio Viewer
View(datapackage$data$"selected_crimes_2012_2015")

Tested with Python 3.5.2

To generate Pandas data frames based on JSON Table Schema descriptors we have to install jsontableschema-pandas plugin. To load resources from a data package as Pandas data frames use datapackage.push_datapackage function. Storage works as a container for Pandas data frames.

In order to work with Data Packages in Pandas you need to install our packages:

$ pip install datapackage
$ pip install jsontableschema-pandas

To get Data Package run following code:

import datapackage

data_url = "https://pkgstore.datahub.io/05e414f28d31ed208ca48dac7ababec6/selected_crimes_2012_2015/latest/datapackage.json"

# to load Data Package into storage
storage = datapackage.push_datapackage(data_url, 'pandas')

# to see datasets in this package
storage.buckets

# you can access datasets inside storage, e.g. the first one:
storage[storage.buckets[0]]

In order to work with Data Packages in Python you need to install our packages:

$ pip install datapackage

To get Data Package into your Python environment, run following code:

import datapackage

dp = datapackage.DataPackage('https://pkgstore.datahub.io/05e414f28d31ed208ca48dac7ababec6/selected_crimes_2012_2015/latest/datapackage.json')

# see metadata
print(dp.descriptor)

# get list of csv files
csvList = [dp.resources[x].descriptor['name'] for x in range(0,len(dp.resources))]
print(csvList) # ["resource name", ...]

# access csv file by the index starting 0
print(dp.resources[0].data)

To use this dataset in JavaScript, please, follow instructions below:

Install data.js module using npm:

  $ npm install data.js

Once the package is installed, use code snippet below:

  const {Dataset} = require('data.js')

  const path = 'https://pkgstore.datahub.io/05e414f28d31ed208ca48dac7ababec6/selected_crimes_2012_2015/latest/datapackage.json'

  const dataset = Dataset.load(path)

  // get a data file in this dataset
  const file = dataset.resources[0]
  const data = file.stream()

In order to work with Data Packages in SQL you need to install our packages:

$ pip install datapackage
$ pip install jsontableschema-sql
$ pip install sqlalchemy

To import Data Package to your SQLite Database, run following code:

import datapackage
from sqlalchemy import create_engine

data_url = 'https://pkgstore.datahub.io/05e414f28d31ed208ca48dac7ababec6/selected_crimes_2012_2015/latest/datapackage.json'
engine = create_engine('sqlite:///:memory:')

# to load Data Package into storage
storage = datapackage.push_datapackage(data_url, 'sql', engine=engine)

# to see datasets in this package
storage.buckets

# to execute sql command (assuming data is in "data" folder, name of resource is data and file name is data.csv)
storage._Storage__connection.execute('select * from data__data___data limit 1;').fetchall()

# description of the table columns
storage.describe('data__data___data')
Datapackage.json