Skip to content

snatch59/oecd-data-mining

Repository files navigation

oecd-data-mining

The Organisation for Economic Co-operation and Development (OECD) Interface software suite provides a means to discover, download, and convert OECD SDMX-JSON data sets into CSV files. The files can be further processed to select a subset according to set criteria (e.g. industries with electricity), with specific fixed (normalized) column types. The suite covers:

  • downloading list of all OECD data set IDs and descriptions;
  • downloading list of all data set schema;
  • downloading all OECD SDMX-JSON data sets;
  • converting all time period data sets to un-pivoted CSV files;
  • selecting a subset according to set criteria with specific fixed (normalized) column types
  • concatenating this subset of fixed column CSV files into an overall master CSV file.

There is also the means to work with just OECD frequency dimension data, which are a subset of the main time period data sets. This suite covers :

  • identifing frequency dimension (annual/quarterly) supporting schema;
  • downloading just OECD SDMX-JSON data sets with a frequency dimension;
  • converting frequency dimension data sets to multi-indexed CSV files.

The OECD Interface software suite is written for Python 3.5, pandas 0.24.2, and uses the pandasdmx 0.8.2 Python package to convert SDMX-JSON files to multi-indexed CSV files.

The full online documentation is to be found here, and lays out the workflow for using these utilities: https://snatch59.github.io/oecd-data-mining/

Releases

No releases published

Packages

No packages published