Automated data quality profiling/testing #6280
jqnatividad started this conversation in Ideas
Replies: 1 comment
-
Expanding on this. Here's one workflow I can imagine for a Great Expectations CKAN plugin:
-
Beyond publishing data, it would be good if CKAN also helped ensure the quality of that data.
Data quality checks are typically done before publishing, but what if CKAN helped with that process too?
Has anybody done any work on this front?
The appropriately named Great Expectations looks promising.
As CKAN is widely used in the public sector, here's an interesting case study of Great Expectations being used for preparing housing data to support policy analysis.
https://urban-institute.medium.com/automating-data-quality-checks-with-great-expectations-f6b7a8e51201
Maybe we can even leverage this GE self-updating data dictionary plugin to prepopulate CKAN's data dictionary?
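To make the idea a bit more concrete, here is a minimal sketch of the kind of profiling and checking this could involve. It is plain Python using only the standard library and deliberately does not use the Great Expectations API or any CKAN API. The function names (`profile_csv`, `check_no_empty`), the column names, and the sample rows are all hypothetical, made up for illustration only.

```python
import csv
import io

# Ordering used to widen a column's guessed type as new values are seen.
_RANK = {"integer": 0, "number": 1, "text": 2}


def _kind(value):
    """Guess the narrowest type that can hold a single non-empty cell."""
    for caster, name in ((int, "integer"), (float, "number")):
        try:
            caster(value)
            return name
        except ValueError:
            pass
    return "text"


def profile_csv(text):
    """Return per-column stats: row count, empty-cell count, guessed type."""
    reader = csv.DictReader(io.StringIO(text))
    names = reader.fieldnames or []
    stats = {n: {"rows": 0, "empty": 0, "type": None} for n in names}
    for row in reader:
        for name in names:
            col = stats[name]
            col["rows"] += 1
            value = (row.get(name) or "").strip()
            if value == "":
                col["empty"] += 1
                continue
            kind = _kind(value)
            # Keep the widest type seen so far for this column.
            if col["type"] is None or _RANK[kind] > _RANK[col["type"]]:
                col["type"] = kind
    return stats


def check_no_empty(stats, column):
    """Expectation-style check: the column has no empty cells."""
    return stats[column]["empty"] == 0


# Hypothetical sample data, not taken from any real dataset.
SAMPLE = "parcel_id,units,rent\n1,4,950.50\n2,,1200\n3,12,n/a\n"

if __name__ == "__main__":
    profile = profile_csv(SAMPLE)
    for name, col in profile.items():
        print(name, col, "no_empty:", check_no_empty(profile, name))
```

In a sketch like this, the failed checks are what a publisher would review before (or right after) uploading a resource, and the guessed column types are the sort of information that might be used to seed a data dictionary.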