Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Validating for GovData harvesting fails #14

Open
ondics opened this issue Aug 11, 2022 · 0 comments
Open

Validating for GovData harvesting fails #14

ondics opened this issue Aug 11, 2022 · 0 comments

Comments

@ondics
Copy link

ondics commented Aug 11, 2022

When checking our datasets, the EU validator using Profile DCAT-AP.de 2.0 - Spezifikation (BETA.08-04.1)

https://offenedaten.frankfurt.de/catalog.rdf?fq=tags:GovData-Harvesting

we get these errors:

Test:[Shape] - [http://dcat-ap.de/def/dcatde/Distribution_dcat_mediaType_v_List] - [Wert] - [text/csv]
Test:[Shape] - [http://dcat-ap.de/def/dcatde/Distribution_dct_format_v_List] - [Wert] - [HTML]
Test:[Shape] - [http://dcat-ap.de/def/dcatde/Catalog_dct_language_v_List] - [Wert] - [de]

The fields come from the CKAN core and have semantic relevance in CKAN, so we cannot easily change values of those fields.

The easiest way to fix these errors for GovData harvesting would be to modify the creation of the RDF file catalog.rdf
using some code like:

match ckanLanguage:
        case "de":
            rdfLanguage = "http://publications.europa.eu/resource/authority/language/DEU"
        case "en":
            rdfLanguage = "http://publications.europa.eu/resource/authority/language/ENG"
        case _:
            return "CKAN language not supported currently"

For a quick fix we could start with the most popular strings used in dcat_mediaType, dct_format and language_v.

Are there better fixes? Or maybe in other CKAN extensions?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant