Compare/Merge Natural Earth Vector and Wikidata databases ( work in progress )
Work in progress ...
- It is about > 2-3 hour run time ...
- sytem req: linux, docker, docker-compose
git clone https://github.com/ImreSamu/natural-earth-vector-qa.git
cd natural-earth-vector-qa
make build
make init
make runparallel
make export
The Wikidata SPARQL query service is limited to 5 parallel queries per IP, and this program using all 5!
filename | description |
---|---|
wikidata_naturalearth_qa.db | matching file in sqlite3 database format |
_wd_match.csv | all matching information in csv format |
_wd_match_f1_ok.csv | only the first class matches |
_wd_match_f2_good.csv | only the second class matches |
_wd_match_f3_maybe.csv | maybe |
_wd_match_wikidataid_diffs.csv | data problems? ne.wikidataid != best match |
_wd_match_wikidataid_new.csv | new wikidataids |
_wd_match_wikidataid_validated.csv | validated wikidataids |
_wd_match_validated_unicodename_diff | validated, but the name can be updated |
- Program license: License: MIT](https://opensource.org/licenses/MIT)
- Data license:
- Input databases:
- Natural Earth Vector : public domain
- Wikidata
- "All structured data from the main and property namespace is available under the Creative Commons CC0 License"
- Output reports/databases:
- Creative Commons CC0 License;
- Input databases: