ckan_geoconnex_bulk_runner_demo_medium.mp4
Status: This codebase is currently a work in progress and more documentation is planned.
The ckan_geoconnex_bulk_runner codebase is meant to run as a container for a bulk integration of a CKAN instance's relevant datasets and vector geospatial features (e.g. for water data hubs) to the Geoconnex knowledge graph. The codebase ultimately runs as a program outputting to standard output JSON-LD on a new line for each approved dataset/location which the Geoconnex crawler then uses to update the Geoconnex knowledge graph.
Refer to the "Contributing via Bulk Containers" documentation here for more information: https://docs.geoconnex.us/contributing/bulk/
This runner is expected to be implemented for a water data hub with the relevant fields and/or ckanext-gztr (not open-source yet) and/or DataPusher+ enabled. For questions reach out to datHere, Center for Geospatial Solutions, or add an issue/discussion.
cargo run -p ckan_geoconnex_bulk_runner --releaseTo ignore standard error output and only show valid output:
cargo run -p ckan_geoconnex_bulk_runner --release 2>/dev/nullcargo test -p ckan_geoconnex_bulk_runnerTo include print statements in test output, run:
cargo test -p ckan_geoconnex_bulk_runner -- --nocaptureIf you have the local dump files setup available you can run those tests with:
cargo test -p ckan_geoconnex_bulk_runner -F local -- --nocapture