feat: Gtfs change tracker 1634#1737
Open
jcpitre wants to merge 22 commits into
Open
Conversation
cka-y
reviewed
Jun 11, 2026
davidgamez
reviewed
Jun 11, 2026
| # google_cloudfunctions2_function does not expose volume mounts in its schema. | ||
| # This terraform_data resource mounts both the datasets GCS bucket and an in-memory tmpfs | ||
| # on the underlying Cloud Run service after the function is deployed. | ||
| resource "terraform_data" "gtfs_change_tracker_gcs_mount" { |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
closes #1634
This pull request introduces a new GTFS change tracking feature to the pipeline. It adds a new Cloud Function (
gtfs-change-tracker) that computes and stores structured diffs between pairs of GTFS datasets, integrates this function into the batch processing workflow, and updates the infrastructure to support the new service. The changes span Python code, requirements, and Terraform infrastructure.GTFS Change Tracking Feature:
Added a new Cloud Function
gtfs-change-trackerthat computes diffs between two GTFS datasets, uploads the changelog to GCS, and persists a summary in the database. This includes the main logic (src/main.py), configuration (function_config.json), and dependencies (requirements.txt,requirements_dev.txt). [1] [2] [3] [4]Implemented a utility function
create_http_gtfs_change_tracker_taskto enqueue Cloud Tasks for the new function, and integrated it into the pipeline so that a change tracking task is created when a new dataset is processed and a previous dataset exists. [1] [2] [3]Infrastructure and Deployment:
Testing
Testing was done with only 1 datasset. Here is the curl call used:
The resulting json diff report is:
ntd-90200-202605211420_ntd-90200-202604081703_changelog.json
Please make sure these boxes are checked before submitting your pull request - thanks!
./scripts/api-tests.shto make sure you didn't break anything