You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(elt-common): Configure iceberg warehouse based on pipeline directory name (#376)
ref #321
Instead of requiring the `warehouse` value to be configured for
pyiceberg, set it based on the pipeline directory name. This is as per
the most recent couple of comments in the issue.
I suspect `is_ingest_job` may not be the best way to handle what it
does, but probably best to revisit when we add the `transform`
functionality and have to actually start thinking about non-ingest jobs.
Copy file name to clipboardExpand all lines: elt-pipelines/README.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -46,7 +46,9 @@ elt-pipelines/
46
46
| | | | |-- <job name>.py
47
47
```
48
48
49
-
- Each 'target warehouse' is the name of an Iceberg warehouse. The data ingested by the pipelines inside that directory end up in that warehouse.
50
-
- The directory structure from `ingest` down is what is required for `elt-common` to be able to run 'ingest' pipelines.
49
+
- This directory structure is required for using `elt-common`
50
+
- Each 'target warehouse' is the name of an Iceberg warehouse. The data ingested by the pipelines inside that directory
51
+
end up in that warehouse.
51
52
- Data from ingest pipelines is considered 'raw' data, and is loaded into a warehouse suffixed with `_landing`.
52
-
- Under construction: Each warehouse will also have a `transform` subdirectory containing pipelines for converting the raw data into it's final state in the target warehouse.
53
+
- Under construction: Each warehouse will also have a `transform` subdirectory containing pipelines for converting the
54
+
raw data into its final state in the target warehouse.
0 commit comments