You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Compress screencast idle time and split Spark scene into sub-scenes
Add sentinel markers in scene.py so screencast.sh can detect Spark
sub-scene boundaries (profile, validate, sample). Replace monolithic
Spark narration with 4 focused clips. Add cast file compression step
in narrate.sh to cap idle gaps at 2s and realign timing log, fixing
audio/video desync.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Copy file name to clipboardExpand all lines: examples/narrate.sh
+92-3Lines changed: 92 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -42,7 +42,14 @@ gen_clip "gx" "Generate a full Great Expectations suite deterministically
42
42
gen_clip "prompts""Generate structured prompts for LLMs. Documentation prompts. Validation rule prompts. All the column metadata and domain context is included automatically."
43
43
gen_clip "diff""Schema evolution tracking. Modify a table and see exactly what changed. Added columns. Modified descriptions."
44
44
gen_clip "sql_plan""Generate full SQL execution plans from UMF metadata. Joins, column derivations, survivorship logic, aggregations. All computed automatically from the schema relationships."
45
-
gen_clip "spark""Now the PySpark features. Starting a Spark session. Creating DataFrames. Profiling schemas. Validating data against UMF specs. And generating sample data. All from the same UMF metadata."
45
+
gen_clip "context""Context-aware validation. Different nullable rules for each Line of Business, all from one YAML file."
46
+
gen_clip "compat""Schema evolution safety. Check backward compatibility before deploying changes."
47
+
gen_clip "excel""Export to Excel for domain experts. Import their edits back with no data loss."
48
+
gen_clip "cli""CLI commands for schema authoring. Add a column, modify its type, rename it with alias preservation. Assign domain types from the built-in registry. And manage validation expectations — all without touching YAML directly."
49
+
gen_clip "spark_session""Now we enter PySpark territory. Creating a Spark session and a sample DataFrame with five claims."
50
+
gen_clip "spark_profile""SparkToUmfMapper infers a UMF schema from the DataFrame. Column names, types, and nullability, all detected automatically."
51
+
gen_clip "spark_validate""TableValidator checks the DataFrame against the UMF spec. It catches type drift that causes silent data corruption."
52
+
gen_clip "spark_sample""Sample data generation from UMF specs. 100 rows per table, respecting types, nullable rules, and domain constraints."
46
53
gen_clip "close""That's tablespec. Define once. Use everywhere."
0 commit comments