Stages call helpers (run_map, run_vanilla, YT client methods, etc.) declared under client.operations in YAML. This section documents each pattern.
:maxdepth: 1
map
map-reduce-typed-jobs
command-mode-map-reduce
vanilla
yql
s3
table
sort
| Topic | Link |
|---|---|
| Map | map.md |
| Map-reduce (TypedJob) | map-reduce-typed-jobs.md |
| Map-reduce (command mode) | command-mode-map-reduce.md |
| Vanilla | vanilla.md |
| YQL | yql.md |
| S3 | s3.md |
| Table helpers | table.md |
| Sort | sort.md |
| Pattern | Input / output | Parallelism |
|---|---|---|
| Map | Table → table | YT splits input across tasks |
| Map-reduce / reduce | Sorted or grouped table work | Map + reduce phases |
| Vanilla | None required | Single job |
| YQL | One or more tables → table | Query planner |
| S3 | Object store → table (typical) | Driver listing + cluster for follow-up |
| Table helpers | Driver-side Python | None on cluster |
| Sort | Table → sorted table | YT sort operation |
- Custom Python per row → map.
- Declarative SQL shape → YQL.
- No table contract → vanilla.
- Row stream → map.
S3 stages often feed map or YQL; compose them as separate stages or multiple operations in one stage (Multiple operations).
Extract / transform / load:
stages:
enabled_stages:
- extract_from_s3
- transform_data
- load_to_tableSetup / process / validate:
stages:
enabled_stages:
- setup_environment
- process_data
- validate_results