-
Notifications
You must be signed in to change notification settings - Fork 66
Getting Started
Versatile Data Kit (VDK) is a data framework that enables Data Engineers to
🧑💻 develop,
📊 and manage data workloads, aka data jobs
- Ingest data from different sources.
- Use Python/SQL and VDK templates to transform data.
- Package, version, and deploy data applications while dealing with credentials, retries, and reconnects.
- Provide built-in monitoring and smart notification capabilities.
- Track code and data modifications for quicker troubleshooting and version rollback.
See our introduction blog post
➡️ See the Installation for more details.
➡️ Develop your First Data Job if you are impatient to start quickly.
➡️ Get your hands on interesting examples.
➡️ See Versatile Data Kit interfaces for a list of interfaces.
➡️ User Guide is the most detailed documentation for data engineers and scientists wanting to use VDK. If you have a question and the answer is unclear, tell us (open an issue), and we will update it.
➡️ Check details on how to deploy the VDK backend (Control Service) in this example here
➡️ Terms Dictionary contains definitions of all terms used in the documentation.
SDK - Develop Data Jobs
SDK Key Concepts
Control Service - Deploy Data Jobs
Control Service Key Concepts
- Scheduling a Data Job for automatic execution
- Deployment
- Execution
- Production
- Properties and Secrets
Operations UI
Community
Contacts