You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+51-2Lines changed: 51 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -848,16 +848,17 @@ Data can be verified by executing these checks through a data quality engine.
848
848
849
849
Quality attributes can be:
850
850
- A text in natural language that describes the quality of the data.
851
+
- A predefined metric from the library of commonly used metrics
851
852
- An individual SQL query that returns a single value that can be compared.
852
853
- Engine-specific types: Pre-defined quality checks, as defined by data quality libraries. Currently, the engines `soda` and `great-expectations` are supported.
853
854
854
-
A quality object can be specified on field level and on model level.
855
+
A quality object can be specified on the field level and on the model level.
855
856
The top-level quality object is deprecated.
856
857
857
858
#### Description Text
858
859
859
860
A description in natural language that defines the expected quality of the data.
860
-
This is useful to express requirements or expectation when discussing the data contract with stakeholders.
861
+
This is useful to express requirements or expectations when discussing the data contract with stakeholders.
861
862
Later in the development process, these might be translated into an executable check (such as `sql`).
862
863
It can also be used as a prompt to check the data with an AI engine.
863
864
@@ -929,6 +930,54 @@ models:
929
930
SQL queries allow powerful checks for custom business logic.
930
931
A SQL query should run not longer than 10 minutes.
931
932
933
+
#### Library / Metrics
934
+
935
+
A set of predefined metrics commonly used in data quality checks, designed to be compatible with all major data quality engines. This simplifies the work for data engineers by eliminating the need to manually write SQL queries.
0 commit comments