You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: skills/fabric-lakehouse/SKILL.md
+15-15Lines changed: 15 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,28 +24,28 @@ Lakehouse in Microsoft Fabric is an item that gives users a place to store their
24
24
25
25
### Key Components
26
26
27
-
-**Delta Tables** Managed tables with ACID compliance and schema enforcement
28
-
-**Files** Unstructured/semi-structured data in the Files section
29
-
-**SQL Endpoint** Auto-generated read-only SQL interface for querying
30
-
-**Shortcuts** Virtual links to external/internal data without copying
31
-
-**Fabric Materialized Views** Pre-computed tables for fast query performance
27
+
-**Delta Tables**: Managed tables with ACID compliance and schema enforcement
28
+
-**Files**: Unstructured/semi-structured data in the Files section
29
+
-**SQL Endpoint**: Auto-generated read-only SQL interface for querying
30
+
-**Shortcuts**: Virtual links to external/internal data without copying
31
+
-**Fabric Materialized Views**: Pre-computed tables for fast query performance
32
32
33
33
### Tabular data in a Lakehouse
34
34
35
35
Tabular data in a form of tables are stored under "Tables" folder. Main format for tables in Lakehouse is Delta. Lakehouse can store tabular data in other formats like CSV or Parquet, these formats only available for Spark querying.
36
-
Tables can be internal, when data is stored under "Tables" folder" or external, when only reference to a table is stored under "Tables" folder but the data itself is stored in a referenced location. Referecing tables are done through Shortcuts, which can be internal, pointing to other location in Fabric, or external pointing to data stored outside of Fabric.
36
+
Tables can be internal, when data is stored under "Tables" folder or external, when only reference to a table is stored under "Tables" folder but the data itself is stored in a referenced location. Referencing tables are done through Shortcuts, which can be internal, pointing to other location in Fabric, or external pointing to data stored outside of Fabric.
37
37
38
38
### Schemas for tables in a Lakehouse
39
39
40
-
When creating a lakehouse user can choose to enable schemas. Schemas are used to organize Lakehouse tables. Schemas are implemented as folders under "Tables" folder and store tables inside of those folders. Default schema is "dbo" and it can't be deleted or renamed. All other schemas are optional and can be created, renamed, or deleted. User can reference schema located in other lakehouse using Schema Shortcut that way referincing all tables with one shortcut that are at the destination schema.
40
+
When creating a lakehouse user can choose to enable schemas. Schemas are used to organize Lakehouse tables. Schemas are implemented as folders under "Tables" folder and store tables inside of those folders. Default schema is "dbo" and it can't be deleted or renamed. All other schemas are optional and can be created, renamed, or deleted. User can reference schema located in other lakehouse using Schema Shortcut that way referencing all tables with one shortcut that are at the destination schema.
41
41
42
42
### Files in a Lakehouse
43
43
44
-
Files are stored uner "Files" folder. Users can create folders and subfolders to organize their files. Any file format can be stored in Lakehosue.
44
+
Files are stored under "Files" folder. Users can create folders and subfolders to organize their files. Any file format can be stored in Lakehouse.
45
45
46
46
### Fabric Materialized Views
47
47
48
-
Set of pre-computed tables that are automatically updated based on schedule. They provide fast query performance for complex aggregations and joins. Materialized views are defined using PySpark or Spark SQL stored in asociated Notebook.
48
+
Set of pre-computed tables that are automatically updated based on schedule. They provide fast query performance for complex aggregations and joins. Materialized views are defined using PySpark or Spark SQL stored in associated Notebook.
49
49
50
50
### Spark Views
51
51
@@ -59,7 +59,7 @@ User can have workspace roles (Admin, Member, Contributor, Viewer) that provide
59
59
60
60
### Data access or OneLake Security
61
61
62
-
For data access use OneLake security model, which is based on Azure Active Directory (AAD) and role-based access control (RBAC). Lakehouse data is stored in OneLake, so access to data is controlled through OneLake permissions. In adition to object-level permissions, Lakehouse also supports column-level and row-level security for tables, allowing fine-grained control over who can see specific columns or rows in a table.
62
+
For data access use OneLake security model, which is based on Microsoft Entra ID (formerly Azure Active Directory) and role-based access control (RBAC). Lakehouse data is stored in OneLake, so access to data is controlled through OneLake permissions. In addition to object-level permissions, Lakehouse also supports column-level and row-level security for tables, allowing fine-grained control over who can see specific columns or rows in a table.
63
63
64
64
65
65
## Lakehouse Shortcuts
@@ -68,11 +68,11 @@ Shortcuts create virtual links to data without copying:
68
68
69
69
### Types of Shortcuts
70
70
71
-
-**Internal** Link to other Fabric Lakehouses/tables, cross-workspace data sharing
72
-
-**ADLS Gen2** Azure Data Lake Storage Gen2 external Azure storage
73
-
-**Amazon S3** AWS S3 buckets, cross-cloud data access
74
-
-**Dataverse** Microsoft Dataverse, business application data
75
-
-**Google Cloud Storage** GCS buckets, cross-cloud data access
71
+
-**Internal**: Link to other Fabric Lakehouses/tables, cross-workspace data sharing
72
+
-**ADLS Gen2**: Azure Data Lake Storage Gen2 external Azure storage
73
+
-**Amazon S3**: AWS S3 buckets, cross-cloud data access
74
+
-**Dataverse**: Microsoft Dataverse, business application data
75
+
-**Google Cloud Storage**: GCS buckets, cross-cloud data access
0 commit comments