Commit f6014e3

Merge pull request #81 from marancibia/main
Update Oracle Delta Sharing CDF.ipynb
2 parents 7a265d7 + b99a31f commit f6014e3

1 file changed: +1 -9 lines changed

delta-sharing/change-data-feed/Oracle Delta Sharing CDF.ipynb

Lines changed: 1 addition & 9 deletions
@@ -18,14 +18,6 @@
 "This notebook reads **Change Data Feed (CDF)** from an Oracle Autonomous Database\n",
 "that publishes a Delta Sharing endpoint, and displays the raw change rows.\n",
 "\n",
-"## Why a custom REST approach?\n",
-"\n",
-"| Problem | Root cause | Workaround |\n",
-"|---|---|---|\n",
-"| `spark.read.format(\"deltaSharing\")` throws `InvocationTargetException` | Spark's Java Delta Sharing connector is incompatible with Oracle endpoints on serverless compute | Use the Python `delta_sharing` REST client instead |\n",
-"| `load_table_changes_as_pandas()` throws `KeyError: '_commit_timestamp'` | Oracle's file-level CDF omits the `_commit_timestamp` column that the library expects | Call the REST API directly and parse with `DeltaSharingReader._to_pandas()` |\n",
-"| `spark.read.parquet(*urls)` throws `UNSUPPORTED_FILE_SYSTEM` | Spark can't read HTTPS pre-signed URLs from Oracle object storage | Download via HTTP with `_to_pandas()`, then convert to Spark DataFrame |\n",
-"\n",
 "## Oracle's file-level CDF\n",
 "\n",
 "Oracle implements CDF at the **file level**, not the row level. When *any* row in a\n",
@@ -247,4 +239,4 @@
 },
 "nbformat": 4,
 "nbformat_minor": 0
-}
+}
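
For context on the table this commit removes: its second row describes calling the Delta Sharing REST API directly instead of going through `load_table_changes_as_pandas()`. The sketch below is not the notebook's code. It is a minimal illustration, assuming the table-changes response is newline-delimited JSON whose file actions appear under `add`, `cdf`, or `remove` keys (as in the open Delta Sharing protocol). The function name `list_change_files` and the sample URLs are hypothetical. It only lists the change files and tolerates a missing timestamp; downloading and parsing the Parquet files (for example with the private `DeltaSharingReader._to_pandas()` helper the table mentions) is left out because its signature is not shown in this diff.

```python
import json

# File-action keys assumed to appear in a table-changes response.
CHANGE_KINDS = ("add", "cdf", "remove")


def list_change_files(ndjson_text):
    """Return one dict per file action found in an NDJSON response body.

    Lines carrying other keys (e.g. "protocol", "metaData") are skipped.
    Fields that an endpoint omits come back as None instead of raising.
    """
    files = []
    for line in ndjson_text.splitlines():
        line = line.strip()
        if not line:
            continue
        obj = json.loads(line)
        for kind in CHANGE_KINDS:
            if kind in obj:
                action = obj[kind]
                files.append({
                    "kind": kind,
                    "url": action.get("url"),
                    "version": action.get("version"),
                    # .get() avoids a KeyError when the timestamp is absent
                    "timestamp": action.get("timestamp"),
                })
    return files


# Hypothetical response body; the URLs are placeholders, not real endpoints.
sample_response = "\n".join([
    json.dumps({"protocol": {"minReaderVersion": 1}}),
    json.dumps({"metaData": {"id": "t1", "schemaString": "{}"}}),
    json.dumps({"add": {"url": "https://example.invalid/f1.parquet",
                        "id": "f1", "version": 3}}),
    json.dumps({"remove": {"url": "https://example.invalid/f0.parquet",
                           "id": "f0", "version": 3,
                           "timestamp": 1700000000000}}),
])

change_files = list_change_files(sample_response)
for f in change_files:
    print(f["kind"], f["version"], f["timestamp"], f["url"])
```

Reading optional fields with `.get()` is one way to sidestep the kind of `KeyError` the removed table attributes to a missing `_commit_timestamp`; whether the notebook handles it this way is not visible in this diff.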
