You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/docs/_generated/node-convert-options.mdx
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -28,7 +28,7 @@ description: Options for the Node.js convert function
28
28
|`imageDir`|`string`| - | Directory for extracted images |
29
29
|`pages`|`string`| - | Pages to extract (e.g., "1,3,5-7"). Default: all pages |
30
30
|`includeHeaderFooter`|`boolean`|`false`| Include page headers and footers in output |
31
-
|`detectStrikethrough`|`boolean`|`false`| Detect strikethrough text and wrap with ~~ in Markdown output (experimental) |
31
+
|`detectStrikethrough`|`boolean`|`false`| Detect strikethrough text and wrap with ~~ in Markdown output or <del></del> tag in HTML output (experimental)|
32
32
|`hybrid`|`string`|`"off"`| Hybrid backend (requires a running server). Quick start: pip install "opendataloader-pdf[hybrid]" && opendataloader-pdf-hybrid --port 5002. For remote servers use --hybrid-url. Values: off (default), docling-fast |
33
33
|`hybridMode`|`string`|`"auto"`| Hybrid triage mode. Values: auto (default, dynamic triage), full (skip triage, all pages to backend) |
34
34
|`hybridUrl`|`string`| - | Hybrid backend server URL (overrides default) |
Copy file name to clipboardExpand all lines: content/docs/_generated/python-convert-options.mdx
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -29,7 +29,7 @@ description: Options for the Python convert function
29
29
|`image_dir`|`str`| - | Directory for extracted images |
30
30
|`pages`|`str`| - | Pages to extract (e.g., "1,3,5-7"). Default: all pages |
31
31
|`include_header_footer`|`bool`|`False`| Include page headers and footers in output |
32
-
|`detect_strikethrough`|`bool`|`False`| Detect strikethrough text and wrap with ~~ in Markdown output (experimental) |
32
+
|`detect_strikethrough`|`bool`|`False`| Detect strikethrough text and wrap with ~~ in Markdown output or <del></del> tag in HTML output (experimental)|
33
33
|`hybrid`|`str`|`"off"`| Hybrid backend (requires a running server). Quick start: pip install "opendataloader-pdf[hybrid]" && opendataloader-pdf-hybrid --port 5002. For remote servers use --hybrid-url. Values: off (default), docling-fast |
34
34
|`hybrid_mode`|`str`|`"auto"`| Hybrid triage mode. Values: auto (default, dynamic triage), full (skip triage, all pages to backend) |
35
35
|`hybrid_url`|`str`| - | Hybrid backend server URL (overrides default) |
Copy file name to clipboardExpand all lines: content/docs/cli-options-reference.mdx
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -32,7 +32,7 @@ This page documents all available CLI options for opendataloader-pdf.
32
32
|`--image-dir`| - |`string`| - | Directory for extracted images |
33
33
|`--pages`| - |`string`| - | Pages to extract (e.g., "1,3,5-7"). Default: all pages |
34
34
|`--include-header-footer`| - |`boolean`|`false`| Include page headers and footers in output |
35
-
|`--detect-strikethrough`| - |`boolean`|`false`| Detect strikethrough text and wrap with ~~ in Markdown output (experimental) |
35
+
|`--detect-strikethrough`| - |`boolean`|`false`| Detect strikethrough text and wrap with ~~ in Markdown output or <del></del> tag in HTML output (experimental)|
36
36
|`--hybrid`| - |`string`|`"off"`| Hybrid backend (requires a running server). Quick start: pip install "opendataloader-pdf[hybrid]" && opendataloader-pdf-hybrid --port 5002. For remote servers use --hybrid-url. Values: off (default), docling-fast |
37
37
|`--hybrid-mode`| - |`string`|`"auto"`| Hybrid triage mode. Values: auto (default, dynamic triage), full (skip triage, all pages to backend) |
privatestaticfinalStringDETECT_STRIKETHROUGH_DESC = "Detect strikethrough text and wrap with ~~ in Markdown output (experimental)";
115
+
privatestaticfinalStringDETECT_STRIKETHROUGH_DESC = "Detect strikethrough text and wrap with ~~ in Markdown output or <del></del> tag in HTML output (experimental)";
0 commit comments