Skip to content

Commit b77fa3d

Browse files
committed
typo removal
1 parent 11e26ac commit b77fa3d

2 files changed

Lines changed: 9 additions & 9 deletions

File tree

docs/pymupdf4llm/api.rst

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -146,10 +146,10 @@ The |PyMuPDF4LLM| API
146146
- **"page_boxes"** - |PyMuPDFLayoutMode_Valid| a list of dictionaries representing the layout boundary boxes. Each dictionary has the following structure::
147147

148148
{
149-
"index": 0-based integer index of the box in reading sequence
150-
"class": str, # one of "text", "picture", "table", etc.
149+
"index": int, # 0-based integer index of the box in reading sequence
150+
"class": str, # one of "text", "picture", "table", etc.
151151
"bbox": [x0, y0, x1, y1], # boundary box coordinates
152-
"pos": (start, stop) # 0-based integers: bbox_text = chunk["text"][start:stop]
152+
"pos": (start, stop), # 0-based integers: bbox_text = chunk["text"][start:stop]
153153
}
154154

155155
:arg float page_height: specify a desired page height. For relevance see the `page_width` parameter. If using the default `None`, the document will appear as one large page with a width of `page_width`. Consequently in this case, no markdown page separators will occur (except the final one), respectively only one page chunk will be returned.
@@ -216,10 +216,10 @@ The |PyMuPDF4LLM| API
216216
- **"page_boxes"** - a list of dictionaries representing the layout boundary boxes. Each dictionary has the following structure::
217217

218218
{
219-
"index": 0-based integer index of the box in reading sequence
220-
"class": str, # one of "text", "picture", "table", etc.
219+
"index": int, # 0-based integer index of the box in reading sequence
220+
"class": str, # one of "text", "picture", "table", etc.
221221
"bbox": [x0, y0, x1, y1], # boundary box coordinates
222-
"pos": (start, stop) # 0-based integers: bbox_text = chunk["text"][start:stop]
222+
"pos": (start, stop), # 0-based integers: bbox_text = chunk["text"][start:stop]
223223
}
224224

225225

@@ -267,8 +267,8 @@ The |PyMuPDF4LLM| API
267267
{
268268
"field_name": str, # the full name of the form field, components separated by dots
269269
{
270-
"value": str # the field value as string
271-
"pages": list # list of 0-based page numbers where the field appears
270+
"value": str, # the field value as string
271+
"pages": list, # list of 0-based page numbers where the field appears
272272
}
273273
}
274274

docs/pymupdf4llm/index.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ Functionality
4040

4141
- Standard text and tables are detected, brought in the right reading sequence and then together converted to **GitHub**-compatible **Markdown** text. Tables in plain text output mode are rendered using the `tabulate <https://pypi.org/project/tabulate/>`_ package.
4242

43-
- Header lines are identified via the font size and appropriately prefixed with one or more `#` tags. When using the package together with :ref:`PyMuPDF Layout <https://pypi.org/project/pymupdf-layout/>`_, titels, section headers and page headers and footers are detected.
43+
- Header lines are identified via the font size and appropriately prefixed with one or more `#` tags. When using the package together with :ref:`PyMuPDF Layout <https://pypi.org/project/pymupdf-layout/>`_, titles, section headers and page headers and footers are detected.
4444

4545
- Bold, italic, mono-spaced text and code blocks are detected and formatted accordingly. Similar applies to ordered and unordered lists.
4646

0 commit comments

Comments
 (0)