typo removal

JorjMcKie · JorjMcKie · commit d3735f193126 · 2026-01-11T13:43:34.000-04:00
diff --git a/docs/pymupdf4llm/api.rst b/docs/pymupdf4llm/api.rst
@@ -146,10 +146,10 @@ The |PyMuPDF4LLM| API
         - **"page_boxes"** - |PyMuPDFLayoutMode_Valid| a list of dictionaries representing the layout boundary boxes. Each dictionary has the following structure::
 
             {
-                "index": 0-based integer index of the box in reading sequence
-                "class": str,  # one of "text", "picture", "table", etc.
+                "index": int,              # 0-based integer index of the box in reading sequence
+                "class": str,              # one of "text", "picture", "table", etc.
                 "bbox": [x0, y0, x1, y1],  # boundary box coordinates
-                "pos": (start, stop)  # 0-based integers: bbox_text = chunk["text"][start:stop]
+                "pos": (start, stop),      # 0-based integers: bbox_text = chunk["text"][start:stop]
             }
 
     :arg float page_height: specify a desired page height. For relevance see the `page_width` parameter. If using the default `None`, the document will appear as one large page with a width of `page_width`. Consequently in this case, no markdown page separators will occur (except the final one), respectively only one page chunk will be returned.
@@ -216,10 +216,10 @@ The |PyMuPDF4LLM| API
         - **"page_boxes"** - a list of dictionaries representing the layout boundary boxes. Each dictionary has the following structure::
 
             {
-                "index": 0-based integer index of the box in reading sequence
-                "class": str,  # one of "text", "picture", "table", etc.
+                "index": int,              # 0-based integer index of the box in reading sequence
+                "class": str,              # one of "text", "picture", "table", etc.
                 "bbox": [x0, y0, x1, y1],  # boundary box coordinates
-                "pos": (start, stop)  # 0-based integers: bbox_text = chunk["text"][start:stop]
+                "pos": (start, stop),      # 0-based integers: bbox_text = chunk["text"][start:stop]
             }
 
 
@@ -267,8 +267,8 @@ The |PyMuPDF4LLM| API
         {
             "field_name": str,      # the full name of the form field, components separated by dots
             {
-                "value": str        # the field value as string
-                "pages": list       # list of 0-based page numbers where the field appears
+                "value": str,       # the field value as string
+                "pages": list,      # list of 0-based page numbers where the field appears
             }
         }    
 
diff --git a/docs/pymupdf4llm/index.rst b/docs/pymupdf4llm/index.rst
@@ -40,7 +40,7 @@ Functionality
 
 - Standard text and tables are detected, brought in the right reading sequence and then together converted to **GitHub**-compatible **Markdown** text. Tables in plain text output mode are rendered using the `tabulate <https://pypi.org/project/tabulate/>`_ package.
 
-- Header lines are identified via the font size and appropriately prefixed with one or more `#` tags. When using the package together with :ref:`PyMuPDF Layout <https://pypi.org/project/pymupdf-layout/>`_, titels, section headers and page headers and footers are detected.
+- Header lines are identified via the font size and appropriately prefixed with one or more `#` tags. When using the package together with :ref:`PyMuPDF Layout <https://pypi.org/project/pymupdf-layout/>`_, titles, section headers and page headers and footers are detected.
 
 - Bold, italic, mono-spaced text and code blocks are detected and formatted accordingly. Similar applies to ordered and unordered lists.
 

Original file line number	Diff line number	Diff line change
`@@ -146,10 +146,10 @@ The \|PyMuPDF4LLM\| API`
`146`	`146`	`- "page_boxes" - \|PyMuPDFLayoutMode_Valid\| a list of dictionaries representing the layout boundary boxes. Each dictionary has the following structure::`
`147`	`147`
`148`	`148`	`{`
`149`		`- "index": 0-based integer index of the box in reading sequence`
`150`		`- "class": str, # one of "text", "picture", "table", etc.`
	`149`	`+ "index": int, # 0-based integer index of the box in reading sequence`
	`150`	`+ "class": str, # one of "text", "picture", "table", etc.`
`151`	`151`	`"bbox": [x0, y0, x1, y1], # boundary box coordinates`
`152`		`- "pos": (start, stop) # 0-based integers: bbox_text = chunk["text"][start:stop]`
	`152`	`+ "pos": (start, stop), # 0-based integers: bbox_text = chunk["text"][start:stop]`
`153`	`153`	`}`
`154`	`154`
`155`	`155`	:arg float page_height: specify a desired page height. For relevance see the `page_width` parameter. If using the default `None`, the document will appear as one large page with a width of `page_width`. Consequently in this case, no markdown page separators will occur (except the final one), respectively only one page chunk will be returned.
`@@ -216,10 +216,10 @@ The \|PyMuPDF4LLM\| API`
`216`	`216`	`- "page_boxes" - a list of dictionaries representing the layout boundary boxes. Each dictionary has the following structure::`
`217`	`217`
`218`	`218`	`{`
`219`		`- "index": 0-based integer index of the box in reading sequence`
`220`		`- "class": str, # one of "text", "picture", "table", etc.`
	`219`	`+ "index": int, # 0-based integer index of the box in reading sequence`
	`220`	`+ "class": str, # one of "text", "picture", "table", etc.`
`221`	`221`	`"bbox": [x0, y0, x1, y1], # boundary box coordinates`
`222`		`- "pos": (start, stop) # 0-based integers: bbox_text = chunk["text"][start:stop]`
	`222`	`+ "pos": (start, stop), # 0-based integers: bbox_text = chunk["text"][start:stop]`
`223`	`223`	`}`
`224`	`224`
`225`	`225`
`@@ -267,8 +267,8 @@ The \|PyMuPDF4LLM\| API`
`267`	`267`	`{`
`268`	`268`	`"field_name": str, # the full name of the form field, components separated by dots`
`269`	`269`	`{`
`270`		`- "value": str # the field value as string`
`271`		`- "pages": list # list of 0-based page numbers where the field appears`
	`270`	`+ "value": str, # the field value as string`
	`271`	`+ "pages": list, # list of 0-based page numbers where the field appears`
`272`	`272`	`}`
`273`	`273`	`}`
`274`	`274`