Skip to content

repositories Search Results · repo:GDJJJ/PDFtoLateXorDoc language:Python

Filter by

0 files  (255 ms)

0 files

inGDJJJ/PDFtoLateXorDoc (press backspace or delete to remove)

在科技项目申报书文档处理场景中,大量文档以图像格式的PDF存在,通过OCR识别、文本处理和文档合并等步骤提取科技项目图像文档中的文字信息以及表格、图像、公式等非文本信息,通过调用布局检测、大语言模型实现了图像、多层级标题等结构化信息的还原。
Package icon

Sponsor open source projects you depend on

Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projects
ProTip! Press the / key to activate the search input again and adjust your query.
Package icon

Sponsor open source projects you depend on

Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projects
ProTip! Press the / key to activate the search input again and adjust your query.