|
for paragraph in document.paragraphs: |
We are just starting to roll out a production version of this application. A user uploaded a word doc that has all their text in tables. The batch was not picking up any text at all. I realized that the code to get the text from word documents is only loading the document.paragraphs objects from word documents which means it is skipping anything that is in a table in a word document.
chat-with-your-data-solution-accelerator/code/backend/batch/utilities/document_loading/word_document.py
Line 35 in a9997b0
We are just starting to roll out a production version of this application. A user uploaded a word doc that has all their text in tables. The batch was not picking up any text at all. I realized that the code to get the text from word documents is only loading the document.paragraphs objects from word documents which means it is skipping anything that is in a table in a word document.