In the situation where the IDP stack is using Bedrock for OCR and also Bedrock for extraction, no prompt caching is used. Assuming the same LLM is used for each, this is an opportunity for improvement.
Implementing this would involve restructuring the OCR and Extraction prompts to start with the same prefix (probably something like: "here is an image of a page: {IMAGE}"), then adding a cache point immediately after.
I expect this will be easy to implement but perhaps more arduous to verify that it doesn't negatively impact accuracy.
https://deepwiki.com/search/if-i-use-bedrock-for-ocr-it-wi_3259f96e-c288-4288-b00c-80967d8708c1?mode=fast
In the situation where the IDP stack is using Bedrock for OCR and also Bedrock for extraction, no prompt caching is used. Assuming the same LLM is used for each, this is an opportunity for improvement.
Implementing this would involve restructuring the OCR and Extraction prompts to start with the same prefix (probably something like: "here is an image of a page: {IMAGE}"), then adding a cache point immediately after.
I expect this will be easy to implement but perhaps more arduous to verify that it doesn't negatively impact accuracy.
https://deepwiki.com/search/if-i-use-bedrock-for-ocr-it-wi_3259f96e-c288-4288-b00c-80967d8708c1?mode=fast