This repository contains examples of how to extract insights from Textract output. The examples show how to extract headers, paragraphs, and footers based on font size, indentation, paragraph endings, and line separators.
***Note: This is not a solution for all types of paragraphs or text segments.***
The examples are based on some of the files we have worked with, and they only give guidance on how to use the metadata provided by Textract.
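
As a rough illustration of using that metadata, the sketch below labels Textract `LINE` blocks as header, footer, or body and then groups body lines into paragraphs. It is not the code from this repository. It assumes a single-page, single-column response, uses bounding-box height as a stand-in for font size, and relies on hypothetical thresholds (`header_ratio`, `footer_top`, `gap_ratio`) that would need tuning per document. The function names are illustrative.

```python
from statistics import median


def classify_lines(response, header_ratio=1.3, footer_top=0.9):
    """Label each LINE block in a Textract response as header, footer, or body.

    Assumptions (not from the repository): one page, one column, and
    bounding-box height used as a proxy for font size.
    """
    lines = [b for b in response.get("Blocks", []) if b.get("BlockType") == "LINE"]
    if not lines:
        return []
    # The median line height approximates the body text size.
    typical = median(b["Geometry"]["BoundingBox"]["Height"] for b in lines)
    labeled = []
    for block in lines:
        box = block["Geometry"]["BoundingBox"]
        if box["Top"] >= footer_top:
            label = "footer"  # near the bottom edge of the page
        elif box["Height"] >= header_ratio * typical:
            label = "header"  # noticeably taller than body text
        else:
            label = "body"
        labeled.append({
            "label": label,
            "text": block.get("Text", ""),
            "top": box["Top"],
            "left": box["Left"],
            "height": box["Height"],
        })
    return labeled


def group_paragraphs(labeled, gap_ratio=1.0):
    """Join consecutive body lines, starting a new paragraph at a large vertical gap."""
    paragraphs, current, previous = [], [], None
    for line in labeled:
        if line["label"] != "body":
            continue
        if previous is not None:
            # Gap between the bottom of the previous line and the top of this one.
            gap = line["top"] - (previous["top"] + previous["height"])
            if gap > gap_ratio * previous["height"] and current:
                paragraphs.append(" ".join(current))
                current = []
        current.append(line["text"])
        previous = line
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs
```

Indentation could be handled in a similar way by comparing each line's `left` value with the typical left margin, at the cost of another threshold to tune.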
Once the segments are identified, we use Amazon Comprehend to get their sentiment.
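
The sentiment step might look something like the following minimal sketch, which sends each extracted segment to the Comprehend `detect_sentiment` API through boto3. The helper name `segment_sentiment` and its parameters are hypothetical. The sketch assumes AWS credentials and a region are already configured, that the text is in English, and that each segment fits within the per-request size limit of `detect_sentiment`.

```python
def segment_sentiment(segments, client=None, language_code="en"):
    """Return the Comprehend sentiment for each non-empty text segment.

    `client` is expected to behave like boto3.client("comprehend");
    passing one in explicitly makes the function easy to test offline.
    """
    if client is None:
        import boto3  # imported lazily so the function can run with a stub client

        client = boto3.client("comprehend")
    results = []
    for text in segments:
        if not text.strip():
            continue  # skip empty segments rather than sending them to the API
        response = client.detect_sentiment(Text=text, LanguageCode=language_code)
        results.append({
            "text": text,
            "sentiment": response["Sentiment"],
            "scores": response["SentimentScore"],
        })
    return results
```

For many segments, a batch call may reduce the number of requests, but calling the API once per segment keeps the sketch simple.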
This library is licensed under the MIT-0 License. See the LICENSE file.