AnalyzeDocument Processor

Part of the AWS Textract processor family

The AnalyzeDocument processor will search a document for text, forms, and tables. You can tell the processor to search for specific things by modifying the Feature Types property.

Properties

All of our Textract processors also include these common properties.

Properties whose names are in bold and italic are required.

  • Feature Types- a dropdown list that tells controls what the API is looking for. It can be set to one of the following:

    • tables - the API will search for text, as well as tables

    • forms - the API will search for text, as well as forms, which are areas where a user would be expected to input information

    • tables-and-forms - the API will search for text, tables, and forms

Data Output

If the Destination property is set to flowfile-attribute, then the output of this processor will be routed to the FlowFile's ocr.AnalyzedDocument attribute, which will be created if it isn't present.

Field Name

Data Type

Description

blocks

array of Block

The list of blocks returned from the API

Last updated