DetectDocumentText Processor
Part of the AWS Textract processor family
The DetectDocumentText processor will extract text from a given document, which can be either an image or a PDF document.
Properties
This processor does not have any unique properties outside of the common ones.
Data Output
If the Destination property is set to flowfile-attribute
, then the output of this processor will be routed to the FlowFile's ocr.DetectedText
attribute, which will be created if it isn't present.
Field Name
Data Type
Description
blocks
array of Block
The list of blocks returned from the API
Last updated
Was this helpful?