Skip to content

File OCR

The File OCR worker uses Google Document AI features to extract information from images and PDF files and returns a table to the workflow. Currently this worker only uses the Table Extraction resource, used to extract tables from images and PDFs.

Parameters

The File OCR worker must receive specific parameters in order to execute properly. When parameters are not set correctly it may cause the worker to not work as expected.

Operation

Select the File OCR operation using Fixed Value variant. Currently the field is disabled and set to Table Extraction.

File

Select the file to be parsed using Fixed File, Upstream Data or Get Latest variants.

Result

Once the worker finishes its executions successfully, it will return a result object containing the merged portfolio to the workflow, which can be used by downstream workers. Below you can see an example of the File OCR's result object hierarchy.

  • OCR Data (object)
  • Table (list of lists)