# Textract

Category: AWS
Runtime: 0.9+
Available: Free

# Overview

Textract node by Kumologica integrates with Amazon Textract (opens new window) service for analyzing an input document for relationships between the detected items. The node helps in parse the Textract result into a structured format for processing.

# Setup

# Prerequisites

For Textract node to work properly, you need to have a valid subscription of AWS cloud account. Ensure to have an Access credentials (opens new window) generated for the node to connect with the Textract service associated with the account.

# Installation

To install the node from designer:

  1. Go to Add more nodes option on the palette section.
  2. Check for the Amazon Textract node and click install.
  3. Once installed, the designer is required to be restarted.

To install the node from CLI:

  1. Go to your project workspace where you can see your package.json file.
  2. Run the below give npm command.
    npm i @kumologica/kumologica-contrib-awstextract

# Technical Details


  • Type (Optional)
    Option to extract response for FORM or for TABLES.
  • Content (Optional)
    Buffer content for the textract service to process.
  • S3 Bucket (Optional)
    S3 bucket name where the file to process resides. S3 bucket is to be provided only if content field is not provided.
  • File Name (Optional)
    Name of the file in S3 bucket to process.
  • Access Key (Optional)
    AWS access Key for connecting the textract service.
  • Access Secret (Optional)
    AWS access secret for connecting the textract service.
  • Region (Optional)
    Region in AWS for connecting the textract service. eg: ap-southeast-2.
  • Request Timeout (Optional)
    Sets the socket to timeout after timeout milliseconds of inactivity on the socket. Defaults to two minutes (120000)

Supported Operations



Textract node throws the following exception.

  • AWS Textract Client Failed


Payload will be overwritten by this node. None of the variables will be overwritten or enriched by this node. The response payload is always the parsed JSON response of the Textract result.

# Develop a Service

Video tutorial