Amazon Textract Response Parser library for TypeScript.
This is a TypeScript port of the Python TRP script from AWS.
npm install @scribelabsai/amazon-trp
- Load the blocks into a document (e.g. using GetDocumentAnalysisCommand from @aws-sdk/client-textract)
import { TextractClient, GetDocumentAnalysisCommand } from '@aws-sdk/client-textract';
import { Document } from '@scribelabsai/amazon-trp';
import type { BlockStruct } from '@scribelabsai/amazon-trp';
const client = new TextractClient();
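// Fetch the results of a completed asynchronous analysis job
const resp = await client.send(new GetDocumentAnalysisCommand({ JobId: 'MY_JOBID' }));
// The SDK response exposes the blocks as `Blocks` (capitalized)
const doc = new Document(resp.Blocks as BlockStruct[]);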
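For larger documents, Textract may split the results of a job across several responses, each carrying a NextToken for the following one. The sketch below shows one way to gather every block before building the document. `collectAllBlocks` and `BlockPage` are hypothetical names introduced here for illustration; they are not part of this library or the AWS SDK, and the helper only assumes that each response has the optional `Blocks` and `NextToken` fields.

```typescript
// Minimal shape of one page of results. It mirrors the optional `Blocks` and
// `NextToken` fields of the SDK response (assumption: no other field is needed).
interface BlockPage<B> {
  Blocks?: B[];
  NextToken?: string;
}

// Hypothetical helper: calls `fetchPage` repeatedly, passing the previous
// NextToken, until a response arrives without one.
async function collectAllBlocks<B>(
  fetchPage: (nextToken?: string) => Promise<BlockPage<B>>,
): Promise<B[]> {
  const blocks: B[] = [];
  let nextToken: string | undefined = undefined;
  do {
    const page: BlockPage<B> = await fetchPage(nextToken);
    for (const block of page.Blocks ?? []) {
      blocks.push(block);
    }
    nextToken = page.NextToken;
  } while (nextToken);
  return blocks;
}

// Possible usage with the client from the example above:
// const blocks = await collectAllBlocks((NextToken) =>
//   client.send(new GetDocumentAnalysisCommand({ JobId: 'MY_JOBID', NextToken })),
// );
// const doc = new Document(blocks as BlockStruct[]);
```

Passing the fetch step in as a callback keeps the helper independent of the SDK, so it can be exercised with fake pages.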
- Do something with the document (e.g. getting tables)
doc.pages.forEach((p) => {
  p.tables.forEach((t) => {
    // Do something with the table
  });
});
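As an illustration of "doing something with the table", the sketch below flattens a table into a plain array of cell strings. It assumes that tables expose `rows`, rows expose `cells`, and cells expose `text`, as in the Python TRP; whether this port uses the same property names should be checked against its type definitions. `tableToRows` and the `*Like` interfaces are hypothetical names used only for this example.

```typescript
// Structural types for the sketch. ASSUMPTION: property names follow the
// Python TRP (table.rows, row.cells, cell.text).
interface CellLike {
  text: string;
}
interface RowLike {
  cells: CellLike[];
}
interface TableLike {
  rows: RowLike[];
}

// Hypothetical helper: converts a table into an array of rows, where each row
// is an array of trimmed cell strings.
function tableToRows(table: TableLike): string[][] {
  return table.rows.map((row) => row.cells.map((cell) => cell.text.trim()));
}

// Possible usage inside the loop above (if the assumed shape holds):
// const rows = tableToRows(t);
```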
MIT, see LICENSE file.