Extract Data From Scanned PDF Document

This bot extracts relevant information from any type of structured PDF documents, scanned documents, and PDF image documents.

Top Benefits

  • Bot improves Adaptability, Configurability and Simplicity.
  • Easy for anyone to configure. No need to depend on the technical team.
  • No code changes required even if the document template changes. Only configuration will change.

Tasks

  • Extract required n-number of data from scanned pdf/pdf documents in the excel file.
  • Supports any kind of structured document.
  • Data extraction support based on the generic XML configuration.
  • Get values through key and value search criteria.
  • If no key values are available, you can get value through the row numbers. 
  • We can get and confirm if any 'Key' exists in the entire document.
  • Current bot will support all PDF structured documents, Scanned documents, and Image scanned documents.
  • OCR support 'Tesseract'.

1.Get the list of structured PDF documents
2.Convert into txt files using OCR/PDF integration
3.Configure it based on the relevant required data in the XML
4.Run the bot and get the data into excel file

Get Bot

Free

Bot Security Program
Level 1
Applications
Business Process
Category
Downloads
1146
Vendor
Automation Type
Bot
Last Updated
July 24, 2020
First Published
November 19, 2018
Enterprise Version
10.7
Support

See the Bot in Action

PREV NEXT

Setup Process

Install

Download the Bot and follow the instructions to install it in your AAE Control Room.

Configure

Open the Bot to configure your username and other settings the Bot will need (see the Installation Guide or ReadMe for details.)

Run

That's it - now the Bot is ready to get going!

Requirements and Inputs

  • AAE 10.7 sp2 or higher, Microsoft Excel
  • Inputs: PDF documents/Scanned structured pdf documents/ Image PDF documents such as Account statements, Loan Disbursement Letter, ID cards etc.