Pega RPA - How to extract table content from standard PDF
Hi Everyone,
Objective: To extract table content in understandable view from a standard pdf using Pega Robotics Studio application.
Requirements: Bot will get table title '' to search for in given standard pdf and extract all the table in a veriable or excel etc which should be understandable to read column by column etc.
Attachments:
- file name: 'Table_automation.png' shows our automation which search for a title in general and get text line by line Issues: it will go until end of the pdf to extract all lines because can't match the table.
the bot also extract a paragraph with same title. - file name: 'table_content_output.png' is the bot output of the pdf table in one cell of the excel file and not structured as table to read the table content
- file name: ''Table_PDF.png' is the original table from pdf which is supposed to be extracted via BOT
we are using Pega robotics PDF connectors to extract text. we would really appreciate if there is any possible solution to extract table.
thanks
***Edited by Moderator: Lochan to update platform capability tags***