I used the methods to retrieve PDF text, but text that is in tabular form is not retrieved as a table.
The PDFs generated by the iTextSharp utility are incomprehensible. My workaround was to use the iTextSharp utility within the Pega scripts to read the text, which comes back as plain text.
There is no method that directly returns a tabular report contained in a PDF. Additional manipulation is needed to read the table lines and map them to the respective table schema, using spaces as the delimiter. This is only a workaround and not a guaranteed way to map the table rows, because the amount of spacing varies whenever a row contains a variable-length text field such as a name.
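To illustrate the workaround, here is a minimal Python sketch of the mapping step, applied to plain text that has already been extracted. It is not the Pega or iTextSharp code itself. The column names in `SCHEMA`, the function name `parse_table_lines`, and the rule that two or more spaces separate columns are all assumptions made for the example; real extracted text may need a different rule.

```python
import re

# Hypothetical column names; replace with the real table schema.
SCHEMA = ["id", "name", "amount"]


def parse_table_lines(text, schema=SCHEMA):
    """Map space-aligned lines of extracted PDF text to dict rows.

    Returns (rows, rejected): rows that matched the schema, and the raw
    lines that did not split into the expected number of columns.
    """
    rows, rejected = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Assumption: columns are separated by runs of two or more
        # whitespace characters, so single spaces inside a name survive.
        cells = re.split(r"\s{2,}", line)
        if len(cells) == len(schema):
            rows.append(dict(zip(schema, cells)))
        else:
            # Spacing did not match the assumption; keep for manual review.
            rejected.append(line)
    return rows, rejected
```

Lines whose spacing does not produce the expected number of columns are returned separately instead of being guessed at, which makes the weakness described above visible: a row where the extractor emitted only single spaces between columns cannot be split reliably.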
Please suggest whether there is any way to read tabular data from a PDF with its table structure intact.
Posted: 27 Jul 2018 21:37 EDT
Andrew Grondin (grona)
GCS Solutions Engineer I