1. Brett Weaver
  2. PowerBuilder
  3. Wednesday, 3 April 2024 18:03 PM UTC

Hi Folks

We have a number of forms which are fillable PDF's which are completed by the general public and sent into us. I would like to automate the collection of the information from the PDF forms but I cannot seem to work out how to read the text from an existing PDF using the objects provided in PB2022R3.

Is it possible to read the text from an existing PDF using only PowerBuilder code? 

Cheers

Brett Weaver

Julie Jiang @Appeon Accepted Answer Pending Moderation
  1. Monday, 8 April 2024 06:49 AM UTC
  2. PowerBuilder
  3. # 1

Hi Brett,

FYI.  We have plan to support traversing and extracting values in the Form fields within PDF files in version 2025. Our engineering team is working on it.

Best regards, Julie

Comment
There are no comments made yet.
Miguel Leeuwe Accepted Answer Pending Moderation
  1. Thursday, 4 April 2024 08:30 AM UTC
  2. PowerBuilder
  3. # 2

PDFPig: https://uglytoad.github.io/PdfPig/

Write something in C# and import the DLL using the DLL import tool.

Comment
  1. Brett Weaver
  2. Thursday, 4 April 2024 17:35 PM UTC
Thanks for the suggestion. There are a lot of ways I can read the PDF if I use other technologies but I was trying to work out whether I can do it just with the PowerBuilder PDF objects. If I display the PDF in a Web Browser object I can Ctrl-A and Ctrl-C to copy all the text to the clipboard and use it from there for example.- I then have the job of finding the message numbers to Send to accomplish the Ctrl-A and Ctrl-C inside the program but I'm pretty sure its do-able.
  1. Helpful
  1. Miguel Leeuwe
  2. Friday, 5 April 2024 08:17 AM UTC
YW
  1. Helpful
There are no comments made yet.
Brett Weaver Accepted Answer Pending Moderation
  1. Wednesday, 3 April 2024 18:39 PM UTC
  2. PowerBuilder
  3. # 3

Hi Chris,

Thanks for getting back.

I can cope with all of the text, as I can identify the questions and isolate answers. This would be my preferred option.

If I could just read answers that would be OK too.

Cheers

Brett

 

 

Comment
  1. Brett Weaver
  2. Thursday, 4 April 2024 17:38 PM UTC
Thanks for your comment. I find it a bit weird that we are having to do so much stuff with PDF's nowadays. Instead of Websites, API's, JSON, XML etc. there seems to be a lot of data transfer happening on PDF's.
  1. Helpful
  1. Sivaprakash BKR
  2. Monday, 8 April 2024 12:34 PM UTC
Add, digitally signing the pdf to the requirement list.
  1. Helpful
  1. Andreas Mykonios
  2. Monday, 8 April 2024 13:20 PM UTC
I agree...
  1. Helpful
There are no comments made yet.
Chris Pollach @Appeon Accepted Answer Pending Moderation
  1. Wednesday, 3 April 2024 18:24 PM UTC
  2. PowerBuilder
  3. # 4

Hi Brett ;

  Do you mean only the text from the data entry fields?

Regards ... Chris

Comment
There are no comments made yet.
  • Page :
  • 1


There are no replies made for this question yet.
However, you are not allowed to reply to this question.