How to retrieve text content line by line using PDFium?

Parthipan R

Aug 30, 2018, 6:58:14 AM
to pdfium
I tried to retrieve the text content of a page line by line using the FPDFText_GetBoundedText API, but it returns unwanted characters if the bounding rectangle contains the characters "y p". Can anyone please advise on how to overcome this behavior?

Ryan Harrison

Aug 30, 2018, 10:31:29 AM
to rpart...@gmail.com, pdfium
That sounds like it could be a bug, or something unusual about the PDF that isn't being handled correctly.
I would recommend filing a bug report at https://crbug.com/pdfium/new with an example PDF and reproduction steps.
-Ryan Harrison
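
While a bug report is pending, one possible workaround is to group characters into lines yourself instead of relying on a bounding rectangle. The sketch below is only an illustration under an unconfirmed assumption: that the extra characters appear because the descenders of "y" and "p" force the rectangle to reach into the neighbouring line. The `CharBox` struct and `collect_line` function are hypothetical names, not part of PDFium; in a real program the per-character values could be filled from PDFium's `FPDFText_GetUnicode` and `FPDFText_GetCharBox`.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical record for one character on the page. Not a PDFium type;
 * the values could be filled from FPDFText_GetUnicode and
 * FPDFText_GetCharBox. */
typedef struct {
    unsigned int code;   /* Unicode code point */
    double bottom, top;  /* vertical extent of the glyph box, page units */
} CharBox;

/* Copies into out the code points whose vertical midpoint lies inside
 * [line_bottom, line_top]. A glyph that merely touches the band with an
 * ascender or descender is skipped. Returns the number of code points
 * written, which never exceeds out_cap. */
size_t collect_line(const CharBox *chars, size_t n,
                    double line_bottom, double line_top,
                    unsigned int *out, size_t out_cap)
{
    assert(line_bottom <= line_top);
    size_t written = 0;
    for (size_t i = 0; i < n && written < out_cap; ++i) {
        double mid = (chars[i].bottom + chars[i].top) / 2.0;
        if (mid >= line_bottom && mid <= line_top)
            out[written++] = chars[i].code;
    }
    return written;
}
```

Testing the midpoint rather than any overlap means a band tall enough to cover a descender does not also pick up the top of the line below. Whether this matches the cause of the behavior reported above is not established in this thread.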
