Getting coordinates for each word in a PDF file

Raja

Sep 22, 2009, 3:15:00 AM
to PDF2Text
Hi everyone,

I have a requirement to convert PDF text into a text file and to get
the coordinates of each word in the PDF file. My platform is Visual
Studio 2008 (VB.NET/C#). I downloaded the trial version of
PDF2TEXT.DLL, but when I try to add it through Add Reference in
VB.NET I get the error "This is not a COM component". Can anyone
help me add pdf2text.dll to my VB.NET project and get the
coordinates of each word?

Thanks in advance.

trn2

Sep 28, 2009, 9:28:51 PM
to PDF2Text

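PDF2Text is a native C DLL, not a .NET assembly or a COM component,
which is why Add Reference rejects it. A native DLL has to be called
from .NET through P/Invoke (DllImport in C#, Declare in VB.NET)
instead.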

Based on your requirements, you may want to look at PDFNet SDK
(http://www.pdftron.com/pdfnet/), which is available as a .NET
component. In particular, see the TextExtract sample project:
http://www.pdftron.com/pdfnet/samplecode.html#TextExtract
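
For anyone wondering what "coordinates of each word" amounts to, here
is a minimal, library-independent sketch in Python. It is not the
PDFNet or PDF2Text API. It assumes the extractor has already produced
one bounding box per character; the Glyph and Word types and the
group_words function are hypothetical names used only for
illustration. A word's box is simply the union of the boxes of its
characters, and whitespace ends a word.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Glyph:
    """One extracted character with its box (hypothetical type)."""
    char: str
    x0: float  # left
    y0: float  # bottom
    x1: float  # right
    y1: float  # top


@dataclass
class Word:
    """A run of non-space characters with the union of their boxes."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def group_words(glyphs: Iterable[Glyph]) -> List[Word]:
    """Merge consecutive non-space glyphs into words.

    Assumes the glyphs arrive in reading order on a single line.
    """
    words: List[Word] = []
    current: List[Glyph] = []

    def flush() -> None:
        # Emit the pending word, if any, and start a new one.
        if current:
            words.append(Word(
                text="".join(g.char for g in current),
                x0=min(g.x0 for g in current),
                y0=min(g.y0 for g in current),
                x1=max(g.x1 for g in current),
                y1=max(g.y1 for g in current),
            ))
            current.clear()

    for g in glyphs:
        if g.char.isspace():
            flush()
        else:
            current.append(g)
    flush()
    return words


if __name__ == "__main__":
    # Made-up coordinates in PDF points, for demonstration only.
    sample = [
        Glyph("H", 10, 700, 18, 712),
        Glyph("i", 18, 700, 21, 712),
        Glyph(" ", 21, 700, 24, 712),
        Glyph("t", 24, 700, 28, 712),
        Glyph("o", 28, 700, 34, 710),
    ]
    for w in group_words(sample):
        print(w.text, w.x0, w.y0, w.x1, w.y1)
```

A real extractor also has to handle line breaks, rotated text, and
words that are split only by a gap rather than a space character,
which this sketch leaves out.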