Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

OT Tech: Convert PDF files to Word Document

5 views
Skip to first unread message

Martha Lundgren

unread,
Jun 19, 2002, 3:00:55 PM6/19/02
to
This is a MIME message. If you are reading this text, you may want to
consider changing to a mail reader or gateway that understands how to
properly handle MIME multipart messages.

--=_F6AAD667.72137D53
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Does anyone know of a product that can take a *.pdf file and convert it,
fully formatted, to Word and allow the user to edit the document at
will?

I've found several tools that convert *.pdf files to text, one (BCL
Drake) of which does a reasonable job of preserving the formatting.

However, BCL Drake brings text in in blocks. If, when those text
blocks are edited in Word, the text exceeds the size of the original
block, adjacent blocks do not reposition to accommodate the larger text
block. This is Word working as designed.

I've also looked at ConvertZone's CZ-pdf2txt and PDF Magic Pro. They
don't retain enough formatting to even consider using them.

Opening the *.pdf file with a full copy of Acrobat is also not a
solution - it has the same problems with text boxes as noted above.

Thanks for any help you can provide!

--=_F6AAD667.72137D53
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: 8bit
Content-Description: HTML

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=iso-8859-1" http-equiv=Content-Type>
<META content="MSHTML 5.00.3315.2870" name=GENERATOR></HEAD>
<BODY style="FONT: 12pt Verdana; MARGIN-LEFT: 2px; MARGIN-TOP: 2px">
<DIV>Does anyone know of a product that can take a *.pdf file and convert it,
fully formatted, to Word and allow the user to edit the document at will?</DIV>
<DIV>&nbsp;</DIV>
<DIV>I've found several tools that convert *.pdf files to text, one (BCL Drake)
of which does a reasonable job of preserving the formatting.</DIV>
<DIV>&nbsp;</DIV>
<DIV>However, BCL Drake brings text in in blocks.&nbsp; If, when those text
blocks are edited in Word, the text exceeds the size of the original block,
adjacent blocks do not reposition to accommodate the larger text block.&nbsp;
This is Word working as designed.</DIV>
<DIV>&nbsp;</DIV>
<DIV>I've also looked at ConvertZone's CZ-pdf2txt and PDF Magic Pro.&nbsp; They
don't retain enough formatting to even consider using them.</DIV>
<DIV>&nbsp;</DIV>
<DIV>Opening the *.pdf file with a full copy of Acrobat is also not a solution -
it has the same problems with text boxes as noted above.</DIV>
<DIV>&nbsp;</DIV>
<DIV>Thanks for any help you can provide!</DIV></BODY></HTML>

--=_F6AAD667.72137D53--

Susan Derylak

unread,
Jun 19, 2002, 4:19:53 PM6/19/02
to
Hi, Martha,

Adobe Acrobat 5.0 saves PDF files to RTF format, and I haven't seen any problem with text appearing in text boxes, it appears to be fully editable in Word. I have only worked with it a little, so may not have used the same formatting you have in your documents, but it may be a viable option for you.

Susan Derylak
Globalquest Solutions, Inc.
sder...@globalquestinc.com
www.globalquestinc.com

<DIV> </DIV>


<DIV>I've found several tools that convert *.pdf files to text, one (BCL Drake)
of which does a reasonable job of preserving the formatting.</DIV>

<DIV> </DIV>
<DIV>However, BCL Drake brings text in in blocks. If, when those text


blocks are edited in Word, the text exceeds the size of the original block,
adjacent blocks do not reposition to accommodate the larger text block.

This is Word working as designed.</DIV>
<DIV> </DIV>
<DIV>I've also looked at ConvertZone's CZ-pdf2txt and PDF Magic Pro. They
don't retain enough formatting to even consider using them.</DIV>
<DIV> </DIV>


<DIV>Opening the *.pdf file with a full copy of Acrobat is also not a solution -
it has the same problems with text boxes as noted above.</DIV>

<DIV> </DIV>

Ian Hanford

unread,
Jun 20, 2002, 3:17:34 AM6/20/02
to
This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C2182A.A5F6157A
Content-Type: text/plain;
charset="iso-8859-1"

Hi Martha,

I've only come across this once, but very recently. I was able to ctrl-a,
ctrl-c from Acrobat Reader and ctrl-v into a blank Word document - the .pdf
wasn't heavily formatted, (I only had to format the Word doc to use 2
columns per page) so the formatting was mostly carried through.

I had spent a lot of time looking for a shareware or freeware product on the
net that I could use (I only had one document to convert) but most of my
search results were either unavailable (aren't HTTP 403 messages so
joy-inspiring??) or not what I wanted, so I can't offer much help there I'm
afraid.

Rgds,
Ian

-----Original Message-----
From: Martha Lundgren [mailto:Martha....@TASB.ORG]
Sent: 19 June 2002 19:59
To: HDE...@WVNVM.WVNET.EDU
Subject: OT Tech: Convert PDF files to Word Document

Does anyone know of a product that can take a *.pdf file and convert it,
fully formatted, to Word and allow the user to edit the document at
will?

I've found several tools that convert *.pdf files to text, one (BCL
Drake) of which does a reasonable job of preserving the formatting.

However, BCL Drake brings text in in blocks. If, when those text
blocks are edited in Word, the text exceeds the size of the original
block, adjacent blocks do not reposition to accommodate the larger text
block. This is Word working as designed.

I've also looked at ConvertZone's CZ-pdf2txt and PDF Magic Pro. They
don't retain enough formatting to even consider using them.

Opening the *.pdf file with a full copy of Acrobat is also not a
solution - it has the same problems with text boxes as noted above.

Thanks for any help you can provide!

.


#######################################################################

This message is intended only for the use of the individual to whom or
the entity to which it is addressed and may contain information which
is confidential and privileged. Confidentiality and privilege are not
lost by this email having been sent to the wrong person. If you are not
the intended recipient or the person responsible for delivering it to
the intended recipient, please notify Postm...@CGI-Europe.com

Any distribution by an unintended recipient is prohibited.

#######################################################################
------_=_NextPart_001_01C2182A.A5F6157A
Content-Type: text/html;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; charset=3Diso-885=
9-1">
<META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version 5.5.2650.1=
2">
<TITLE>RE: OT Tech: Convert PDF files to Word Document</TITLE>
</HEAD>
<BODY>

<P><FONT SIZE=3D2>Hi Martha,</FONT>
</P>

<P><FONT SIZE=3D2>I've only come across this once, but very recently.&nbs=
p; I was able to ctrl-a, ctrl-c from Acrobat Reader and ctrl-v into a bla=
nk Word document - the .pdf wasn't heavily formatted, (I only had to form=
at the Word doc to use 2 columns per page) so the formatting was mostly c=
arried through.</FONT></P>

<P><FONT SIZE=3D2>I had spent a lot of time looking for a shareware or fr=
eeware product on the net that I could use (I only had one document to co=
nvert) but most of my search results were either unavailable (aren't HTTP=
=20403 messages so joy-inspiring??) or not what I wanted, so I can't offe=
r much help there I'm afraid.</FONT></P>

<P><FONT SIZE=3D2>Rgds,</FONT>
<BR><FONT SIZE=3D2>Ian</FONT>
</P>

<P><FONT SIZE=3D2>-----Original Message-----</FONT>
<BR><FONT SIZE=3D2>From: Martha Lundgren [<A HREF=3D"mailto:Martha.Lundgr=
e...@TASB.ORG">mailto:Martha....@TASB.ORG</A>]</FONT>
<BR><FONT SIZE=3D2>Sent: 19 June 2002 19:59</FONT>
<BR><FONT SIZE=3D2>To: HDE...@WVNVM.WVNET.EDU</FONT>
<BR><FONT SIZE=3D2>Subject: OT Tech: Convert PDF files to Word Document</=
FONT>
</P>

<P><FONT SIZE=3D2>Does anyone know of a product that can take a *.pdf fil=
e and convert it,</FONT>
<BR><FONT SIZE=3D2>fully formatted, to Word and allow the user to edit th=
e document at</FONT>
<BR><FONT SIZE=3D2>will?</FONT>
</P>

<P><FONT SIZE=3D2>I've found several tools that convert *.pdf files to te=
xt, one (BCL</FONT>
<BR><FONT SIZE=3D2>Drake) of which does a reasonable job of preserving th=
e formatting.</FONT>
</P>

<P><FONT SIZE=3D2>However, BCL Drake brings text in in blocks.&nbsp; If, =
when those text</FONT>
<BR><FONT SIZE=3D2>blocks are edited in Word, the text exceeds the size o=
f the original</FONT>
<BR><FONT SIZE=3D2>block, adjacent blocks do not reposition to accommodat=
e the larger text</FONT>
<BR><FONT SIZE=3D2>block.&nbsp; This is Word working as designed.</FONT>
</P>

<P><FONT SIZE=3D2>I've also looked at ConvertZone's CZ-pdf2txt and PDF Ma=
gic Pro.&nbsp; They</FONT>
<BR><FONT SIZE=3D2>don't retain enough formatting to even consider using =
them.</FONT>
</P>

<P><FONT SIZE=3D2>Opening the *.pdf file with a full copy of Acrobat is a=
lso not a</FONT>
<BR><FONT SIZE=3D2>solution - it has the same problems with text boxes as=
=20noted above.</FONT>
</P>

<P><FONT SIZE=3D2>Thanks for any help you can provide!</FONT>
</P>

<HR><STRONG><FONT COLOR=3D#FF0000>
Attention: </FONT></STRONG><BR>
CGI Group (Europe) Ltd has the lawful right to record, monitor & inspect =
messages<CR>
between its employees and third parties. Messages shall be subject to law=
ful supervision<CR>
as CGI Group (Europe) Ltd deems necessary to protect its information, int=
erests and reputation.<CR>
<P>
CGI Group (Europe) Ltd prohibits and takes steps to prevent its systems f=
rom being used to view,<CR>
store or forward offensive or discriminatory material. If this message co=
ntains such material<CR>
please report it to ab...@cgi-europe.com<CR>
<P>
This message is intended only for the individual to whom it is addressed =
and may contain information<CR>
which is confidential and privileged. Confidentiality and privilege are n=
ot lost by this email having<CR>
been sent to the wrong person. If you are not the intended recipient or t=
he person responsible for<CR>
delivering it to the intended recipient, please notify the sender by repl=
y e-mail. Any distribution<CR>
by an unintended recipient is prohibited.<CR>
<P>
CGI Group (Europe) Ltd, Broadlands House, Primett Road, Stevenage, Herts.=
, SG1 3EE
<P>
</FONT><BR>CGI Group (Europe) Limited </FONT><BR>
<A href=3D"http://www.cgi.ca">www.cgi-europe.com</A></STRONG>
<HR></BODY>
</HTML>=

------_=_NextPart_001_01C2182A.A5F6157A--

Jim Helfer

unread,
Jun 20, 2002, 9:28:59 AM6/20/02
to
I think the main sticking point is that PDFs were designed to be a "final"
& uneditable document. One reason people send PDFs is that they can be
reasonably confident that what they send (i.e. specifications, contracts and
drawings in our case) isn't going to be immediately changed by the
recipient.

If you make it trivial to reproduce a PDF document in Word that's exactly
like the original PDF, how do I know that everything I send in pdf form to
you won't be immediately edited to your liking and then converted back into
a PDF file without my company's knowledge or consent? If that capability
becomes commonand easy, then I need to go find a new document format,
because pdf is worse than useless.

Jim Helfer

Hi Martha,

Rgds,
Ian


#######################################################################

------_=_NextPart_001_01C2182A.A5F6157A--


____________________________________________________________________
eMail Privacy Protected by QuarantineMail (http://www.quarantinemail.com)
Licensed to:

Thomas Schweich

unread,
Jun 20, 2002, 12:51:38 PM6/20/02
to
Jim,

You need to find a new document format because .pdf *is* worse than
useless, in that it gives a false sense of security. At home I use
OmniPage Pro (version 9, I believe) to run my scanner. One of the new
version 9 features is the ability to OCR a .pdf document. Works pretty
good, and can save an OCR-ed .pdf document in a variety of formats.

Personally, I hate .pdf documents. It seems to me that whether I can save
a .pdf, or whether "Find" works is under control of the author. The
File-Save option seems to be turned off most of the time, perhaps it's a
default, so I have learned to go to C:\Documents and Settings\Tom\Local
Settings\Temp to find the temp file and copy it to a permanent place. Also
some printers, mostly of the Canon variety in my experience, have a hard
time printing .pdf document coherently. Altho' the NT boys assure me that
it's a problem with the driver on my laptop and all I have to do is delete
the printer and redefine it (again .... again .... again ..... ......)

--
Tom "I hate .pdf and Java" Schweich IBM: (415) 545-3644 Chiron: (510)
923-2385
I/T Architect, Systems Management and Networking Services
mailto: schw...@us.ibm.com http://www.schweich.com


Jim Helfer
<JHelfer@WTWARCH. To: HDE...@WVNVM.WVNET.EDU
COM> cc:
Sent by: Help Subject: Re: OT Tech: Convert PDF files to Word Document
Desk Discussions
<HDE...@WVNVM.WV
NET.EDU>


06/20/2002 06:33
AM
Please respond to
Help Desk
Discussions

Marvin Lewis

unread,
Jun 21, 2002, 12:40:48 AM6/21/02
to
At one time, MS Office had a "Print-to-HTML" add-in that you could use.
Unfortunately, you lose some features and resolution. Acrobat 5 tightened
the default security settings for documents. Many documents created in
earlier versions of Acrobat had security disabled and you could
cut-and-paste text and images into your own documents.

The Generic Text printer driver should allow you to save all the
(non-bitmapped) text in a PDF file, but you would lose all formatting.

I just received an ad for ScanSoft's OmniPage Pro 11, which they claim can
convert raw PDFs to editable text. Check www.scansoft.com or
www.omnipage.com for more info.

Adobe acknowledges that there is a need for security in PDF files, including
the ability to add digital signatures, encryption, create "time-bomb"
expiring documents, and workflow management. Service bureaus such as print
shops, Kinko's, news agencies, etc have been working with Adobe to maintain
some editing capability while protecting the file creator's intellectual
property.

Marvin

Martha Lundgren

unread,
Jun 21, 2002, 7:54:12 AM6/21/02
to
Thanks to everyone who has replied so far.

To Jim's point - yup, that's what I generally understood PDFs to be.
However, anyone who has a full copy of Acrobat, as opposed to just the
reader, can open a PDF and edit it. Now, as I have found out, it's not
a graceful edit, but it can be done. And of course it can be converted
back.

Its other selling point was that anyone with the free Reader could read
your document. You can address a much larger audience that just those
people who share the same source application you used to create it. So
if your concern isn't absolute control of the content, it's still quite
useful for that.

I haven't looked to see if you can password protect a PDF file, but if
you can, there's a way out on controlling the content provided the
password can't be cracked.

Martha Lundgren
Manager, Information Center
Texas Association of School Boards
512-467-3677 x3108
fax 512-467-3644
>>> JHe...@WTWARCH.COM 06/20/02 08:27 AM >>>

Jim Helfer

Hi Martha,

Rgds,
Ian


#######################################################################

<P><FONT SIZE=3D2>However, BCL Drake brings text in in blocks.* If, =


when those text</FONT>
<BR><FONT SIZE=3D2>blocks are edited in Word, the text exceeds the size
o=
f the original</FONT>
<BR><FONT SIZE=3D2>block, adjacent blocks do not reposition to
accommodat=
e the larger text</FONT>

<BR><FONT SIZE=3D2>block.* This is Word working as designed.</FONT>
</P>

<P><FONT SIZE=3D2>I've also looked at ConvertZone's CZ-pdf2txt and PDF
Ma=

gic Pro.* They</FONT>

Martha Lundgren

unread,
Jun 21, 2002, 4:17:43 PM6/21/02
to
This is a MIME message. If you are reading this text, you may want to
consider changing to a mail reader or gateway that understands how to
properly handle MIME multipart messages.

--=_ABF78864.4B2A44A6


Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Thanks for the suggestion. Unfortunately, we've got 50 or more 50+
page
documents that are heavily formatted. Right now, they scan them in
and
OCR them and adjust any formatting as necessarily. They hope to
streamline that process, but that method does work.

I'm going to check out the suggestion that someone posted that we
might
be able to open them directly with a couple of OCR packages.

>>> Ian.h...@CGI-EUROPE.COM 06/20/02 02:18AM >>>

Hi Martha,

Rgds,
Ian


--=_ABF78864.4B2A44A6


Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: 8bit
Content-Description: HTML

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=iso-8859-1" http-equiv=Content-Type>
<META content="MSHTML 5.00.3315.2870" name=GENERATOR></HEAD>

<BODY style="FONT: 12pt Verdana; MARGIN-LEFT: 2px; MARGIN-TOP: 2px">Thanks for
the suggestion.&nbsp; Unfortunately, we've got 50 or more 50+ page<BR>documents
that are heavily formatted.&nbsp; Right now, they scan them in and<BR>OCR them
and adjust any formatting as necessarily.&nbsp; They hope to<BR>streamline that
process, but that method does work.<BR><BR>I'm going to check out the suggestion
that someone posted that we might<BR>be able to open them directly with a couple
of OCR packages.<BR><BR>&gt;&gt;&gt; Ian.h...@CGI-EUROPE.COM 06/20/02 02:18AM
&gt;&gt;&gt;<BR>This message is in MIME format. Since your mail reader does not
understand<BR>this format, some or all of this message may not be
legible.<BR><BR>------_=_NextPart_001_01C2182A.A5F6157A<BR>Content-Type:
text/plain;<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
charset="iso-8859-1"<BR><BR>Hi Martha,<BR><BR>I've only come across this once,
but very recently.&nbsp; I was able to ctrl-a,<BR>ctrl-c from Acrobat Reader and
ctrl-v into a blank Word document - the .pdf<BR>wasn't heavily formatted, (I
only had to format the Word doc to use 2<BR>columns per page) so the formatting
was mostly carried through.<BR><BR>I had spent a lot of time looking for a
shareware or freeware product on the<BR>net that I could use (I only had one
document to convert) but most of my<BR>search results were either unavailable
(aren't HTTP 403 messages so<BR>joy-inspiring??) or not what I wanted, so I


can't offer much help there

I'm<BR>afraid.<BR><BR>Rgds,<BR>Ian<BR><BR>-----Original Message-----<BR>From:
Martha Lundgren [<A
href="mailto:Martha....@TASB.ORG]">mailto:Martha....@TASB.ORG]</A><BR>Sent:
19 June 2002 19:59<BR>To: HDE...@WVNVM.WVNET.EDU<BR>Subject: OT Tech: Convert
PDF files to Word Document<BR><BR>Does anyone know of a product that can take a
*.pdf file and convert it,<BR>fully formatted, to Word and allow the user to
edit the document at<BR>will?<BR><BR>I've found several tools that convert *.pdf
files to text, one (BCL<BR>Drake) of which does a reasonable job of preserving
the formatting.<BR><BR>However, BCL Drake brings text in in blocks.&nbsp; If,
when those text<BR>blocks are edited in Word, the text exceeds the size of the
original<BR>block, adjacent blocks do not reposition to accommodate the larger
text<BR>block.&nbsp; This is Word working as designed.<BR><BR>I've also looked
at ConvertZone's CZ-pdf2txt and PDF Magic Pro.&nbsp; They<BR>don't retain enough
formatting to even consider using them.<BR><BR>Opening the *.pdf file with a
full copy of Acrobat is also not a<BR>solution - it has the same problems with
text boxes as noted above.<BR><BR>Thanks for any help you can
provide!<BR>.<BR><BR></BODY></HTML>

--=_ABF78864.4B2A44A6--

Kukla Fran and Ollie

unread,
Jun 21, 2002, 10:57:29 PM6/21/02
to
Don't forget 508 accessibility issues.

There is some disagreement whether of not PDFs meet accessibility standards
under the ADA. Version 5 of Adobe Acrobat is said to create 508-compliant
PDFs, however, as a side issue this apparently means you cannot password
protect it.

While Section 508 accessibility standards only apply to US federal
agencies, it is only a matter of time before Uncle Same starts requiring
state and local governments to adhere to the standard if they want to do
business with Uncle Same and/or received federal tax dollars. I can also
see this expanding to schools (colleges/universities, as well as local) if
they receive federal funding, too.

Dennis

wilson...@gmail.com

unread,
Dec 1, 2014, 4:50:04 AM12/1/14
to
If you want to convert one pdf file to word documetn, then there are two easy method
one method is by doing it through Adobe Acrobat and other method is by using a Free Online Conversion Service
0 new messages