Reading Microsoft Word files

David Erickson

Mar 15, 2005, 12:53:06 PM
I need to read about 15,000 Word documents and add wrappers around them
for importing into a database. What is the best way to read a .doc file
in VB? I would like to preserve the formatting by saving each document as
RTF so it can be included in the import.

I can do this in Word VBA, but I would prefer to do it in straight VB
(either VB6 or .Net).

Thanks for any help on this.

David


Veign

Mar 15, 2005, 1:04:52 PM
Word Automation would work for you.

Basic connection to the Word object for automation:
http://www.veign.com/vrc_codeview.asp?type=app&id=62

--
Chris Hanscom - Microsoft MVP (VB)
Veign's Resource Center
http://www.veign.com/vrc_main.asp
--
Read. Decide. Sign the petition to Microsoft.
http://classicvb.org/petition/



David Erickson

Mar 15, 2005, 1:12:46 PM
Veign,

Thank you. I did not think of Word automation. How does it perform? If I
am converting 1,000 files each day, how much system overhead will this
cause?

David


Veign

Mar 15, 2005, 2:19:52 PM
I guess it really depends on your system and the size of the Word documents.
The easiest way to find out is to create a simple test and time it.
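
A simple test along those lines could look like the sketch below. It is written in Python with pywin32 only to keep it short; the Word object model calls (`Documents.Open`, `SaveAs` with file format 6, which is `wdFormatRTF`, and `Close`) are the same ones you would use from VB6 or VB.NET. The helper names `convert_with_word`, `time_batch`, and `main` are made up for illustration, and pywin32 plus an installed copy of Word are assumed. It starts Word once and reuses it for every file, since launching Word per document is usually the most expensive part.

```python
import time
from pathlib import Path


def convert_with_word(word, src, dst):
    """Save one .doc as RTF through an already running Word instance."""
    # Word resolves relative paths against its own working folder,
    # so pass absolute paths.
    src = str(Path(src).resolve())
    dst = str(Path(dst).resolve())
    doc = word.Documents.Open(src, False, True)  # ConfirmConversions=False, ReadOnly=True
    try:
        doc.SaveAs(dst, 6)  # 6 == wdFormatRTF
    finally:
        doc.Close(False)  # close without saving changes to the original


def time_batch(convert, paths, out_dir):
    """Return the average seconds per file for convert(src, dst) over paths."""
    out_dir = Path(out_dir)
    start = time.perf_counter()
    count = 0
    for src in paths:
        convert(src, out_dir / (Path(src).stem + ".rtf"))
        count += 1
    elapsed = time.perf_counter() - start
    return elapsed / count if count else 0.0


def main(sample_dir, out_dir):
    # Assumption: pywin32 is installed and Word is available on this machine.
    import win32com.client

    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    try:
        paths = sorted(Path(sample_dir).glob("*.doc"))
        per_file = time_batch(
            lambda src, dst: convert_with_word(word, src, dst), paths, out_dir
        )
        minutes = per_file * 1000 / 60
        print(f"{per_file:.2f} s per file, about {minutes:.1f} min per 1000 files")
    finally:
        word.Quit()
```

Run `main` against a folder holding a few dozen representative documents and scale the per-file figure up to the daily volume.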




David Erickson

Mar 23, 2005, 2:13:39 AM
Thanks for your help. I have tried this and it works okay, but very slowly.
I thought that was going to be okay, until I talked to the user again and
found out there are actually more than 100,000 documents to process.

What I need to do is:
1. Read the .doc file.
2. Extract the account number, name, and date.
3. Convert to RTF.
4. Add an HL7 wrapper around the RTF file.
5. Save into the application's HL7 inbox.
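
Step 4 is the part that does not depend on Word at all, so here is a rough sketch of it, in Python for brevity. RTF is full of backslashes, and the backslash is the HL7 v2 escape character, so the RTF text has to be escaped before it goes into a field. Everything about the message layout is an assumption made for illustration: the `ORU^R01` message type, the MSH/PID/OBR/OBX segments, account number in PID-18, name in PID-5, the RTF as an `FT` value in OBX-5, and the names `DOCIMPORT` and `DOC^Document`. The receiving application's interface specification decides the real layout.

```python
from datetime import date, datetime

# HL7 v2 escape sequences for the default delimiters, plus hex escapes
# for CR/LF so line breaks in the RTF cannot end a segment early.
_HL7_ESCAPES = {
    "\\": "\\E\\",
    "|": "\\F\\",
    "^": "\\S\\",
    "&": "\\T\\",
    "~": "\\R\\",
    "\r": "\\X0D\\",
    "\n": "\\X0A\\",
}


def hl7_escape(text):
    """Escape text in a single pass so inserted backslashes are not re-escaped."""
    return "".join(_HL7_ESCAPES.get(ch, ch) for ch in text)


def build_segment(seg_id, fields):
    """Build a segment from a {1-based field number: escaped value} mapping."""
    last = max(fields)
    return "|".join([seg_id] + [fields.get(i, "") for i in range(1, last + 1)])


def wrap_rtf(rtf, account, family, given, doc_date, control_id, sent_at):
    """Wrap RTF text in a minimal HL7 v2 message (layout is illustrative only)."""
    msh = "MSH|^~\\&|" + "|".join([
        "DOCIMPORT", "", "", "",            # MSH-3..6, hypothetical sender name
        sent_at.strftime("%Y%m%d%H%M%S"),   # MSH-7 message date/time
        "",                                 # MSH-8 unused
        "ORU^R01",                          # MSH-9 assumed message type
        hl7_escape(control_id),             # MSH-10 message control ID
        "P",                                # MSH-11 processing ID
        "2.3",                              # MSH-12 assumed version
    ])
    name = hl7_escape(family) + "^" + hl7_escape(given)
    pid = build_segment("PID", {5: name, 18: hl7_escape(account)})
    obr = build_segment("OBR", {1: "1"})
    obx = build_segment("OBX", {
        1: "1",
        2: "FT",                            # formatted text
        3: "DOC^Document",                  # hypothetical identifier
        5: hl7_escape(rtf),
        11: "F",                            # final result
        14: doc_date.strftime("%Y%m%d"),    # document date
    })
    return "\r".join([msh, pid, obr, obx]) + "\r"  # CR ends each segment


if __name__ == "__main__":
    message = wrap_rtf("{\\rtf1 Hello\\par}", "12345", "Doe", "John",
                       date(2005, 3, 23), "MSG0001", datetime.now())
    print(message.replace("\r", "\n"))
```

Doing the escape in one pass over the characters matters: replacing backslashes first and the other delimiters afterwards with chained string replaces is fine, but the reverse order would escape the backslashes that the earlier replacements just inserted.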

Is there a package that will allow me to open a .doc file more quickly than
Word automation?

Thanks again!

