Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Converting HTML pages to Word documents

0 views
Skip to first unread message

Rohit Gupta

unread,
Feb 10, 2004, 5:42:43 PM2/10/04
to
I have a requirement where a large no. of HTML pages need to be
converted into Word Document of a predefined format. In addition to
merely conversion I need that Word docs generated should include some
simple substitution of HTML tags for Word styles (like headers, titles
etc).

I have extensively searched the web for any such existing tools but
any of them didn't fulfilled my requirement. Do we have any such tool?

Thanks in advance!
Rohit Gupta


--
====== Please DELETE This Line and Everything Below It When Replying! ====
THIS NEWSGROUP is only for questions about newsgroups and the Internet.
IF YOU HAVE questions on other topics, search for appropriate newsgroups
using http://members.fortunecity.com/nnqweb/ngroups.html
LEARN about newsgroups at the news.newusers.questions Web site:
http://members.fortunecity.com/nnqweb/
======= The moderators append this notice to each posted article; ========
======= It does not imply that the article is on topic or correct ========

Boomer

unread,
Feb 10, 2004, 8:25:38 PM2/10/04
to
contac...@yahoo.com (Rohit Gupta) wrote in
news:nnq.d40fc92a.040...@posting.google.com:

> I have a requirement where a large no. of HTML pages need to be
> converted into Word Document of a predefined format. In addition
> to merely conversion I need that Word docs generated should
> include some simple substitution of HTML tags for Word styles
> (like headers, titles etc).
>
> I have extensively searched the web for any such existing tools
> but any of them didn't fulfilled my requirement. Do we have any
> such tool?
>
> Thanks in advance!
> Rohit Gupta

Hi

This group is about 'news' (usenet).

You might want to ask at alt.html or at a Microsoft Word news group.
I'm sure someone can answer your question in the correct group.

Good luck. :)

Rohit Gupta

unread,
Feb 11, 2004, 7:28:18 PM2/11/04
to
Boomer <Boomer...@mailinator.com> wrote in message news:<nnq.40298492$0$196$892e...@authen.yellow.readfreenews.net>...

> contac...@yahoo.com (Rohit Gupta) wrote in
> news:nnq.d40fc92a.040...@posting.google.com:
>
> > I have a requirement where a large no. of HTML pages need to be
> > converted into Word Document of a predefined format. In addition
> > to merely conversion I need that Word docs generated should
> > include some simple substitution of HTML tags for Word styles
> > (like headers, titles etc).
> >
> > I have extensively searched the web for any such existing tools
> > but any of them didn't fulfilled my requirement. Do we have any
> > such tool?
> >
> > Thanks in advance!
> > Rohit Gupta
>
> Hi
>
> This group is about 'news' (usenet).
>
> You might want to ask at alt.html or at a Microsoft Word news group.
> I'm sure someone can answer your question in the correct group.
>
> Good luck. :)
>
>
> --
Thanks for the Redirection!
Rohit

Kathy Morgan

unread,
Feb 12, 2004, 3:48:30 AM2/12/04
to
Rohit Gupta <contac...@yahoo.com> wrote:

> I have a requirement where a large no. of HTML pages need to be
> converted into Word Document of a predefined format. In addition to
> merely conversion I need that Word docs generated should include some
> simple substitution of HTML tags for Word styles (like headers, titles
> etc).

As Boomer said, this isn't really the right group for the question, but
something you might try (if you haven't already) is just to select the
material, copy (control-c) and paste into Word. That probably won't
convert all the tags, but it might get most of them.

--
Kathy - read reviews of other newsgroups in news:news.groups.reviews
Good Net Keeping Seal of Approval at <http://www.gnksa.org/>
OE-quotefix can fix OE:
<http://home.in.tum.de/~jain/software/oe-quotefix/>

Rohit Gupta

unread,
Feb 13, 2004, 4:02:19 PM2/13/04
to
kmo...@spamcop.net (Kathy Morgan) wrote in message news:<nnq.1g90n4n.i4uqa72u5tsN%kmo...@spamcop.net>...

> Rohit Gupta <contac...@yahoo.com> wrote:
>
> > I have a requirement where a large no. of HTML pages need to be
> > converted into Word Document of a predefined format. In addition to
> > merely conversion I need that Word docs generated should include some
> > simple substitution of HTML tags for Word styles (like headers, titles
> > etc).
>
> As Boomer said, this isn't really the right group for the question, but
> something you might try (if you haven't already) is just to select the
> material, copy (control-c) and paste into Word. That probably won't
> convert all the tags, but it might get most of them.
>
> --
> Kathy - read reviews of other newsgroups in news:news.groups.reviews
> Good Net Keeping Seal of Approval at <http://www.gnksa.org/>
> OE-quotefix can fix OE:
> <http://home.in.tum.de/~jain/software/oe-quotefix/>
>
>
> --
>
Thanks for the reply!

Yeah, It could work to some extent but I have a pages of around 60-70K
and doing this thing manually will cost thousands of bucks. I am
looking for some tool to automate this process to get this done
efficiently in less time and saving the money.

Let me know if you have any suggestions.

0 new messages