Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

html table to csv

31 views
Skip to first unread message

Brian Jordan

unread,
Nov 1, 2023, 11:44:29 AM11/1/23
to
I have a lot of html tables which I want to convert and then, via
Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
to eventually make a pdf booklet. The bit from fireworkz onwards is tried
and tested here and is fairly straightforward, the table to csv bit less
so. It looks like some fairly heavy duty searching and replacing will be
needed at first unless there is a program somewhere which might help me;
is there such a thing?
I am aware of some online stuff under Windows which might help but would
really like to do the whole job under RISC OS. Any thoughts appreciated.
Thanks
B

--
_____________________________________________________________________

Brian Jordan
brian....@btinternet.com
RISC OS 5.28 (16-Dec-20) on Raspberry Pi
_____________________________________________________________________

Paul Sprangers

unread,
Nov 1, 2023, 11:56:19 AM11/1/23
to
In article <5afc8e1380b...@btinternet.com>,
Brian Jordan <brian....@btinternet.com> wrote:

> It looks like some fairly heavy duty searching and replacing will be
> needed at first unless there is a program somewhere which might help me;
> is there such a thing?

You might give !ConvText a try. It's at
https://riscos.sprie.nl/sprang.riscos/Downloads/ConvText.zip

Paul

--
https://riscos.sprie.nl

Brian Jordan

unread,
Nov 1, 2023, 12:04:56 PM11/1/23
to
In article <5afc8f2...@sprie.nl>,
Paul Sprangers <Pa...@sprie.nl> wrote:
Thanks

[Snip]

> You might give !ConvText a try. It's at
> https://riscos.sprie.nl/sprang.riscos/Downloads/ConvText.zip

> Paul

Chris Newman

unread,
Nov 1, 2023, 12:32:23 PM11/1/23
to
In article <5afc8e1380b...@btinternet.com>, Brian Jordan
<brian....@btinternet.com> wrote:
> I have a lot of html tables which I want to convert and then, via
> Fireworkz, use to produce draw files for use in Ovation Pro and
> PrintPDF to eventually make a pdf booklet. The bit from fireworkz
> onwards is tried and tested here and is fairly straightforward, the
> table to csv bit less so. It looks like some fairly heavy duty
> searching and replacing will be needed at first unless there is a
> program somewhere which might help me; is there such a thing? I am
> aware of some online stuff under Windows which might help but would
> really like to do the whole job under RISC OS. Any thoughts
> appreciated. Thanks B

CSV Edit Bernard Veasey

CSVamp Ray Favre

--
Chris

Chris Newman

unread,
Nov 1, 2023, 12:42:12 PM11/1/23
to
In article <5afc8e1380b...@btinternet.com>, Brian Jordan
<brian....@btinternet.com> wrote:
> I have a lot of html tables which I want to convert and then, via
> Fireworkz, use to produce draw files for use in Ovation Pro and
> PrintPDF to eventually make a pdf booklet. The bit from fireworkz
> onwards is tried and tested here and is fairly straightforward, the
> table to csv bit less so. It looks like some fairly heavy duty
> searching and replacing will be needed at first unless there is a
> program somewhere which might help me; is there such a thing? I am
> aware of some online stuff under Windows which might help but would
> really like to do the whole job under RISC OS. Any thoughts
> appreciated. Thanks B

I have !UnHTML Mike Williams 1997. Very old but loaded in R5.19. Purpose
see below from its Help file. I can send toyou. So old I presume no
copyright problems.

Converts HTML to Plain Text, Impression, or Draw Textarea format.
Extracts bookmarks from links found on HTML pages.

Author: Mike Williams
mi...@econym.demon.co.uk

Usage: Choose the output format you require from the iconbar menu.
Drag a HTML file to the UnHTML icon on the iconbar
After a few seconds a save box will open
Drag the resulting text to a filer or application

Plain Text Format:
The HTML tags are ripped out, special characters are converted,
but no formatting is applied.

--
Chris

Brian Jordan

unread,
Nov 1, 2023, 12:49:40 PM11/1/23
to
Thanks

In article <5afc92...@npost.uk>,
Chris Newman <m...@npost.uk> wrote:

[Snip]

> CSV Edit Bernard Veasey

> CSVamp Ray Favre

Jean-Michel

unread,
Nov 1, 2023, 3:46:01 PM11/1/23
to
In message <5afc92b180b...@btinternet.com>
Brian Jordan <brian....@btinternet.com> wrote:

> Thanks

> In article <5afc92...@npost.uk>,
> Chris Newman <m...@npost.uk> wrote:

> [Snip]

>> CSV Edit Bernard Veasey

>> CSVamp Ray Favre

> B
Thanks for pointing out these programs, they are always useful.
Some time ago I worked on !Psifs and I used the SIBO to RISC OS converters
(They are on Thomas Millius' site). Very convenient.

I just took them out and was able to do the conversion you asked for from
a csv file extracted from !Fireworks.


I have sent you an example to test at your address.

--
Jean-Michel

Jean-Michel

unread,
Nov 1, 2023, 3:53:02 PM11/1/23
to
In message <eb30a4f...@jmc.bruck.orange.fr>
Sorry, I just reread your message and the conversion must be done the
other way!!! :-(

HTML to CSV not CSV to HTML....

--
Jean-Michel

Harriet Bazley

unread,
Nov 1, 2023, 5:51:13 PM11/1/23
to
On 1 Nov 2023 as I do recall,
Brian Jordan wrote:

> I have a lot of html tables which I want to convert and then, via
> Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
> to eventually make a pdf booklet. The bit from fireworkz onwards is tried
> and tested here and is fairly straightforward, the table to csv bit less
> so. It looks like some fairly heavy duty searching and replacing will be
> needed at first unless there is a program somewhere which might help me;
> is there such a thing?

Current versions of EasiWriter can load HTML - you won't get any
cascading style sheet formatting, but I've tested it on some pages with
tables that I originally hand-crafted before uploading them to the host
site, and I can load them back into EasiWriter and select the 'Table'
region to save as a selection. EW exports tables as TSV, not CSV, but
I think most things that understand the latter also understand the
former; in the case of Fireworkz, it is perfectly possible to import
tab-separated files into a document as tables provided that they are
*filetyped* as CSV (&DFE).

So it depends how your HTML tables were originally created and how clean
the coding of them is, I suspect. If they genuinely are just tables of
data and not messed up with all sorts of layout stuff then you can load
the pages into EasiWriter, save the tables out as tab-separated text
selections, and bulk-filetype those files as CSV in order to import them
into Fireworkz as tables and/or spreadsheet cells, depending on what you
want to do with them there.


--
Harriet Bazley == Loyaulte me lie ==

I mean to live forever - or die trying!

Brian Jordan

unread,
Nov 1, 2023, 8:15:45 PM11/1/23
to
Many thanks,

I have EasiWriter here although whether it's the current version will be
discovered in the morning.

In article <2fcfaefc5...@bazleyfamily.co.uk>,
Harriet Bazley <har...@bazleyfamily.co.uk> wrote:
> On 1 Nov 2023 as I do recall,
> Brian Jordan wrote:

[Snip my original query]

> Current versions of EasiWriter can load HTML - you won't get any
> cascading style sheet formatting, but I've tested it on some pages with
> tables that I originally hand-crafted before uploading them to the host
> site, and I can load them back into EasiWriter and select the 'Table'
> region to save as a selection. EW exports tables as TSV, not CSV, but
> I think most things that understand the latter also understand the
> former; in the case of Fireworkz, it is perfectly possible to import
> tab-separated files into a document as tables provided that they are
> *filetyped* as CSV (&DFE).
That sounds promising

> So it depends how your HTML tables were originally created and how clean
> the coding of them is, I suspect. If they genuinely are just tables of
> data and not messed up with all sorts of layout stuff then you can load
> the pages into EasiWriter, save the tables out as tab-separated text
> selections, and bulk-filetype those files as CSV in order to import them
> into Fireworkz as tables and/or spreadsheet cells, depending on what you
> want to do with them there.
I created the majority of these tables in lovingly hand crafted html back
in the mid '90s and before publishing them they were run through HTML
Tidy until they were squeaky clean. I am quite hopeful.

Brian Jordan

unread,
Nov 2, 2023, 5:53:12 AM11/2/23
to
Even better!

In article <5afcbcd322b...@btinternet.com>,
Brian Jordan <brian....@btinternet.com> wrote:
> Many thanks,

> I have EasiWriter here although whether it's the current version will be
> discovered in the morning.
Not only do I have a version which works as Harriet describes it does
even better in that exporting as pdf is available in my version thus
cutting out a number of middle men.

[Snip]

Harriet Bazley

unread,
Nov 2, 2023, 6:53:46 AM11/2/23
to
On 2 Nov 2023 as I do recall,
Brian Jordan wrote:

> Even better!
>
> In article <5afcbcd322b...@btinternet.com>,
> Brian Jordan <brian....@btinternet.com> wrote:
> > Many thanks,
>
> > I have EasiWriter here although whether it's the current version will be
> > discovered in the morning.
> Not only do I have a version which works as Harriet describes it does
> even better in that exporting as pdf is available in my version thus
> cutting out a number of middle men.
>

Oh, I assumed you actually needed the data in Fireworkz for calculation
purposes rather than layout....

--
Harriet Bazley == Loyaulte me lie ==

Eschew Obfuscation.

Brian Jordan

unread,
Nov 2, 2023, 7:19:13 AM11/2/23
to
In article <ec1ff6fc5...@bazleyfamily.co.uk>,
Harriet Bazley <har...@bazleyfamily.co.uk> wrote:
> On 2 Nov 2023 as I do recall,
> Brian Jordan wrote:

[Snip]

> Oh, I assumed you actually needed the data in Fireworkz for calculation
> purposes rather than layout....

Fair assumption but to explain...
The files I am converting are old Championship tables for a motor racing
club. I can't remember how they were produced, presumably in a
spreadsheet, but all calculations were completed before creating the web
tables. I no longer have the original files but am able to grab the
tables from the, soon to be closed, site. The club has asked if I can
grab all of the tables from 1996 to the present and produce an inclusive
pdf document. I have all the recent (post 2010) files here as Fireworkz
files from which I produce pdfs and HTML (Using Paul Vigay's Webworkz)
and the Fireworkz route for the old files suggested itself to me. The
Easiwriter solution makes it all so much easier, thank you.

Harriet Bazley

unread,
Nov 2, 2023, 3:15:31 PM11/2/23
to
On 2 Nov 2023 as I do recall,
Brian Jordan wrote:


> The files I am converting are old Championship tables for a motor racing
> club. I can't remember how they were produced, presumably in a
> spreadsheet, but all calculations were completed before creating the web
> tables. I no longer have the original files but am able to grab the
> tables from the, soon to be closed, site. The club has asked if I can
> grab all of the tables from 1996 to the present and produce an inclusive
> pdf document. I have all the recent (post 2010) files here as Fireworkz
> files from which I produce pdfs and HTML (Using Paul Vigay's Webworkz)
> and the Fireworkz route for the old files suggested itself to me. The
> Easiwriter solution makes it all so much easier, thank you.

Excellent news!

--
Harriet Bazley == Loyaulte me lie ==

Those of you who think you know everything are annoying those of us who do.

Richard Torrens (News)

unread,
Nov 3, 2023, 7:05:15 AM11/3/23
to
In article <5afc8e1380b...@btinternet.com>,
Brian Jordan <brian....@btinternet.com> wrote:
> I have a lot of html tables which I want to convert and then, via
> Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
> to eventually make a pdf booklet. The bit from fireworkz onwards is tried
> and tested here and is fairly straightforward, the table to csv bit less
> so. It looks like some fairly heavy duty searching and replacing will be
> needed at first unless there is a program somewhere which might help me;
> is there such a thing?
> I am aware of some online stuff under Windows which might help but would
> really like to do the whole job under RISC OS. Any thoughts appreciated.
> Thanks
> B

If you have Iris - it can export as Text. It uses TAB chars between cells.

--
------------------------------------------------------------------
Richard Torrens. News email address is valid - for a limited time only.
You must use the full News+...@Torrens.org as in the From address.
http://www.Torrens.org for genealogy, natural history, wild food, walks, cats
and more!

Brian Jordan

unread,
Nov 3, 2023, 10:23:44 AM11/3/23
to
In article <5afd7a1c06news*@Torrens.org>,
Richard Torrens (News) <News+...@Torrens.org> wrote:
> In article <5afc8e1380b...@btinternet.com>,
> Brian Jordan <brian....@btinternet.com> wrote:

[Snip my original request]

> If you have Iris - it can export as Text. It uses TAB chars between
> cells.

I do and this knowledge has added a further string to my bow, many thanks.
In the last few days through the help of folks in these parts I have gone
to an "I wonder if..." to a cup overflowing situation. Thanks to all for
your help.

Harriet Bazley

unread,
Nov 4, 2023, 8:57:59 AM11/4/23
to
On 3 Nov 2023 as I do recall,
Richard Torrens (News) wrote:

> In article <5afc8e1380b...@btinternet.com>,
> Brian Jordan <brian....@btinternet.com> wrote:
> > I have a lot of html tables which I want to convert and then, via
> > Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
> > to eventually make a pdf booklet. The bit from fireworkz onwards is tried
> > and tested here and is fairly straightforward, the table to csv bit less
> > so. It looks like some fairly heavy duty searching and replacing will be
> > needed at first unless there is a program somewhere which might help me;
> > is there such a thing?
> > I am aware of some online stuff under Windows which might help but would
> > really like to do the whole job under RISC OS. Any thoughts appreciated.
> > Thanks
> > B
>
> If you have Iris - it can export as Text. It uses TAB chars between cells.
>
Even Netsurf does that...

--
Harriet Bazley == Loyaulte me lie ==

"An American is a man with two arms and four wheels".

Richard Torrens (News)

unread,
Nov 5, 2023, 10:06:55 AM11/5/23
to
In article <fc560afe5...@bazleyfamily.co.uk>,
Harriet Bazley <har...@bazleyfamily.co.uk> wrote:
>
> > If you have Iris - it can export as Text. It uses TAB chars between
> > cells.
> >
> Even Netsurf does that...

But it uses spaces - not TABs!

Harriet Bazley

unread,
Nov 5, 2023, 11:13:37 AM11/5/23
to
On 5 Nov 2023 as I do recall,
Richard Torrens (News) wrote:

> In article <fc560afe5...@bazleyfamily.co.uk>,
> Harriet Bazley <har...@bazleyfamily.co.uk> wrote:
> >
> > > If you have Iris - it can export as Text. It uses TAB chars between
> > > cells.
> > >
> > Even Netsurf does that...
>
> But it uses spaces - not TABs!
>
I'm definitely getting tabs, both from select-and-drag and from
Export->Text. Maybe it depends on the way the table was defined/laid
out in the first place? I've only been testing it on my own tables....

--
Harriet Bazley == Loyaulte me lie ==

Those who can't write, write manuals.

Vince M Hudd

unread,
Jan 3, 2024, 11:52:37 AM1/3/24
to
On 01/11/2023 15:44, Brian Jordan wrote:


> I have a lot of html tables which I want to convert and then, via
> Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
> to eventually make a pdf booklet. The bit from fireworkz onwards is tried
> and tested here and is fairly straightforward, the table to csv bit less
> so. It looks like some fairly heavy duty searching and replacing will be
> needed at first unless there is a program somewhere which might help me;
> is there such a thing?

I see I'm late to the party on this one (this is my 'annual' usenet
catchup!) so you already have solutions suggested, but I may as well add
that WebChange (with the aid of a suitable script) can do this.

(Although I no longer have the script to hand that I was using at the
time, it's one of the things I used to do as a demo of the software).

The only flaw was that it would only be able to handle the first table
it encountered.

--
Vince M Hudd

Vince M Hudd

unread,
Jan 6, 2024, 1:26:42 PM1/6/24
to
On 03/01/2024 16:52, Vince M Hudd wrote:

> I see I'm late to the party on this one (this is my 'annual' usenet
> catchup!) so you already have solutions suggested, but I may as well add
> that WebChange (with the aid of a suitable script) can do this.

> (Although I no longer have the script to hand that I was using at the
> time, it's one of the things I used to do as a demo of the software).

> The only flaw was that it would only be able to handle the first table
> it encountered.
I was reminded today that I'd let the webchange.co.uk domain go, and
hadn't updated the softrock.co.uk site accordingly - so WebChange has been
missing in action for a while. (h/t to Bernard Boase for pointing it out)

As a quick fix, I've created a new subdomain - webchange.softrock.co.uk
and mapped it to the server space that webchange.co.uk was previously
using, and I've updated the link on the WebChange page on softrock.co.uk
to point to it.

So as of now, WebChange can once again be downloaded.

https://www.softrock.co.uk/products/webchange.html
http://webchange.softrock.co.uk/

The main site itself hasn't actually been properly updated, though (i.e. I
haven't run WebChange on it) - so other links remain broken. That'll give
lots of "we found problems on your website" spammers even more reasons to
email me. ;)

--
Vince M Hudd
0 new messages