Following your recipe almost works, but it leaves some strange results
that I don't understand.
There are 1042 rows, each containing a version of that HTML table in
GemVerzTable. They extract cleanly into GemVerzTH and GemVerzTD (see
shot1; the "dummy" is prepended intentionally).
Splitting also looks fine (shot2).
Now I set any column left of a "dummy" column to "dummy", so that I
don't get several headers later (shot3).
Columnizing by key column now works fine for those "dummy" columns, but
the rest of the values of row 1 drop down to row 2 (shot4; note that
only 997 of the initial 1042 rows are left).
All remaining values are simply lost. It looks as if they had been
pushed out through the top, into row -1 and beyond.
Any clue what's going on here?
shot1: <https://transfer.sh/OJW9t/shot1.png>
shot2: <https://transfer.sh/12MyQm/shot2.png>
shot3: <https://transfer.sh/jjD7v/shot3.png>
shot4: <https://transfer.sh/K2Ffx/shot4.png>
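
For comparison, here is a minimal Python sketch of what the quoted recipe
is meant to produce for a single row: collect the th and td cells, join
each set with "|", split again, and pair headers with values. It uses
only the standard library. The sample table, the class CellCollector and
the function extract_cells are hypothetical names made up for this
illustration; this is not how OpenRefine works internally.

```python
from html.parser import HTMLParser


class CellCollector(HTMLParser):
    """Collect the text of <th> and <td> cells in document order."""

    def __init__(self):
        super().__init__()
        self.headers = []    # text of each <th>
        self.values = []     # text of each <td>
        self._target = None  # list receiving the cell currently open
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in ("th", "td"):
            self._target = self.headers if tag == "th" else self.values
            self._buffer = []

    def handle_data(self, data):
        if self._target is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._target is not None:
            self._target.append("".join(self._buffer).strip())
            self._target = None


def extract_cells(table_html):
    """Return (headers, values) as lists of cell texts."""
    parser = CellCollector()
    parser.feed(table_html)
    parser.close()
    return parser.headers, parser.values


# Hypothetical sample; the real GemVerzTable cells will look different.
sample = (
    "<table>"
    "<tr><th>Name</th><td>Musterdorf</td></tr>"
    "<tr><th>Kanton</th><td>ZH</td></tr>"
    "</table>"
)

headers, values = extract_cells(sample)
th_joined = "|".join(headers)  # counterpart of GemVerzTH
td_joined = "|".join(values)   # counterpart of GemVerzTD

# Pairing by position is what turns header values into column names.
record = dict(zip(th_joined.split("|"), td_joined.split("|")))
print(th_joined, td_joined, record)
```

Note that zip stops at the shorter list, so a table with more td than th
cells would silently drop the surplus values in this sketch, and a cell
that itself contains "|" would shift everything after it. Comparing the
two cell counts per row may help to spot tables where headers and values
do not line up; whether that is related to the lost values above is only
a guess.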
Ettore Rizza <ettor...@gmail.com> writes:
> So, you need to parse the HTML table. The best way to do that is to first
> extract each row with this GREL formula:
>
> forEach(value.parseHtml().select('td'), e, e.htmlText()).join('|')
>
> Then extract the headers with this one:
>
> forEach(value.parseHtml().select('th'), e, e.htmlText()).join('|')
>
> You can then split the new cells using the symbol | as the separator. If
> you want the header values as column names, you finally need to run
> "Columnize by key/value columns".
>
> Here is a screencast; it will be clearer than my explanations.
>
> <https://lh3.googleusercontent.com/-qzjuwRUTQ08/War5LWAMl1I/AAAAAAAAUYs/YkG8QK8_Zn4cqs4kg74PgkKbyWRtO92YwCLcBGAs/s1600/screencast.gif>
>
> On Saturday, 2 September 2017 at 20:06:42 UTC+2, Sebastian Lipp wrote:
>>
>> Hello Ettore,
>>
>> Ettore Rizza <ettor...@gmail.com> writes:
>> > Where is this HTML table stored? In a file on your computer or in a
>> > single OpenRefine cell?
>>
>> For each row, a version of this table is stored in a single cell. I want
>> it to "expand" into several new cells.
>>