"Store file source" for multi-sheet Excel document

Markus Wust

Jul 6, 2022, 10:47:15 AM
to OpenRefine
Dear all,

A colleague asked me to help with a data clean-up project. Using OpenRefine 3.5.2, I'm trying to import an Excel (.xlsx) spreadsheet with six worksheets. When selecting the worksheets that should be imported, I can see their names listed, preceded by the filename and a hash.

However, when I select "Store file source" as a parsing option, the preview (and the final document) only show the filename and not the name of the worksheet. Some (slightly) older online tutorials suggested that the worksheet name would also be included. Has this feature been deprecated, or is there anything else I should do to get the worksheet names displayed together with the filename?

Thank you
Markus Wust

Owen Stephens

Jul 7, 2022, 6:23:46 AM
to OpenRefine
It looks to me like this changed with OpenRefine 3.5.0. Testing with OpenRefine 3.4.1, the sheet names import as expected. With OpenRefine 3.5.0, they don't.

A large amount of work to fix some underlying issues with the importers was done around that time, and it's possible that this problem arose from that work. I'm relatively confident that this is a bug rather than a deliberate feature deprecation, although I may be wrong.

I've created an issue on GitHub to see if this can be investigated/fixed: https://github.com/OpenRefine/OpenRefine/issues/5034
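
For reference, the labels shown in the import dialog (the filename, a hash, then the worksheet name) can be reproduced outside OpenRefine, which may help if the source of each row has to be filled in by hand until the issue is resolved. The following is only a minimal sketch in Python using the standard library; it is not OpenRefine's own code. The function name `sheet_labels` is hypothetical, and it assumes a regular (transitional) .xlsx file whose `xl/workbook.xml` uses the usual SpreadsheetML namespace.

```python
import os
import zipfile
import xml.etree.ElementTree as ET

# Namespace used by xl/workbook.xml in a regular (transitional) .xlsx file.
# Assumption: files saved as "Strict Open XML" use a different namespace
# and would need this constant changed.
MAIN_NS = "http://schemas.openxmlformats.org/spreadsheetml/2006/main"


def sheet_labels(xlsx, filename=None):
    """Return 'filename#sheetname' labels for every worksheet in an .xlsx file.

    `xlsx` is a path or a binary file-like object. `filename` overrides the
    name used in the labels (required when `xlsx` is not a path).
    """
    if filename is None:
        filename = os.path.basename(xlsx)
    # An .xlsx file is a zip archive; the sheet names live in xl/workbook.xml.
    with zipfile.ZipFile(xlsx) as archive:
        root = ET.fromstring(archive.read("xl/workbook.xml"))
    names = [sheet.get("name") for sheet in root.iter("{%s}sheet" % MAIN_NS)]
    return ["%s#%s" % (filename, name) for name in names]
```

Calling `sheet_labels("workbook.xlsx")` (a placeholder path) returns one label per worksheet, in workbook order.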