Hey,
Currently OpenRefine thanks to the record mode support complexe XML file import. Each child of the main element is stored in a new row and the record mode help to manipulate the data. The templating export is great to convert basic csv to XML.
However it will be even better if it support complexe XML export based on records. In this scenario the templating export will iterate for every non blank row within a record and create a child element.
This process will be the reverse of the current import function and will bridge the gap to have a complete tool for XML import, transformation / cleaning and export.
I wanted to open the discussion here and see if it worth creating an issue / request.
Thanks
Martin
I hope I'm understanding this correctly (I'm not a programmer AT ALL), but are you saying that if data in OpenRefine takes advantage of the Record mode (is that the same as Column Groups?), that it cannot be converted to XML (without what for me would be impossible scripting)? For example, I opened an Excel spreadsheet in OpenRefine to clean up some data. The spreadsheet describes some old radio programs, and each row represents an episode of the program series. The thing is, in a given episode, there may be 2-3 actors, and there may be alternate titles for some episodes, so to preserve that hierarchical structure (2 actors under one radio program), I used the Record feature in OpenRefine. I need to convert this, ultimately, to XML. I was going to use Talend Open Source for that, but I don't know how to represent the OpenRefine concept of Records in Talend. Talend looks at a"spreadsheet" as flat, but in the OpenRefine Record structure, it's really hierarchical.Are you saying I can't export Records from OpenRefine (at present) in a way that preserves that hierarchical structure represented in OpenRefine Records? Can Excel handle this kind of hierarchical structure at all?I work in a library, and I can see that OpenRefine, Talend, and similar tools could be so useful to us (in converting one form of data to another), but I'm just starting to learn all this.Many thanks.Marsha
--
You received this message because you are subscribed to a topic in the Google Groups "Open Refine" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/openrefine/l22PL_IQyTY/unsubscribe?hl=en-US.
To unsubscribe from this group and all its topics, send an email to openrefine+...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
Are you saying I can't export Records from OpenRefine (at present) in a way that preserves that hierarchical structure represented in OpenRefine Records? Can Excel handle this kind of hierarchical structure at all?
(doesn't nearly all XML have hierarchy and repeating elements?). JSON is hierarchical; is this also difficult to map to flat structures?
--
You received this message because you are subscribed to the Google Groups "Open Refine" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openrefine+...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to a topic in the Google Groups "Open Refine" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/openrefine/l22PL_IQyTY/unsubscribe?hl=en-US.
To unsubscribe from this group and all its topics, send an email to openrefine+...@googlegroups.com.
Does that include nested, repeating elements? It looks like XML can be imported into Excel, edited in "XML tables," and exported as XML. But what about opening an Excel spreadsheet in Excel and exporting it as hierarchical XML that can include repeating elements? Of course, we'd need to set up the Excel spreadsheet in a way that shows which cells in which rows should be exported into the same resulting XML element (as in the OpenRefine "record" model)
--