How to import Thesauri from csv?

32 views
Skip to first unread message

Leonidas

unread,
Apr 1, 2020, 1:11:56 PM4/1/20
to Arches Project
How can batch import Thesauri from csv file (or any other file format)?

Alina Myklebust

unread,
Apr 3, 2020, 9:46:51 PM4/3/20
to Arches Project
Hi Leonidas,

Sorry your question has gone unanswered. Hopefully, someone in the community can provide some input. 

If I find helpful information, I will follow up here.

All the best,

Alina

Adam Lodge

unread,
Apr 6, 2020, 11:27:59 AM4/6/20
to Leonidas, Arches Project
Hi Leonidas,

To my knowledge the RDM only imports data from a specific flavor of xml called SKOS.  Workflow documented here: https://arches.readthedocs.io/en/stable/rdm/#importing-a-scheme

I concur that ability to import to RDM from a simpler format like csv would be a welcome addition to Arches.

Adam

---
Adam Lodge
Geospatial Systems Consultant
Farallon Geographics

On Apr 1, 2020, at 10:11 AM, Leonidas <leo.d....@gmail.com> wrote:

How can batch import Thesauri from csv file (or any other file format)?

--
-- To post, send email to arches...@googlegroups.com. To unsubscribe, send email to archesprojec...@googlegroups.com. For more information, visit https://groups.google.com/d/forum/archesproject?hl=en
---
You received this message because you are subscribed to the Google Groups "Arches Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to archesprojec...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/archesproject/ca08a11d-b1e7-437e-8a25-2de3b0ae6ec5%40googlegroups.com.

Adam Cox

unread,
Apr 6, 2020, 4:40:08 PM4/6/20
to Adam Lodge, Leonidas, Arches Project
Hi Leonidas, I've been meaning to respond to this thread for a few days, always thinking I'd have a little more time to prepare... At any rate, no, the RDM in v4/5 has never imported CSV files (it more or less did in v3). However, a long time ago I made a conversion utility for a project I was on. You can find it here, somewhat buried within a v4/5 Arches package: https://github.com/legiongis/fpan-data/tree/master/utils. The top half of the README is relevant to you, and the scripts and sample data are inside of the "reference_data" directory. Feel free to copy those files out to your own repo if you want. Also note that I am in the process of making some changes related to the business_data side of the utils directory, so some updates may change the way that README looks in the future.

If you can get the scripts working, they take a set of CSV files and convert them into a single thesaurus, and a set of collections that mimic the structure each CSV. So, if you have a CSV called Styles, a new Top Concept called "Styles" will be created and then the contents of the CSV will be added as individual member concepts within that top concept. Then a new Collection is made by the same name and the concepts are added to it as well. Note that this is a naive structure, and doesn't support nested concepts, which I know many thesauri rely on (mine for that project didn't so I was ok with that limitation).

Hope that is somewhat helpful,
Adam

Reply all
Reply to author
Forward
0 new messages