RDA OAI Harvest from Redbox

48 views
Skip to first unread message

Jerome Apresto

unread,
Oct 14, 2013, 8:21:36 PM10/14/13
to redbo...@googlegroups.com
Hi Guys,

Need help regarding OAI harvest from Redbox to RDA.

Questions:
1.  How do you guys configure the harvester from RDA?
2.  Are you using Redbox and Mint as a data source? (e.g. http://118.138.241.234/redbox/published/feed/oai and http://118.138.241.234/mint/published/feed/oai)




Regards,
Jerome Apresto
Charles Darwin University


Duncan Dickinson

unread,
Oct 14, 2013, 8:47:23 PM10/14/13
to ReDBox Developer List
Hi Jerome,

From the ReDBox/Mint side - depending on which RIF objects you're pushing out you'll need to configure for:
  • Mint to provide Party (people/groups), Activity and Service records
    • If you're using the NLA Party IDs you don't need to feed Party (People) records to RDA
    • If you're only using ARC and NHMRC Activities these don't go to RDA as ANDS already have them in the system
  • ReDBox will provide the Collection records


Hope that helps!

Cheers,

Duncan


--
You received this message because you are subscribed to the Google Groups "ReDBox Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to redbox-dev+...@googlegroups.com.
To post to this group, send an email to redbo...@googlegroups.com.
To view this discussion on the web, visit https://groups.google.com/d/msgid/redbox-dev/a25caa7c-ab12-4b96-8a70-b5c9582dceb3%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.



--
Cheers,


Duncan


Duncan Dickinson
QCIF Project Manager 
Central Queensland University

Contact me:
monday to thursday
ph: 07 3138 2084
m: 0432 402 511
skype: de.dickinson

website | calendar | LinkedIn

Marianne Brown

unread,
Oct 14, 2013, 8:53:18 PM10/14/13
to redbo...@googlegroups.com
Hi Jerome,

I have two RDA data sources - one for ReDBox and one for Mint.  The URI you have put for redbox is the same. For Mint, we created a new view called RDA_Harvest so my URI for that feed is 
http://my.domain/mint/RDA_Harvest/feed/oai - which makes available the service, activity and party-group records to RDA but not the party records as they are going to NLA. To set the view up for harvesting required following these instructions (https://sites.google.com/site/fascinatorhome/home/documentation/technical/details/branding-appearance/views) and then adding it to the rif section in the oai-pmh section in the system-config.json file.

--
Marianne Brown
marianne...@gmail.com


Grant Jackson

unread,
Oct 14, 2013, 9:04:00 PM10/14/13
to redbo...@googlegroups.com
Hi Jerome,

As mentioned by Marianne, we also have one data source for ReDBox & one for Mint. Eg. for one of these:

 URI: http://MYSERVER/redbox/published/feed/oai

 Harvest Method:  Harvested (OAI-PMH)

 OAI Set: [blank]

 Advanced Harvest Mode: I'm currently using Incremental, but I'm pretty sure I've used Standard & Full in the past.

 Harvest Date: [Choose something which doesn't clash with server or network maintenance windows]

 Harvest Frequency: [Weekly or Daily if incremental]

Also, you may need to ask your network or system admins to poke a hole in your filewall and/or web server access controls to allow the RDA harvester (Demo and/or Production systems) to access your ReDBox/Mint harvest URLs.

Hope that helps.

Cheers, Grant


Jerome Apresto

unread,
Oct 14, 2013, 9:15:53 PM10/14/13
to redbo...@googlegroups.com
Thanks for the immediate response!

I think my harvest on RDA from Redbox and Mint are successful, however, I'm missing a related object on all the parties harvested from MINT.
We've created our own parties (People/Group) in Mint not on NLA. 
How do I know when my parties are push to NLA? Is that an automatic pushed to NLA whenever I do the csv file import from MINT?

Regards,
Jerome Apresto

Duncan Dickinson

unread,
Oct 14, 2013, 9:36:37 PM10/14/13
to ReDBox Developer List
Hi Jerome,

Just to check, you're pushing party records to the NLA? You'd have set this up in your config. Alternatively you could just push party records to RDA. If you do the latter using local identifiers or handles then this should be quite fast



Cheers,

Duncan


--
You received this message because you are subscribed to the Google Groups "ReDBox Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to redbox-dev+...@googlegroups.com.
To post to this group, send an email to redbo...@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.



--

Jerome Apresto

unread,
Oct 15, 2013, 12:01:47 AM10/15/13
to redbo...@googlegroups.com
Hi Duncan,

I'm not pushing to NLA at the moment, and havent set it up on the config.
I'd rather push to RDA but how do I push from MINT to RDA using local identifier?
Where can I set local identifier of a party? Is that on the csv file?


Regards,
Jerome



On Tuesday, October 15, 2013 9:51:36 AM UTC+9:30, Jerome Apresto wrote:

Jerome Apresto

unread,
Oct 15, 2013, 7:43:41 PM10/15/13
to redbo...@googlegroups.com
Hi Guys, 

Sorry for bugging you around.
This is the situation after harvesting from RDA.
There are 2 data source (CDU-Redbox and CDU-Mint) in RDA.

When I look at the collection record that has been harvested. I've got no problem in "Related Object" part. (Please see attached file for the image "collection_rel_obj")
But, when I look at the party record, the title of the collection do not display. (Please see attached file for the image name "party_rel_obj").

When we talk to ANDS team, they said our identifier is not resolving to ANDS data.

Thanks for the help in advance!

Regards,
Jerome Apresto







On Tuesday, October 15, 2013 9:51:36 AM UTC+9:30, Jerome Apresto wrote:
collection_rel_obj.JPG
party_rel_obj.JPG

Duncan Dickinson

unread,
Oct 15, 2013, 8:33:26 PM10/15/13
to ReDBox Developer List
Hi Jerome,

The harvest config sets up the identifier: 
It'd be useful for me if you can post the RIF record for the collection and the party records - preferably the ones produced in ReDBox and Mint for the published collection.

Cheers,

Duncan


--
You received this message because you are subscribed to the Google Groups "ReDBox Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to redbox-dev+...@googlegroups.com.
To post to this group, send an email to redbo...@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

Jerome Apresto

unread,
Oct 15, 2013, 8:37:08 PM10/15/13
to redbo...@googlegroups.com
RIF for Party

<registryObjects xsi:schemaLocation="http://ands.org.au/standards/rif-cs/registryObjects http://services.ands.org.au/documentation/rifcs/1.3/schema/registryObjects.xsd" >
<registryObject group="Charles Darwin University, Australia" >
<originatingSource>http://118.138.241.234/mint/Parties_People</originatingSource>
<party type="person" >
<identifier type="uri" >http://www.cdu.edu.au/research/profiles/SusanBandias.htm</identifier>
<name type="primary" >
<namePart type="given" >Susan </namePart>
<namePart type="family" >Bandias</namePart>
<namePart type="title" >Dr</namePart>
</name>
<location>
<address>
<electronic type="url" ></electronic>
</address>
<address>
<electronic type="email" >
<value>susan....@cdu.edu.au</value>
</electronic>
</address>
</location>
<description type="full" >Dr Susan Bandias is a Senior Lecturer in the School of Business at Charles Darwin University. Her research interests are telecommunications, social media, women in the information communication technology industry, business education pedagogy and the economics of work.</description>
<relatedObject>
<relation type="isCollectorOf" ></relation>
</relatedObject>
<relatedObject>
<relation type="hasAssociationWith" ></relation>
</relatedObject>
</party>
</registryObject>
</registryObjects>


RIF for Collection

<rif:registryObjects xsi:schemaLocation="http://ands.org.au/standards/rif-cs/registryObjects http://services.ands.org.au/documentation/rifcs/1.3/schema/registryObjects.xsd" >
<rif:registryObject group="Charles Darwin University, Australia" >
<rif:key>CDU-Collection-0010</rif:key>
<rif:originatingSource>http://118.138.241.234/redbox/default</rif:originatingSource>
<rif:collection type="datasetdateAccessioned="2013-09-26T00:00:00" >
<rif:identifier type="local" >CDU-Collection-0010</rif:identifier>
<rif:name type="primaryxml:lang="en" >
<rif:namePart>Australian Computer Society Women's Board Surveys 2008/2010/2012 Datatset (raw data surveys from 2003 women employed in the Australian information communication technology industry)</rif:namePart>
</rif:name>
<rif:location>
<rif:address>
<rif:electronic type="url" ></rif:electronic>
</rif:address>
</rif:location>
<rif:coverage>
<rif:temporal>
<rif:date type="dateFromdateFormat="W3CDTF" >2008-01-01T00:00:00Z</rif:date>
<rif:date type="dateTodateFormat="W3CDTF" >2013-01-01T00:00:00Z</rif:date>
</rif:temporal>
</rif:coverage>
<rif:coverage>
<rif:spatial type="iso31661xml:lang="en" >AU</rif:spatial>
</rif:coverage>
<rif:relatedObject>
<rif:relation type="hasCollector" ></rif:relation>
</rif:relatedObject>
<rif:relatedObject>
<rif:relation type="isManagedBy" >
<rif:description>Primary Contact</rif:description>
</rif:relation>
</rif:relatedObject>
<rif:relatedObject>
<rif:relation type="hasAssociationWith" >
<rif:description>Supervisor</rif:description>
</rif:relation>
</rif:relatedObject>
<rif:subject type="localxml:lang="en" >women</rif:subject>
<rif:subject type="localxml:lang="en" >information communication technology</rif:subject>
<rif:subject type="localxml:lang="en" >employment</rif:subject>
<rif:subject type="localxml:lang="en" >Australian Computer Society</rif:subject>
<rif:subject type="localxml:lang="en" >information technology</rif:subject>
<rif:subject type="localxml:lang="en" >career lifecycle</rif:subject>
<rif:subject type="anzsrc-forxml:lang="en" >089999</rif:subject>
<rif:subject type="anzsrc-forxml:lang="en" >150311</rif:subject>
<rif:description type="fullxml:lang="en" >An online survey of Australian Computer Society 'Women Members' undertaken in years 2008, 2010 and 2012. The cohort was women employed (past or present) in the Australian Information Communication Technology (ICT) industry. The surveys comprised questions covering demographic information, employment, influences and challenges, soft skills, career impacts and ACS membership. The dataset includes raw data for 678 participants in the 2008 survey, 787 participants in and 538 participants in the 2012 survey, giving a total of 2003 returned surveys. The collated raw data for each of the years is collated into separate Access databases, thus three databases.</rif:description>
<rif:rights>
<rif:rightsStatement>Intellectual Property held by the Australian Computer Society.</rif:rightsStatement>
<rif:accessRights>Mediated access. Please contact Susan Bandias about the dataset. Susan....@cdu.edu.au</rif:accessRights>
</rif:rights>
<rif:relatedInfo type="publication" >
<rif:identifier type="uri" >http://epubs.scu.edu.au/jesp/vol14/iss1/2/</rif:identifier>
<rif:title>Warne. L., Bandias. S. and Fuller.D. (2011) The Employment Experiences of Women in the Australian Information Communication Technology Industry, Journal of Economic and Social Policy: Vol. 14: Iss. 1, Article 2</rif:title>
</rif:relatedInfo>
<rif:relatedInfo type="publication" >
<rif:identifier type="uri" >1324-5945</rif:identifier>
<rif:title>Bandias.S. 2010 An overview of the 2010 ACS-W Survey. Information Age: March – April 2010</rif:title>
</rif:relatedInfo>
<rif:relatedInfo type="publication" >
<rif:identifier type="uri" >1324-5945</rif:identifier>
<rif:title>Bandias.S. and Warne. L.(2009) What Makes Women Work? Information Age: August-Sept 2009</rif:title>
</rif:relatedInfo>
<rif:relatedInfo type="publication" >
<rif:title>The Career Stage Effect on Womenin ICT: An Overview of the ACS-W Survey. Paper presented at the Working Women’s Conference Darwin NT</rif:title>
</rif:relatedInfo>
<rif:relatedInfo type="publication" >
<rif:identifier type="uri" >http://aisel.aisnet.org/acis2009/103/</rif:identifier>
<rif:title>Bandias.S. and Warne.L. Women in ICT – Retain and Sustain: An Overview of the ACS-WSurvey. 20th Australasian Conference on Information Systems. ACIS 2009 Conference Clayton Melbourne 2 – 4 December 2009</rif:title>
</rif:relatedInfo>
<rif:relatedInfo type="website" >
<rif:identifier type="uri" >http://www.acs.org.au/communities/acs-women/initiatives/surveys</rif:identifier>
<rif:title>Australian Computer Society Surveys webpage</rif:title>
</rif:relatedInfo>
</rif:collection>
</rif:registryObject>
</rif:registryObjects>


Thanks Duncan!

Regards,
Jerome Apresto

Duncan Dickinson

unread,
Oct 15, 2013, 9:28:36 PM10/15/13
to ReDBox Developer List
Hi Jerome,

Looking at the collection link:
<rif:relatedObject>
<rif:key>http://118.138.241.234/mint/published/detail/2af55b3b4af8b6608588bb3654192a24</rif:key>
<rif:relation type="hasCollector" ></rif:relation>
</rif:relatedObject>

And then back from the party:
<relatedObject>
<key>http://118.138.241.234/redbox/published/detail/664ed5847c79053ed4f164a5d475a038</key>
<relation type="isCollectorOf" ></relation>
</relatedObject>

The relationships look fine to me.

However, it's the Collection Key (CDU-Collection-0010) that's not quite right. My thought is that if you're using the local curation plugin with increments (http://www.redboxresearchdata.com.au/documentation/system-administration/integration/local-curation) to create this then the Party record should be updated to use the local ID at curation time. If all of the related records are curating correctly, the keys should be sorted out for you. In Mint you can login as admin and check each record to make sure that curation is happening and the IDs that are being created. 

Remember that it's after curation that all the ID/keys line up.

  • Log in as admin/admin
  • You'll see the curation data and status
  • It will show a link to a collection in ReDBox - you can go to that and login to demo RB with admin/admin
In the Mint record you'll see a metadata.json metadata stream and in ReDBox you'll see a <id>.tfpackage metadata stream.

The Mint record has the following segment:
{
"identifier": "http://demo.redboxresearchdata.com.au/redbox/published/detail/03abc59403657a3978e58d8b27bd486e",
"curatedPid": "http://demo.redboxresearchdata.com.au/redbox/published/detail/03abc59403657a3978e58d8b27bd486e",
"broker": "tcp://localhost:9101",
"isCurated": true,
"relationship": "hasAssociationWith",
"uniqueString": "{\"identifier\":\"http:\\/\\/demo.redboxresearchdata.com.au\\/redbox\\/published\\/detail\\/03abc59403657a3978e58d8b27bd486e\",\"curatedPid\":\"http:\\/\\/demo.redboxresearchdata.com.au\\/redbox\\/published\\/detail\\/03abc59403657a3978e58d8b27bd486e\",\"broker\":\"tcp:\\/\\/localhost:9101\",\"isCurated\":true,\"relationship\":\"hasAssociationWith\"}"
}

and the ReDBox record has the following segment:
{
            "field": "locrel:prc.foaf:Person.dc:identifier",
            "authority": true,
            "identifier": "redbox-mint.googlecode.com/parties/people/1242",
            "relationship": "hasAssociationWith",
            "reverseRelationship": "hasAssociationWith",
            "description": "Primary Contact",
            "broker": "tcp://localhost:9201",
            "isCurated": true,
            "curatedPid": "http://demo.redboxresearchdata.com.au/mint/published/detail/e6656a2c87d086b015db7e4d9e60c65e"
        }


You can see from this that the relationship has been curated and does link via the correct IDs. The correct links also display in the RIF feeds under the Published view:

At this point I don't have a definitive answer but feel it's a config issue. I've got a stack of work to get through today so will now need to leave it to your investigations.


Cheers,

Duncan



For more options, visit https://groups.google.com/groups/opt_out.

Grant Jackson

unread,
Oct 16, 2013, 12:07:27 AM10/16/13
to redbo...@googlegroups.com
Hi Jerome,

Here's a wild guess why the collection key might not be the expected one (but untested). I wonder if the problem might be that the following was *not* done during the creation/editing of the dataset record?

In the Management tab:
 - set Type of identifier to "Local identifier" and
 - for Identifier, check the checkbox "Use this record's ID"
 
That's my 2 cents worth.

Cheers, Grant


Jerome Apresto

unread,
Oct 16, 2013, 1:42:25 AM10/16/13
to redbo...@googlegroups.com
Thanks Grant! it works!

Regards,
Jerome
Reply all
Reply to author
Forward
0 new messages