problem of type of document while harvesting metadata through OAI-PMH

81 views
Skip to first unread message

Abid Fakhre Alam

unread,
Sep 28, 2020, 5:16:40 AM9/28/20
to AtoM Users
Dear All,

While working on OAI-PMH on atom to harvest the metadata to our discovery tool. I found one serious problem about the type of textual document on atom it show image but originally it is textual document (pdf).
when I harvest the metadata to our discovery tool it takes type of document as image.
when I saw the record structure in xml format on atom it show image type below is the reference, I choose ISAD standard.
when I access the below url, there the General material designation : Textual record and  Level of description : File
when I export that record in xml Dublin core, here the type is showing image.

 
<dc:date>Apr. 10, 1972</dc:date>
<dc:type>image</dc:type>
<dc:format>image/jpeg</dc:format>
<dc:format>1 folder of textual records (53 pages)</dc:format>
<dc:identifier>/council-meeting-minutes-apr-10-1972</dc:identifier>

Dan Gillean

unread,
Sep 30, 2020, 6:12:32 PM9/30/20
to ICA-AtoM Users
Hello Abid, 

I have checked a few of the PDF / text documents in our public demo site, and they seem to show the format correctly in the DC XML. For example: 
  1. https://demo.accesstomemory.org/certificates-and-correspondence
  2. https://demo.accesstomemory.org/chelmsford-growth-and-development-2
I noticed that in the digital object metadata area on these examples, the media type is listed as "Text". On the CVA site you sent as an example, it seems that the Visible elements module settings are not showing this information to public users, so I can't compare. However, the media type is an editable property from a controlled list. 

Can you try something for me, in light of this? 
  1. Log in and navigate to a description with a PDF where this is an issue
  2. Using the "More" button in the button block at the bottom of the page, select "Edit digital object"
  3. Check the value in the "Media type" drop-down below the Master digital object
  4. If it is not "Text," try changing it and saving your changes
  5. Check to see if the DC XML now properly lists the format information
I would also suggest creating a new test draft description with a new PDF upload, to see what happens by default. 

Please let me know what you find, and if this resolves the issue. If not, I'll do some further testing to determine if we need to file a bug ticket. 

Cheers, 

Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory
he / him


--
You received this message because you are subscribed to the Google Groups "AtoM Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ica-atom-user...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ica-atom-users/312bc8e7-1372-4fac-bc7a-009e10508514n%40googlegroups.com.

Abid Fakhre Alam

unread,
Oct 1, 2020, 6:07:09 AM10/1/20
to ica-ato...@googlegroups.com
Dear Dan,

As you suggest to check the media type is text or not. I have checked there I mention text but in XML format it is showing image.

Here I also attached screenshot for your reference.



You received this message because you are subscribed to a topic in the Google Groups "AtoM Users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/ica-atom-users/xtuSPjAurs8/unsubscribe.
To unsubscribe from this group and all its topics, send an email to ica-atom-user...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ica-atom-users/CAC1FhZ%2BV_hJMuRqNABWLjOseu4LLuJrY5BdWdZGP-pVtRdVUCg%40mail.gmail.com.


--
Best regards,
Abid Fakhre Alam
chrome_0amXf2CVxc.png

Abid Fakhre Alam

unread,
Oct 1, 2020, 6:11:16 AM10/1/20
to AtoM Users
Here I attached the xml record
chrome_svaiH7BVuL.png

Dan Gillean

unread,
Oct 1, 2020, 8:50:17 AM10/1/20
to ICA-AtoM Users
Ok, thank you for providing this, Abid. 

I will do some further testing and see if I can reproduce the issue you describe. Can you please tell what version of AtoM you are using?

Cheers, 

Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory
he / him

Abid Fakhre Alam

unread,
Oct 1, 2020, 10:55:55 AM10/1/20
to ica-ato...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages