extract metadata and doi import

323 views
Skip to first unread message

sunit...@gmail.com

unread,
Feb 20, 2022, 12:25:33 AM2/20/22
to DSpace Technical Support
1. Dspace 7.2 documentation mentions about extracting metadata from uploaded pdf files I tried enabling following text in item-submission.xml and restarted tomcat and pm2 but nothing is visible on screen. Does this function work in 7.2

 <step-definition id="extractionstep">
            <heading>submit.progressbar.ExtractMetadataStep</heading>
            <processing-class>org.dspace.app.rest.submit.step.ExtractMetadataStep</processing-class>
            <type>extract</type>
        </step-definition>

2.  Will there be any plan to use doi in item submission and metadata will be populated automatically.

reg

Sunita Barve

Tim Donohue

unread,
Feb 23, 2022, 6:05:07 PM2/23/22
to DSpace Technical Support
To answer your questions.

1. The Extract Metadata step doesn't have a section associated with it.  It also will not work for PDFs at this time.  It primarily only support Bibliographic formats like documented at https://wiki.lyrasis.org/pages/viewpage.action?pageId=104566672 .  Uploading these same bibliographic files from the MyDSpace page will also trigger the metadata to be extracted in a brand new submission form.

2. DOI population already works if you perform a "lookup" from the MyDSpace page to start a new submission.  On the MyDSpace page, at the top, next to the Drag & Drop section, there's an import button.  If you click that, you can select to import from PubMed and other sources...and you can paste in a DOI there to find an article quickly, then click Import to start a new submission with its metadata.

We don't yet fully support lookup of a DOI after you've already started the submission.  This is a more complex scenario as it could require having to determine the difference between any metadata you've already entered and any metadata returned by the DOI.  It *is* a task on our Roadmap (and ranked #3 under "Highest Priority"): https://wiki.lyrasis.org/display/DSPACE/DSpace+Release+7.0+Status#DSpaceRelease7.0Status-Tier1:HighestPriorityTasksfor7.x

Unfortunately, we've had to temporarily delay it at this time because of the complexity of this metadata comparison. I do expect it still will happen, but I don't have a timeline as we need to investigate it further.

Tim

sunit...@gmail.com

unread,
Feb 26, 2022, 7:02:37 AM2/26/22
to DSpace Technical Support
Thank you Tim for updating. How can I enable other sources alongwith PubMed and ArXiv so that I an use that feature and import using doi.

reg

Sunita

Tim Donohue

unread,
Feb 28, 2022, 11:06:41 AM2/28/22
to DSpace Technical Support
DSpace 7 only supports limited import sources at this time. It's not possible to enable others, unless you create code to do the import yourself.  For more information, see https://wiki.lyrasis.org/display/DSDOC7x/Live+Import+from+external+sources

It is possible that DSpace 7.3 (due in June) might include some additional sources "out-of-the-box", as we are working on this ticket to migrate ~11 additional import sources into DSpace: https://github.com/DSpace/DSpace/issues/3359

Tim

Reply all
Reply to author
Forward
0 new messages