Getting rid of jats-Tags in CrossRef-Import

66 views
Skip to first unread message

Jarmo Schrader

unread,
Feb 28, 2024, 11:56:34 AM2/28/24
to dspac...@googlegroups.com
Hi Group,

abstracts imported from CrossRef often contain jats-tags (Journal Publishing Tag Set). DSpace does not seem to interpret these tags but instead shows them as normal text which is ugly an requires manual fixing.
Does anyone have an established solution to remove these tags during import?
Or is there a way to make DSpace interpret the tags for display so we could keep them?

Cheers
Jarmo

--
Dr. Jarmo Schrader
stellv. Bibliotheksleiter
Fachreferat und EDV
Universität Hildesheim
Universitätsbibliothek
Universitätsplatz 1
31141 Hildesheim

Tel: +49 (0) 5121 - 883 - 93004
jarmo.s...@uni-hildesheim.de

Sascha Szott

unread,
Feb 28, 2024, 12:32:38 PM2/28/24
to dspac...@googlegroups.com
Hi,

we are facing the same problem in the abstract field of the JSON
response returned by CrossRef.

I've created a Github issue in the DS CRIS project:
https://github.com/4Science/DSpace/issues/435

But now I realize that this function is not CRIS specific. Currently, we
are trying to provide a bug fix by removing the JATS markup.

I'll move the issue (and PR) to the DSpace Github project.

Best regards,
Sascha

Am 28.02.24 um 17:56 schrieb Jarmo Schrader:
> --
> All messages to this mailing list should adhere to the Code of Conduct:
> https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx
> <https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx>
> ---
> You received this message because you are subscribed to the Google
> Groups "DSpace Technical Support" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to dspace-tech...@googlegroups.com
> <mailto:dspace-tech...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/dspace-tech/94025452-9be4-4550-9225-52a81d537dda%40uni-hildesheim.de <https://groups.google.com/d/msgid/dspace-tech/94025452-9be4-4550-9225-52a81d537dda%40uni-hildesheim.de?utm_medium=email&utm_source=footer>.


Sascha Szott

unread,
Feb 29, 2024, 8:40:16 AM2/29/24
to dspac...@googlegroups.com
Hi,

we've provided a PR in Github to remove JATS tags in CrossRef abstracts:

https://github.com/DSpace/DSpace/pull/9386

Best regards
Sascha


Am 28.02.24 um 18:32 schrieb Sascha Szott:
--
Sascha Szott
Abteilung Forschungsinformation und Publizieren
Universitätsbibliothek
Helmut-Schmidt-Universität
Universität der Bundeswehr Hamburg
Holstenhofweg 85
22043 Hamburg

📞 +49 171 6433825
🌍 https://ub.hsu-hh.de/
Reply all
Reply to author
Forward
0 new messages