Indexing Issues with University Institutional Repository

305 views
Skip to first unread message

Fares Mezrag

unread,
Sep 25, 2024, 8:24:42 AM9/25/24
to DSpace Technical Support
Dear Dspace Technical Team,

I have an issue with the indexing of our institutional repository, University of M'sila Institutional Repository (https://dspace.univ-msila.dz/), on Google Scholar.

In September 2023, our repository boasted approximately 28,900 indexed items. However, in December 2023, we witnessed a dramatic decrease. All previously indexed items have vanished, leaving our current count at zero on Google Scholar (March 2024). With the knowledge that we were using the old version of DSpace (Version 4.2), we have updated DSpace to version 7.6.1 since April 2024. Additionally, we followed the official search engine optimization guide. Unfortunately, the problem persists.

For your reference, here's a search for our repository on Google Scholar:

https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=site%3Adspace.univ-msila.dz&oq=

Our university's technical team has tirelessly attempted to resolve the issue, but to no avail.

We urgently request any guidance or information you can offer to rectify the indexing problems plaguing our repository. Additionally, if possible, we'd appreciate knowing the point of contact at Google Scholar who handles such matters.
Sincerely,
Dr Fares Mezrag
IT Manager
M’sila University

DSpace Technical Support

unread,
Oct 3, 2024, 12:21:48 PM10/3/24
to DSpace Technical Support
Hi,

You should get in touch with Google Scholar to see if they have advice on what might be going on.   See their Troubleshooting guide, which also has a link on how to contact them (see bullet 6 of that guide):  https://scholar.google.com/intl/en/scholar/inclusion.html#troubleshooting

Please be aware that DSpace 7.6.2 also did provide some SEO fixes.  See the "SEO improvements" in the 7.6.2 release notes: https://wiki.lyrasis.org/display/DSDOC7x/Release+Notes#ReleaseNotes-7.6.2ReleaseNotes

Finally, make sure you are obviously following all the SEO guidelines at https://wiki.lyrasis.org/display/DSDOC7x/Search+Engine+Optimization

Tim

Sean Carte

unread,
Oct 4, 2024, 6:39:09 AM10/4/24
to DSpace Technical Support
Hi Dr Mezrag

We have a similar problem with a 7.6.2 site:

I've gone through the guide repeatedly and have requested help via the email address, but had no response.

Please let me know if you find a way to resolve this.

Sean

--
All messages to this mailing list should adhere to the Code of Conduct: https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx
---
You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dspace-tech...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dspace-tech/15ec959b-5509-45b2-953a-61da6b34e6aen%40googlegroups.com.

Fares Mezrag

unread,
Oct 4, 2024, 3:30:34 PM10/4/24
to DSpace Technical Support
Thank you for the reply. I will follow the notes regarding the upgrade to version 7.6.2.

Jarmo Schrader

unread,
Oct 7, 2024, 1:14:16 PM10/7/24
to dspac...@googlegroups.com
Hi,

we had the same problem and I think I found the solution today:

Google Scholar needs certain metadata-tags in the <head> section of the site in order to index your repository properly. One of them is citation_pdf_url. This should be the url of the (PDF)fulltext document. On your server the tag looks like this: <meta name="citation_pdf_url" content="http://localhost:4000/bitstreams/59de15c5-8de0-4fb4-b90f-66979d4a1ee7/download">

Instead of the hostname there ist a "localhost" in the url. This is probably because you are using a proxy in front of dspace.

For us the solution was to add a line with "ProxyPreserveHost On" to our Apache-Configuration. Now citation_pdf_url lists the correct URL for the pdf. This is our apache config:

  ProxyPreserveHost On
  ProxyPass /server http://127.0.0.1:8080/server
  ProxyPassReverse /server http://127.0.0.1:8080/server
  ProxyPass / http://127.0.0.1:4000/
  ProxyPassReverse / http://127.0.0.1:4000/
  RequestHeader set X-Forwarded-Proto https
  RequestHeader set X-Forwarded-Host our.repository-domain.com

(I am not an Apache expert, there might be a better solution, but it seems to work. Since I only found this today, I am not 100% sure it will fix the problem with googl scholar but it definitely is something that needs to be fixed)

I found the solution by watching this helpful video from a Person working for Google Scholar with tips for repositories: https://www.youtube.com/watch?v=C-miRaROsaE
Here is the presentation from the video: https://www.carl-abrc.ca/wp-content/uploads/2021/01/Google_Scholar_webinar_Jan2021.pdf

Good luck with your repository!

Best regards
Jarmo
M’sila University --
All messages to this mailing list should adhere to the Code of Conduct: https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx
---
You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dspace-tech...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dspace-tech/9c7f409a-361f-4977-aa29-86b342a4272bn%40googlegroups.com.

--
Dr. Jarmo Schrader
stellv. Bibliotheksleiter
Fachreferat und EDV
Universität Hildesheim
Universitätsbibliothek
Universitätsplatz 1
31141 Hildesheim

Tel: +49 (0) 5121 - 883 - 93004
jarmo.s...@uni-hildesheim.de

Sean Carte

unread,
Oct 8, 2024, 6:36:46 AM10/8/24
to dspac...@googlegroups.com
Thanks, Dr Schrader. Unfortunately, that isn't the case with my repository, which does provide the citation_pdf_url meta tag in the item pages, yet is not indexed by Google Scholar.

Sean

Steli Vali

unread,
Jul 9, 2025, 10:39:15 AM7/9/25
to DSpace Technical Support
Same problem here. I have the citation_pdf_url but is not indexed. However, my repository uses 'Thumbnail generation' for cover pages,. What we also implemented is the truncation of the abstract in the SimplePAge. (DSpace 7.4, without entities)
Are there any updates?

Best regards,
Steli

Sean Carte

unread,
Jul 10, 2025, 2:10:11 AM7/10/25
to Steli Vali, DSpace Technical Support
Hi Steli

No, unfortunately no improvement and no response from Google.

Sean

Michael Plate

unread,
Jul 10, 2025, 7:16:45 AM7/10/25
to dspac...@googlegroups.com
We have the same problem with Google Scholar since the update to Dspace
8 (from 5.10).
However, as an easy tip to view the metadata in your pages…in Firefox
this is very easy done with <CTRL>-<I> (at least Linux / Windows) on an
item page. Chrome does not have any like this.
Just if anyone is stuck viewing the source of an Angular generated HTML
page :) to find the metadata tags.

Michael


Am 10.07.25 um 08:09 schrieb Sean Carte:
[…]

Martin Kijumi

unread,
Jul 10, 2025, 7:26:11 AM7/10/25
to DSpace Technical Support
Dear, members on the same i am having challenges configuring to receive  notification when i try to register a new user. please help how do i do configuration on emails step by step 

DISCLAIMER

This email and its attachments are, unless the context clearly indicates otherwise, the property of Mbarara University of Science and Technology. It is confidential, private and intended for the addressee only. If you are not an intended recipient you must not use, disclose, distribute, copy, print or rely on this email. Mbarara University of Science and Technology accepts no liability to anyone whatsoever for any of the contents of the email or any of its attachments.


Steli Vali

unread,
Jul 10, 2025, 9:35:18 AM7/10/25
to DSpace Technical Support
Thank you Michael and Sean for clarification. I heard that GoogleScholar will need some time before answering, but let's hope for the best.
I will let you all know in case I receive any feedback from them or find a solution to our problem. 
I was thinking aboud DSpace8 upgrade, but it seems this problem is still encountered in the last version. 
Reply all
Reply to author
Forward
0 new messages