Exclude specific IPs from DSpace Solr Statistics?


James Michelich

Dec 4, 2019, 10:03:09 AM
to DSpace Technical Support
I am new to DSpace and Solr, but have been tasked with finding a way to exclude specific IPs (or IP ranges) from the pageview and download statistics in our DSpace 6 install (XMLUI). The idea is to exclude statistics generated by views from some of our internal departments. Is this possible? Thanks.

James Michelich

Dec 6, 2019, 3:42:39 PM
to DSpace Technical Support
Our goal is to get a more accurate picture of views and downloads of content in our institutional repository. To that end, I'm trying to find a way to filter out statistics generated by accesses from certain people and departments internal to our organization.

Alan Orth

Dec 7, 2019, 3:19:23 PM
to James Michelich, DSpace Technical Support
Dear James,

I only use block lists based on user agent strings, so I'm not entirely sure, but it looks like you can just drop a list of IPs in here:


Check your dspace.log and you will see them being loaded.

Cheers,

James Michelich

Dec 12, 2019, 2:27:00 PM
to Alan Orth, DSpace Technical Support
Hello Alan,

Thank you. That's exactly what I needed. I have created a new file with a list of IP addresses in the spiders directory. I'm then using the dspace stats command line utility to mark the corresponding (existing) records in Solr as spiders (which flips the isBot flag to true). I'm hoping this will flow through and be reflected in the usage statistics displayed in the XMLUI, but I don't have a good way to verify that. Any advice?
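
One possible way to check this, sketched here rather than confirmed in this thread, is to query the Solr statistics core directly for a given IP and compare the hit count (`numFound` in the response) with and without the `isBot:true` filter. The Solr base URL and the `ip` field name below are assumptions about a typical DSpace 6 setup; adjust them to your install. The helper only builds the query URL, which can then be opened in a browser or fetched with curl.

```python
from urllib.parse import urlencode


def bot_count_url(ip, solr_base="http://localhost:8080/solr/statistics"):
    """Build a Solr select URL that counts records for `ip` flagged as bots.

    Assumptions (not confirmed in this thread): the statistics core lives
    at `solr_base`, and the client address is stored in a field named `ip`.
    """
    params = {
        "q": f'ip:"{ip}"',   # match records for this one address
        "fq": "isBot:true",  # keep only records flagged as spiders
        "rows": 0,           # only the count is needed, not the records
        "wt": "json",
    }
    return f"{solr_base}/select?{urlencode(params)}"


if __name__ == "__main__":
    # 192.168.0.10 is a placeholder address for illustration.
    print(bot_count_url("192.168.0.10"))
```

If the count with the filter matches the count without it, every record for that IP has been flagged.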

Also, do you know if it's possible to define a block of IP addresses in the spiders txt file using a wildcard, e.g., 192.168.0.*? Or will I need to enumerate every address in the range I want to flag?
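
Whether the spiders file accepts wildcards is not answered in this thread. If the addresses do have to be enumerated, a minimal sketch like the following could generate the list, one address per line, for pasting into the file. The function name `expand_wildcard` is hypothetical, and only trailing `*` octets are handled.

```python
import ipaddress


def expand_wildcard(pattern):
    """Expand an IPv4 pattern with trailing '*' octets into single addresses.

    Example pattern: '192.168.0.*'. Broad patterns such as '10.*.*.*'
    expand to millions of addresses, so use them with care.
    """
    octets = pattern.split(".")
    if len(octets) != 4:
        raise ValueError(f"expected four octets: {pattern!r}")

    # Collect the leading fixed octets; everything after must be '*'.
    fixed = []
    for octet in octets:
        if octet == "*":
            break
        fixed.append(octet)
    if not fixed or any(o != "*" for o in octets[len(fixed):]):
        raise ValueError(f"only trailing wildcards are supported: {pattern!r}")

    # Each fixed octet contributes 8 bits to the network prefix.
    prefix_len = 8 * len(fixed)
    base = ".".join(fixed + ["0"] * (4 - len(fixed)))
    network = ipaddress.ip_network(f"{base}/{prefix_len}")

    # Iterating a network yields every address, including .0 and .255.
    return [str(addr) for addr in network]


if __name__ == "__main__":
    # Print the first few addresses of the range as a quick check.
    print("\n".join(expand_wildcard("192.168.0.*")[:3]))
```

Redirecting the full output to a file gives a list that can be placed alongside the other files in the spiders directory.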

Thanks,
James