One thing I found curious is that I first ran it with -pno -d, then -pyes and got a different result each time:
dspace@ir:/home/dspace$ scripts/check-spider-hits.sh -u
http://localhost:8080/solr -f /dspacecris-dut/config/spiders/agents/example -pno -d
(DEBUG) Using spiders pattern file: /dspacecris-dut/config/spiders/agents/example
(DEBUG) Checking for hits from spider: AllenTrack
(DEBUG) Checking for hits from spider: Arachmo
(DEBUG) Checking for hits from spider: ContentSmartz
(DEBUG) Checking for hits from spider: DSurf
(DEBUG) Checking for hits from spider: EmailSiphon
(DEBUG) Checking for hits from spider: EmailWolf
(DEBUG) Checking for hits from spider: GetRight
(DEBUG) Checking for hits from spider: Googlebot
Found 325498 hits from Googlebot in statistics
(DEBUG) Checking for hits from spider: HTTrack
Found 1366 hits from HTTrack in statistics
(DEBUG) Checking for hits from spider: LOCKSS
(DEBUG) Checking for hits from spider: MSNBot
(DEBUG) Checking for hits from spider: Milbot
(DEBUG) Checking for hits from spider: MuscatFerre
(DEBUG) Checking for hits from spider: NABOT
(DEBUG) Checking for hits from spider: NaverBot
(DEBUG) Checking for hits from spider: OurBrowser
(DEBUG) Checking for hits from spider: Readpaper
(DEBUG) Checking for hits from spider: Strider
Found 1 hits from Strider in statistics
(DEBUG) Checking for hits from spider: Teoma
Found 2 hits from Teoma in statistics
(DEBUG) Checking for hits from spider: Wanadoo
Found 7 hits from Wanadoo in statistics
(DEBUG) Checking for hits from spider: WebCloner
(DEBUG) Checking for hits from spider: WebCopier
(DEBUG) Checking for hits from spider: WebReaper
(DEBUG) Checking for hits from spider: WebStripper
(DEBUG) Checking for hits from spider: WebZIP
(DEBUG) Checking for hits from spider: Webinator
(DEBUG) Checking for hits from spider: Webmetrics
(DEBUG) Checking for hits from spider: Wget
Found 170 hits from Wget in statistics
(DEBUG) Checking for hits from spider: alexa
Found 238 hits from alexa in statistics
(DEBUG) Checking for hits from spider: almaden
(DEBUG) Checking for hits from spider: appie
(DEBUG) Checking for hits from spider: architext
(DEBUG) Checking for hits from spider: arks
Found 18 hits from arks in statistics
(DEBUG) Checking for hits from spider: asterias
(DEBUG) Checking for hits from spider: atomz
(DEBUG) Checking for hits from spider: autoemailspider
(DEBUG) Checking for hits from spider: awbot
(DEBUG) Checking for hits from spider: baiduspider
(DEBUG) Checking for hits from spider: bbot
(DEBUG) Checking for hits from spider: biadu
(DEBUG) Checking for hits from spider: biglotron
(DEBUG) Checking for hits from spider: bjaaland
(DEBUG) Checking for hits from spider: bloglines
(DEBUG) Checking for hits from spider: blogpulse
(DEBUG) Checking for hits from spider: bot
Found 520514 hits from bot in statistics
(DEBUG) Checking for hits from spider: bspider
Found 72 hits from bspider in statistics
(DEBUG) Checking for hits from spider: bwh3_user_agent
(DEBUG) Checking for hits from spider: celestial
(DEBUG) Checking for hits from spider: cfnetwork|checkbot
(DEBUG) Solr query returned HTTP 400, skipping cfnetwork|checkbot.
(DEBUG) Checking for hits from spider: combine
(DEBUG) Checking for hits from spider: contentmatch
(DEBUG) Checking for hits from spider: core
(DEBUG) Checking for hits from spider: crawl
Found 15205 hits from crawl in statistics
(DEBUG) Checking for hits from spider: crawler
Found 15191 hits from crawler in statistics
(DEBUG) Checking for hits from spider: cursor
(DEBUG) Checking for hits from spider: custo
Found 4 hits from custo in statistics
(DEBUG) Checking for hits from spider: daumoa
(DEBUG) Checking for hits from spider: docomo
(DEBUG) Checking for hits from spider: dtSearchSpider
(DEBUG) Checking for hits from spider: dumbot
(DEBUG) Checking for hits from spider: easydl
(DEBUG) Checking for hits from spider: exabot
Found 133 hits from exabot in statistics
(DEBUG) Checking for hits from spider: fast-webcrawler
(DEBUG) Checking for hits from spider: favorg
(DEBUG) Checking for hits from spider: feedburner
(DEBUG) Checking for hits from spider: ferret
(DEBUG) Checking for hits from spider: findlinks
Found 10626 hits from findlinks in statistics
(DEBUG) Checking for hits from spider: gaisbot
(DEBUG) Checking for hits from spider: geturl
(DEBUG) Checking for hits from spider: gigabot
(DEBUG) Checking for hits from spider: girafabot
(DEBUG) Checking for hits from spider: gnodspider
(DEBUG) Checking for hits from spider: google
Found 327642 hits from google in statistics
(DEBUG) Checking for hits from spider: grub
(DEBUG) Checking for hits from spider: gulliver
(DEBUG) Checking for hits from spider: harvest
(DEBUG) Checking for hits from spider: heritrix
Found 765 hits from heritrix in statistics
(DEBUG) Checking for hits from spider: hl_ftien_spider
(DEBUG) Checking for hits from spider: holmes
(DEBUG) Checking for hits from spider: htdig
(DEBUG) Checking for hits from spider: htmlparser
(DEBUG) Checking for hits from spider: httrack
(DEBUG) Checking for hits from spider: iSiloX
(DEBUG) Checking for hits from spider: ia_archiver
Found 243 hits from ia_archiver in statistics
(DEBUG) Checking for hits from spider: ichiro
Found 1153 hits from ichiro in statistics
(DEBUG) Checking for hits from spider: iktomi
(DEBUG) Checking for hits from spider: ilse
(DEBUG) Checking for hits from spider: internetseer
(DEBUG) Checking for hits from spider: intute
(DEBUG) Checking for hits from spider: java
Found 2 hits from java in statistics
(DEBUG) Checking for hits from spider: jeeves
(DEBUG) Checking for hits from spider: jobo
(DEBUG) Checking for hits from spider: kyluka
(DEBUG) Checking for hits from spider: larbin
(DEBUG) Checking for hits from spider: libwww
Found 113 hits from libwww in statistics
(DEBUG) Checking for hits from spider: lilina
(DEBUG) Checking for hits from spider: linkbot
(DEBUG) Checking for hits from spider: linkcheck
(DEBUG) Checking for hits from spider: linkchecker
(DEBUG) Checking for hits from spider: linkscan
(DEBUG) Checking for hits from spider: linkwalker
(DEBUG) Checking for hits from spider: lmspider
(DEBUG) Checking for hits from spider: lwp
(DEBUG) Checking for hits from spider: megite
(DEBUG) Checking for hits from spider: milbot
(DEBUG) Checking for hits from spider: mimas
(DEBUG) Checking for hits from spider: mj12bot
(DEBUG) Checking for hits from spider: mnogosearch
(DEBUG) Checking for hits from spider: moget
(DEBUG) Checking for hits from spider: mojeekbot
(DEBUG) Checking for hits from spider: momspider
(DEBUG) Checking for hits from spider: motor
Found 8 hits from motor in statistics
(DEBUG) Checking for hits from spider: msiecrawler
(DEBUG) Checking for hits from spider: msnbot
Found 8993 hits from msnbot in statistics
(DEBUG) Checking for hits from spider: myweb
(DEBUG) Checking for hits from spider: nagios
(DEBUG) Checking for hits from spider: netcraft
(DEBUG) Checking for hits from spider: netluchs
(DEBUG) Checking for hits from spider: no_user_agent
(DEBUG) Checking for hits from spider: nomad
(DEBUG) Checking for hits from spider: nutch
Found 68 hits from nutch in statistics
(DEBUG) Checking for hits from spider: ocelli
(DEBUG) Checking for hits from spider: onetszukaj
(DEBUG) Checking for hits from spider: perman
(DEBUG) Checking for hits from spider: pioneer
(DEBUG) Checking for hits from spider: powermarks
(DEBUG) Checking for hits from spider: psbot
Found 3 hits from psbot in statistics
(DEBUG) Checking for hits from spider: python
Found 1 hits from python in statistics
(DEBUG) Checking for hits from spider: qihoobot
(DEBUG) Checking for hits from spider: rambler
(DEBUG) Checking for hits from spider: redalert|robozilla
(DEBUG) Solr query returned HTTP 400, skipping redalert|robozilla.
(DEBUG) Checking for hits from spider: robot
Found 56183 hits from robot in statistics
(DEBUG) Checking for hits from spider: robots
Found 43145 hits from robots in statistics
(DEBUG) Checking for hits from spider: rss
(DEBUG) Checking for hits from spider: scan4mail
(DEBUG) Checking for hits from spider: scientificcommons
(DEBUG) Checking for hits from spider: scirus
(DEBUG) Checking for hits from spider: scooter
(DEBUG) Checking for hits from spider: seekbot
(DEBUG) Checking for hits from spider: seznambot
(DEBUG) Checking for hits from spider: shoutcast
(DEBUG) Checking for hits from spider: slurp
Found 104 hits from slurp in statistics
(DEBUG) Checking for hits from spider: sogou
Found 2178 hits from sogou in statistics
(DEBUG) Checking for hits from spider: speedy
Found 139 hits from speedy in statistics
(DEBUG) Checking for hits from spider: spider
Found 23341 hits from spider in statistics
(DEBUG) Checking for hits from spider: spiderman
(DEBUG) Checking for hits from spider: spiderview
(DEBUG) Checking for hits from spider: sunrise
(DEBUG) Checking for hits from spider: superbot
(DEBUG) Checking for hits from spider: surveybot
(DEBUG) Checking for hits from spider: tailrank
(DEBUG) Checking for hits from spider: technoratibot
(DEBUG) Checking for hits from spider: titan
(DEBUG) Checking for hits from spider: turnitinbot
(DEBUG) Checking for hits from spider: twiceler
(DEBUG) Checking for hits from spider: ucsd
(DEBUG) Checking for hits from spider: ultraseek
(DEBUG) Checking for hits from spider: urlaliasbuilder
(DEBUG) Checking for hits from spider: urllib
Found 66 hits from urllib in statistics
(DEBUG) Checking for hits from spider: voila
(DEBUG) Checking for hits from spider: webcollage
(DEBUG) Checking for hits from spider: weblayers
(DEBUG) Checking for hits from spider: webmirror
(DEBUG) Checking for hits from spider: webreaper
(DEBUG) Checking for hits from spider: wordpress
(DEBUG) Checking for hits from spider: worm
(DEBUG) Checking for hits from spider: xenu
(DEBUG) Checking for hits from spider: yacy
Found 2 hits from yacy in statistics
(DEBUG) Checking for hits from spider: yahoo
Found 153 hits from yahoo in statistics
(DEBUG) Checking for hits from spider: yahoofeedseeker
(DEBUG) Checking for hits from spider: yahooseeker
(DEBUG) Checking for hits from spider: yandex
Found 8591 hits from yandex in statistics
(DEBUG) Checking for hits from spider: yodaobot
(DEBUG) Checking for hits from spider: zealbot
(DEBUG) Checking for hits from spider: zeus
(DEBUG) Checking for hits from spider: zyborg
(DEBUG) Checking for hits from spider: parsijoo
Found 38 hits from parsijoo in statistics
(DEBUG) Checking for hits from spider: validator
Total number of hits from bots: 1361976
dspace@ir:/home/dspace$ scripts/check-spider-hits.sh -u
http://localhost:8080/solr -f /dspacecris-dut/config/spiders/agents/example -pyes
Purging 325498 hits from Googlebot in statistics
Purging 1366 hits from HTTrack in statistics
Purging 1 hits from Strider in statistics
Purging 2 hits from Teoma in statistics
Purging 7 hits from Wanadoo in statistics
Purging 170 hits from Wget in statistics
Purging 238 hits from alexa in statistics
Purging 18 hits from arks in statistics
Purging 195014 hits from bot in statistics
Purging 72 hits from bspider in statistics
Purging 14714 hits from crawl in statistics
Purging 4 hits from custo in statistics
Purging 10626 hits from findlinks in statistics
Purging 2271 hits from google in statistics
Purging 765 hits from heritrix in statistics
Purging 5 hits from ia_archiver in statistics
Purging 598 hits from ichiro in statistics
Purging 2 hits from java in statistics
Purging 113 hits from libwww in statistics
Purging 8 hits from motor in statistics
Purging 1 hits from python in statistics
Purging 103 hits from slurp in statistics
Purging 2178 hits from sogou in statistics
Purging 139 hits from speedy in statistics
Purging 20938 hits from spider in statistics
Purging 66 hits from urllib in statistics
Purging 49 hits from yahoo in statistics
Purging 38 hits from parsijoo in statistics
Total number of bot hits purged: 575004