I'm trying to set up our GSA on our new domain. It was configured on
the old domain, so I thought I could just change the network
information and the crawl URLs/database source to point to our new
server(s). Right now it's crawling an application database, our
corporate Web site, and our user fileshare on the server.
It appears to be crawling successfully: I can search and get results,
but when I click a link in the results (primarily those from the
fileshare), I get a 404 Not Found error. I checked for some of
the files, and most of them do exist. I made sure our crawler access
account has permission on the share and its contents, and I went
through these two FAQs/procedures (created a virtual directory for the
fileshare, removed special characters from the filenames, and checked
the other security/permission settings):
https://support.google.com/enterprise/faqs?&question=337
https://support.google.com/enterprise/faqs?&question=232
The URLs shown in the search results say the correct server, but I
was wondering whether the results could be stale and I just haven't
given the GSA enough time to recrawl. If I just replaced Server01 with
Server02 in the Crawl URLs, would it make that replacement in the
search result URLs (even for documents it hasn't recrawled yet)? Or
does anyone have another explanation, or an idea I can try, for why I
would get a 404 error when the file does exist?
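To test the stale-results theory outside the GSA, I could run a few result URLs through something like the rough Python sketch below. It is only an illustration, not anything the appliance provides: the helper names swap_host and http_status and the example path are made up, and it assumes the fileshare's virtual directory is reachable over plain HTTP without authentication.

```python
import urllib.error
import urllib.request
from urllib.parse import urlsplit, urlunsplit


def swap_host(url, old_host, new_host):
    """Return url with old_host replaced by new_host.

    Hypothetical helper: host matching is case-insensitive, URLs on any
    other host are returned unchanged, and any user:password part of
    the URL is dropped.
    """
    parts = urlsplit(url)
    if parts.hostname is None or parts.hostname.lower() != old_host.lower():
        return url
    netloc = new_host
    if parts.port is not None:
        netloc = f"{new_host}:{parts.port}"  # keep an explicit port
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )


def http_status(url, timeout=10):
    """Send a HEAD request and return the HTTP status code (200, 404, ...)."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx/5xx responses are raised as HTTPError


if __name__ == "__main__":
    # Made-up example path; substitute a URL copied from the search results.
    result_url = "http://Server01/share/docs/report.doc"
    candidate = swap_host(result_url, "Server01", "Server02")
    print(candidate)
    # Uncomment on the network where the servers are reachable:
    # print(http_status(result_url), http_status(candidate))
```

If a result URL returns 404 as listed but 200 once the host is swapped, that would suggest the index still holds the old URLs; if both return 404, the problem is more likely in the virtual directory or permissions.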
Most of the Web pages come up fine, which is another reason I
suspected stale results (since that name/crawl URL didn't change)...
but maybe I'm just not configuring something right.
Thanks for any help,
Whitney