Hello, Sherry.
After getting some background on this feature from our developer, Leonid, I was able to get some answers for you.
According to getFileIdsInVersionFromSolr in DatasetPage.java, the only fields that search box searches are FILE_NAME and FILE_DESCRIPTION.
That would explain why you couldn't get all the files you tagged returned in the results. Now, the reason your searches for file extensions like "sib" and "xml" didn't work as expected is because the FILE_NAME is saved in Solr as a string, and it doesn't do a very good job of separating out the file extensions from the rest of the text string.
That said, you can use wildcard search syntax to get the results you expected. In your example dataset, if you search "*sib", using the asterisk or star for the wildcard, you would get your 348 files with the sib file extension.
Not sure why you get 1 file when you search "xml", but my guess is that it might have something to do with the parenthesis as some kind of separator in the file name string... I don't know, I am just guessing.
Hopefully that helps. As always, feel free to open a
new issue in GitHub if you believe this feature, or the documentation for it, can and should be improved.
Happy searching!
Mike
--
Michael Heppler
User Interface Designer + Developer, Dataverse Project
Institute for Quantitative Social ScienceHarvard University
1737 Cambridge Street, Rm K333, Cambridge, MA 02138
www.iq.harvard.edu