Hello David,
Thank you for your question about a problem with the search index for your bigBed. We are able to see the issue with your bigBed file, but have been unable to reproduce it with anything we create ourselves. Are you able to send us the data files and the program binaries (e.g., bedToBigBed) that you used to construct your smps.bb file? You can send them to me privately to avoid sharing with the mailing list if you prefer.
One of our engineers notes that search names are not required to start with "Chr_"; we suspect that another bug is responsible.
If you have any further questions, please reply to gen...@soe.ucsc.edu or genome...@soe.ucsc.edu. Questions sent to those addresses will be archived in publicly-accessible forums for the benefit of other users. If your question contains sensitive data, you may send it instead to genom...@soe.ucsc.edu.
--
Jonathan Casper
UCSC Genome Bioinformatics Group
--
Hello David,
One of our engineers looked into this issue and reports that the problem has to do with your environment settings. Your LC_COLLATE environment variable is most likely set to something like en_US.UTF-8. The problem is that this does not provide a case-sensitive sort, which our tools expect to find. Our engineer is now updating bedToBigBed to explicitly check for unsorted input. You can make sure that you get case-sensitive sorting by re-running your sort command as
$ LC_COLLATE=C sort -k1,1 -k2,2n smps-shortChromNames.bed > smps-shortChromNames-sorted.bed
I hope this is helpful. If you have any further questions, please reply to gen...@soe.ucsc.edu or genome...@soe.ucsc.edu. Questions sent to those addresses will be archived in publicly-accessible forums for the benefit of other users. If your question contains sensitive data, you may send it instead to genom...@soe.ucsc.edu.
--
Jonathan Casper
UCSC Genome Bioinformatics Group
--