H
ello again
Dan,
Here is some additional information that may be helpful to you
, based on input from one of our engineers:
hgFindSpec searches apply to tracks based on SQL database tables (e.g. loaded from regular BED), but not file formats such as bigBed. Sorry that our documentation didn't make that clear. Adding a SQL query to the search definition doesn't work because we don't query bigBed files using SQL.
For bigBed files, searches are actually defined using the trackDb settings referred to earlier, searchIndex and optionally searchTrix, in the "track ..." stanza not the separate "searchName ..." stanza. Unfortunately there is no support for truly fuzzy matching, but it's possible to use the searchTrix setting to get a very limited kind of fuzzy matching, or to define your own fuzzy matches.
For starters, to get exact matches on names like "GRMZM2G059865_T01", you can add a new setting to your "track ensemblGenesGrmzm" stanza:
searchIndex name
To support fuzzy matching, you will need to create a text file with desired matches and then run our ixIxx program as described
The simplest kind of input file would simply map names to themselves like this:
GRMZM2G059865_T01 GRMZM2G059865_T01
If you enter a search term "GRMZM2G059865_T" or "GRMZM2G059865_T0" then trix would match it with the name "GRMZM2G059865_T01", because trix's idea of "fuzzy" is that the all characters of the search term are matched, and the name has only one or two additional characters. That's because trix was designed for matching keywords like "kinase" / "kinases". If you are looking to match "GRMZM2G059865" then you'll need to include that in your input file like this:
GRMZM2G059865_T01 GRMZM2G059865_T01 GRMZM2G059865
This awk command should work to make such a file from your bed file:
awk '{abbrev = $4; sub(/_.*$/, "", abbrev); print $4, $4, abbrev;}' ensemblGenes.bed > ensemblGenes.ixTerms.txtThen run our ixIxx program on that file as described above, move the generated .ix and .ixx files to the same directory as your bigBed file, and add a searchTrix setting like this:
searchTrix /vault/gbdb/zeaMayB73_v3/bbi/ensemblGenesGrmzm.ixYou will need to keep the "searchIndex name" setting if you use searchTrix, so we know where to look for the IDs from the trix search. If you happen to have keywords associated with your IDs, you can add those words to the .ixTerms.txt file to get even better search.
Thanks for including so many details in your question!