



--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To post to this group, send email to gen...@soe.ucsc.edu.
Visit this group at https://groups.google.com/a/soe.ucsc.edu/group/genome/.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CAKHG4HoOzHocNw9CAXOebX1nETUDzQbpUmoT9L0gBeK7zSaQ0w%40mail.gmail.com.
For more options, visit https://groups.google.com/a/soe.ucsc.edu/d/optout.
Hi Alex,
Thanks for the follow-up. If you look closely at the PSL format (http://genome.ucsc.edu/FAQ/FAQformat.html#format2), you will see that there is no score field.
The matches column, that you might be thinking of as a score, is defined as "Number of bases that match that aren't repeats."
There is a further calculation using the PSL input to create score information only seen on the "web BLAT" page. This score is a result of a second step to generate that information, in a process like the shared pslScore script.
If you look at that script, toward the bottom there is an indication that beyond just the matches column, it also incorporates the repMatches, misMatches, qNumInsert, and tNumInsert in the PSL data.
The way to think about it might be that the PSL is the raw information, and then when the browser goes to display results as hyperlinks (not raw PSL data), a score is calculated on the data in an attempt to process that raw information and sort it so the best results are near the top. If matches = best results, then these would sort identically, but it is likely sometimes there are other pieces of information that need to be considered. You may wish to look in our archives to see some discussion about this topic: https://groups.google.com/a/soe.ucsc.edu/forum/#!searchin/genome/matches$20PSL$20score%7Csort:date
For example, here is an excerpt from our archives:
- Can I just take the alignment that has the highest number in the
"match"-column and take this entry as the "best" alignment?This is a valid approach. However, some subtleties will be missed. For
instance, a perfect match that has no gaps on either the target or the
query side would be treated the same as a match where each base matched
perfectly, but the matches were interrupted by non-matching sequence.
I hope this was helpful. If you have any further questions, please reply to gen...@soe.ucsc.edu. All messages sent to that address are archived on a publicly-accessible forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
All the best,
Brian
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/CAKHG4HrxkCf0aXM1tPLwY%2BzhXVh7cwiFA0ATMYm3e3DzPEXH0g%40mail.gmail.com.