Hi Chen,
Because transcript assembly is not perfect, clustering is also not perfect. It's possible, for example, that a large low coverage region in the middle of a gene is not assembled meaning that the transcripts of a gene are split in two or more fragments. e.g. one from 5' end to low coverage region and then another from the low coverage region to 3' end. Usually the clustering algorithm has no way of knowing that these two fragments came from the same gene, and if will assign them to different clusters. Let call these clusters #1 and #2 as an example. Then after preforming differential expression analysis with something like edgeR we can get a p-value ranked list like:
cluster p-value
#1 0.0001
#3 0.001
#2 0.1
#4 0.999
..etc..
For the purposes of "unique true positives" we build the ROC curves without #2 because the gene it represents is already counted in #1 and #1 has a lower p-value. The logic of doing this is that in a real analysis, is that we look at the top-ranked genes first. If any part of the gene comes up at significantly different then that gene could be interesting to follow up on the biology.
Hope this helps and happy to clarify further if needed.
Cheers,
Nadia.