CT (concordant translocation), HT (halfmapping translocation), UT (unpaired translocation). Here is an example:
HWI-ST678:296:C437KACXX:7:1101:12820:30889 99 chr13 45970104 40 47M54H chr12 6687256 209 ... MD:Z:47 NH:i:1 HI:i:1 NM:i:0 SM:i:40 XQ:i:40 X2:i:0 XO:Z:CT XS:A:- XT:Z:GT-AG,2.00,2.00,-chr12@6687195..-chr13@45970104
HWI-ST678:296:C437KACXX:7:1101:12820:30889 99 chr12 6687195 40 47H54M = 6687256 209 .... MD:Z:54 NH:i:1 HI:i:1 NM:i:0 SM:i:40 XQ:i:40 X2:i:0 XO:Z:CT XS:A:- XT:Z:GT-AG,2.00,2.00,-chr12@6687195..-chr13@45970104
HWI-ST678:296:C437KACXX:7:1101:12820:30889 147 chr12 6687256 40 48M271N53M = 6687195 -209 .... MD:Z:101 NH:i:1 HI:i:1 NM:i:0 SM:i:40 XQ:i:40 X2:i:0 XO:Z:CT XS:A:- XG:Z:A
Even though the sam/bam file is sorted by queryname, featureCounts will always try to resort the file. My temporary solution is, I filter out these reads and run featureCounts on the rest.
I don't do the filtering for small files because featureCounts is rather quick, does the sorting and gives me counts, but for big files it's a pain to do it ...
Do you think there could be a solution for this problem? If you want I can send you some test data.
best
Mathias