Hi, Alex,
I adjusted parameters according to your suggestions for
draft genome mapping. Mapping has been completely. Then I happened to
find a mapped read in sam file slightly different from the input read:
Here is what I see:
Input (from R2 reads file):
@M04650:15:000000000-ARB7E:1:1103:6869:18593 2:N:0:14
TGGCCCGGGGCTGGGCGCTTAACTAGTGAGTATAGTGTCCTGGTCAGCATGTTTTCGTGAGGGAGTGGAGGGGCACTGAGCTGACTGGCTTTGAGGCTGGACGGATGGATAGACATGGCGG
+M04650:15:000000000-ARB7E:1:1103:6869:18593 2:N:0:14
CCCCCFGDGFGGEGGGG77CE<CFAF<E6CF,EFFGGG9CFEFGF9@FF,EFGGGAF<FEGGGGE,@D@EFEEDEGGGGGGGGGGADFC,CEA9B@<FACEGGGEGGGFFFGAFFGGECC@
Out.sam file:
M04650:15:000000000-ARB7E:1:1103:6869:18593
16 GL193112.1 75683 255 123M * 0 0
TGGCCCGGGGCTGGGCGCTTAACTAGTGAGTATAGTGTCCTGGTCAGCATGTTTTCGTGGGGGAGTGGAGGGGCACTGAGCTGACTGGCTTTGAGGCTGGACGGATGGATAGACATGGCGGAG
:+CCFC:FF@C?4:@FACGFGFGFEDGDADGGGFDGDGBE<DGFAEGFGGGGGFBEFF8,FDFGGEEGFFC,EGGGGGEECECCDCGDFEFAFF@FFC:FFBEFFECFFE<FAFC:FF@<<CC
NH:i:1 HI:i:1 AS:i:119 nM:i:1
The output sequence has
extra "AG" at the end. I've globally checked how many there are those
kind of reads. Wonder if you've every had such issue and how to explain
that?
Thanks.
C.
On Monday, February 27, 2017 at 1:13:03 PM UTC-5, Alexander Dobin wrote: