what happened to -rur option

43 views
Skip to first unread message

pemh...@gmail.com

unread,
Dec 18, 2012, 11:12:20 AM12/18/12
to mosaik-...@googlegroups.com
Hi there,

I am trying to find out the -rur option in the latest version. But it seems that it doesn't support it anymore. Any way to find out the unaligned reads? Thanks.

Peter

Wan-Ping Lee

unread,
Dec 19, 2012, 8:02:10 PM12/19/12
to mosaik-...@googlegroups.com
Hi Peter,

MOSAIK currently keeps all reads in outpted bam including mapped and unmapped. So, you may use bamtools filter to catch them.

> bamtools filter -isMapped false -in <your_bam>

Best,
Wan-Ping
--
Wan-Ping Lee
Postdoctoral Research Associate
Department of Biology, Boston College

Tel: +1 (617) 552-2922
wanpi...@bc.edu

pemh...@gmail.com

unread,
Jan 3, 2013, 4:18:21 PM1/3/13
to mosaik-...@googlegroups.com, wanpi...@bc.edu
Thanks Wan-Ping. I tried your suggestions and it works. I am now able to find the unaligned reads using both mosaik and bamtools.

Another question, do you know a way to split chimeric reads?

I read snownebula's reply on SEQanswer, but I don't know if it will fit our analysis. For example, he mentioned that one need to use a cut off for the minimal size of alignment such as 32 bp. In addition, one needs to use -mmal option. But I think -mmal option is not there anymore?

Any suggestions? Thanks a lot for your kind help.

Best,

Peter

Wan-Ping Lee

unread,
Jan 4, 2013, 3:38:36 PM1/4/13
to mosaik-...@googlegroups.com
Hi Peter,

Could you please describe chimeric reads more? What kind of reads that you will think them chimeric?

Best,
Wan-Ping

pemh...@gmail.com

unread,
Jan 7, 2013, 3:45:07 PM1/7/13
to mosaik-...@googlegroups.com, wanpi...@bc.edu
Hi Wan-Ping,

Thanks for the kind email reply.

In my work, we use NGS to sequence our samples and we find out that some reads map to different reference sequences. Our reference sequence contains three difference genes. So for a read map to more than one gene is consider a chimeric read as these three reference genes are completely different. I use blast to search for the match and half maps to one gene and another half maps to another gene.

As more than 10% of reads are chimeric, we'd like to split the chimeric reads up so that we could align them to their reference sequences well without causing too much variations (if it is not splitted) for SNP discovery.

Any suggestions? Thanks a lot for your kind help!

Best,

Peter

Wan-Ping Lee

unread,
Jan 9, 2013, 2:47:15 PM1/9/13
to mosaik-...@googlegroups.com
Hi Peter,

I got your question. If the alignments are obtainable without splitting, I may try to modify cigar tags. I assume that those reads will be mapped with complex cigars, e.g. 20M1D1M3I2M5D2M. Then, I'll modify the cigar like 20M8S such SNP callers may not touch the complex part, 1D1M3I2M5D2M, and thus you may not hold too many false positive calls.

Best,
Wan-Ping

pemh...@gmail.com

unread,
Jan 10, 2013, 10:55:01 AM1/10/13
to mosaik-...@googlegroups.com, wanpi...@bc.edu
Thanks Wan-Ping. I think I might have a way to get it. I am using bwa bwasw hard-clipping which will get rid of the unmatched region and give me the desired match portion. Thanks a lot!

Best,

Peter
Reply all
Reply to author
Forward
0 new messages