Hi, dear all!
I want to perform de novo assembly with four libraries (insert size were 270bp, 500bp, 2K and 5K), and the read are paired and the length is 150bp. After mapping the reads to reference with BWA, there are about 1/3 reads were chimeric for the two mate-pair libraries (2K and 5K). I don't know whether I should filter out these reads? There are little information after google. Considering the short libraries were used for constructing contig, then the reads from long libraries are mapped to contigs to link these contigs, in my opinion, the assembly tool still could use chimeric reads to link the contigs. However, my mate think there were rare chimeric reads in previous experiment since reads were short, and the assemble tool may can't deal with chimeric reads. Furthermore, I think if I filter out these reads, then this wouldn't be a true de novo assemble. So should I filter out the chimeric reads in mate-pair library before using SOAPdenovo?
Also, there is library with insert size is 270bp, and according to paper titled "Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques", the overlap reads were filter out when using SOAPdenovo, should I filter out these reads to use SOAPdenovo?
Any suggestion would be grateful!
--
You received this message because you are subscribed to the Google Groups "BGI-SOAP" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bgi-soap+u...@googlegroups.com.
To post to this group, send email to bgi-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bgi-soap/91d0c9a5-9fb7-477c-b73f-a4c8917a09d1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/bgi-soap/1ff08d47-693f-4773-97bd-fcd4850b3344%40googlegroups.com.