Hi everybody,
I am very new to the environment and I am desperately try to find a program/script to merge paired end sequences from Illumina. I am working with double digest, paired end data. For every individual I have 2 files containing the first and the paired read respectively, each of which contain reads 86 bp long (they have been already filtered for quality and barcodes+restriction site have been trimmed away).
I have been trying to use join_paired_ends.py and it seems to work fine but I don't fully understand how it handles several situations, and I don't seem to be able to find an explanation anywhere in the web. It could be great if some of you with more experience on this than me could answer some of my questions :) If you are aware of any group/page where these issues are discussed it could be great even just to have a link to :)
For example:
What happens
if reads do not overlap at all?
No output with unmatched paires is created and I don't seem to have any read in the output that is simply the sum of the first and paired original read. The length range in my output is min 86 bp and maximum 164 bp (so I don't have any that is 86*2=172 bp).
What happens to reads that don't have a mate? Are they discarded or kept as 86 bp long reads?
Again, I don't get any output with singletons and I can't find any explanation on what happens online. Are the 86 bp reads in my output unpaired reads or the result of two reads that overlap completely?
What happens if an overlap has too many differences (mismatches) between the two reads or if the reads have different bases in the overlap but same quality score?
Does join_paired_ends.py requires reads in the same order in the first and paired read's files?
I will be extremely thankful to any help provided.
Many thanks already from now,
Best regards and Happy Easter
Vivi
--
---
You received this message because you are subscribed to the Google Groups "Qiime Forum" group.
To unsubscribe from this group and stop receiving emails from it, send an email to qiime-forum...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
What happens if reads do not overlap at all?
No output with unmatched paires is created and I don't seem to have any read in the output that is simply the sum of the first and paired original read. The length range in my output is min 86 bp and maximum 164 bp (so I don't have any that is 86*2=172 bp).
What happens to reads that don't have a mate? Are they discarded or kept as 86 bp long reads?
Again, I don't get any output with singletons and I can't find any explanation on what happens online. Are the 86 bp reads in my output unpaired reads or the result of two reads that overlap completely?
What happens if an overlap has too many differences (mismatches) between the two reads or if the reads have different bases in the overlap but same quality score?
Does join_paired_ends.py requires reads in the same order in the first and paired read's files?
You can supply 3 -o arguments, for un1, un2, join files, or one
argument as a file name template. The suffix 'un1, un2, or join' is
appended to the file, or they replace a %-character if present.