Hi,
I currently try to run the vsearch --fastq_mergepairs tool and found that quite many reads were discarded due to "too low alignment score". So I tried to increase the "--fastq_maxdiffs" value, but it apparently has no effect on the output after some point.
So my question is if there is another parameter rather than "--fastq_maxdiffpct", "--fastq_maxdiffs" and "--fastq_minovlen" that influences the merging output and might help to increase the number of merged reads?
Usearch in comparison merged 97% of the reads in this example with equivalent settings.
Here are the commands and the output from vsearch and the equivalent usearch:
VSEARCH command: vsearch --fastq_mergepairs file1_r1.fastq --reverse file1_r2.fastq --fastqout merged_file1.fastq --fastq_maxdiffpct 25 --fastq_maxdiffs 99 --fastq_minovlen 16 --quiet
VSEARCH Output:
106059 Pairs
90083 Merged (84.9%)
15976 Not merged (15.1%)
Pairs that failed merging due to various reasons:
17 too few kmers found on same diagonal
211 multiple potential alignments
15665 alignment score too low, or score drop to high
10 overlap too short
73 staggered read pairs
Statistics of all reads:
294.79 Mean read length
Statistics of merged reads:
364.55 Mean fragment length
2.84 Standard deviation of fragment length
1.25 Mean expected error in forward sequences
4.57 Mean expected error in reverse sequences
0.31 Mean expected error in merged sequences
1.61 Mean observed errors in merged region of forward sequences
6.33 Mean observed errors in merged region of reverse sequences
7.94 Mean observed errors in merged region
USEARCH command: usearch --fastq_mergepairs file1_r1.fastq --reverse file1_r2.fastq --fastqout merged_file1.fastq --fastq_pctid 75 --fastq_maxdiffs 99 --fastq_minovlen 16
USEARCH Output:
00:01 101Mb 100.0% 97.7% merged
Totals:
106059 Pairs (106.1k)
103667 Merged (103.7k, 97.74%)
2026 Alignments with zero diffs (1.91%)
2160 Too many diffs (> 99) (2.04%)
232 No alignment found (0.22%)
0 Alignment too short (< 16) (0.00%)
88 Staggered pairs (0.08%) merged & trimmed
225.11 Mean alignment length
364.43 Mean merged length
1.54 Mean fwd expected errors
5.23 Mean rev expected errors
0.53 Mean merged expected errors
thanks in advance
Till