i've been struggling on this issue quite a while now; as suggested i've added the /1 /2 tags at the end of the reads and filtered them; however i still got some errors while running abyss-pe.
ABYSS -k25 -q3 --coverage-hist=coverage.hist -s 14-bubbles.fa -o 14-1.fa ../17_R1_a.fastq ../17_R2_a.fastq
ABySS 1.2.7
ABYSS -k25 -q3 --coverage-hist=coverage.hist -s 14-bubbles.fa -o 14-1.fa ../17_R1_a.fastq ../17_R2_a.fastq
Reading `../17_R1_a.fastq'
warning: discarded 46616 reads shorter than 25 bases
Reading `../17_R2_a.fastq'
warning: discarded 271567 reads shorter than 25 bases
Loaded 52984920 k-mer
Minimum k-mer coverage is 11
Using a coverage threshold of 8...
The median k-mer coverage is 64
The reconstruction is 7187818
The k-mer coverage threshold is 8
Setting parameter e (erode) to 8
Setting parameter E (erodeStrand) to 1
Setting parameter c (coverage) to 8
Generating adjacency
Generated 107957676 edges
Eroding tips
Eroded 25611226 tips
Eroded 0 tips
Trimming short branches: 1
Trimmed 13 k-mer in 13 branches
Trimming short branches: 2
Trimmed 10 k-mer in 5 branches
Trimming short branches: 4
Trimmed 28 k-mer in 10 branches
Trimming short branches: 8
Trimmed 99 k-mer in 16 branches
Trimming short branches: 16
Trimmed 189 k-mer in 18 branches
Trimming short branches: 25
Trimmed 214 k-mer in 11 branches
Trimming short branches: 25
Trimmed 73 branches in 6 rounds
Marked 3508774 edges of 1680299 ambiguous vertices.
Removing low-coverage contigs (mean k-mer coverage < 8)
Found 27370460 k-mer in 2160840 contigs before removing low-coverage contigs
Removed 20172909 k-mer in 893323 low-coverage contigs
Split 1786625 ambiguous branches
Eroding tips
Eroded 27552 tips
Eroded 0 tips
Trimming short branches: 1
Trimmed 155 k-mer in 155 branches
Trimming short branches: 2
Trimmed 272 k-mer in 202 branches
Trimming short branches: 4
Trimmed 658 k-mer in 261 branches
Trimming short branches: 8
Trimmed 1837 k-mer in 359 branches
Trimming short branches: 16
Trimmed 3879 k-mer in 376 branches
Trimming short branches: 25
Trimmed 3160 k-mer in 182 branches
Trimming short branches: 25
Trimmed 1 k-mer in 1 branches
Trimming short branches: 25
Trimmed 1536 branches in 7 rounds
Popping bubbles
Removed 2118 bubbles
Removed 2118 bubbles
Marked 16761 edges of 7960 ambiguous vertices.
2705 unassembled k-mer in circular contigs
Assembled 7100808 k-mer in 10730 contigs
Removed 45822202 k-mer.
The signal-to-noise ratio (SNR) is -8.05998 dB.
AdjList -k25 -m30 14-1.fa >14-1.adj
PopBubbles -j2 -k25 -p0.9 -g 14-3.adj 14-1.fa 14-1.adj >14-1.path
MergeContigs -k25 -o 14-3.fa 14-1.fa 14-1.adj 14-1.path
The minimum coverage of single-end contigs is 8.
The minimum coverage of merged contigs is 12.6522.
Consider increasing the coverage threshold parameter, c, to 12.6522.
awk '!/^>/ {x[">" $1]=1; next} {getline s} $1 in x {print $0 "\n" s}' \
14-1.path 14-1.fa >14-indel.fa
KAligner -i -j2 -k25 ../17_R1_a.fastq ../17_R2_a.fastq 14-3.fa \
|ParseAligns -k25 -h 14-3.hist \
|sort -snk3 -k4 \
|gzip >14-3.sam.gz
Mateless 11933308 100%
Unaligned 0
Singleton 0
FR 0
RF 0
FF 0
Different 0
Multimap 0
Split 0
Total 11933308
gunzip -c 14-3.sam.gz \
|DistanceEst -j2 -k25 -s100 -n10 -o 14-3.dist 14-3.hist
error: the histogram `14-3.hist' is empty
make: *** [14-3.dist] Error 1
make: *** Deleting file `14-3.dist'
Below a read example:
@HISEQ1:85:D0C0FABXX:3:1101:1410:2166#1:N:0:CGATGT/1
ATTGCCCAGCAGAATCTCCAGAAATTCGAACTGCGGGACGTGACGACCGAGAACGACAAGGCACACCGTCAGCGGCGTCGACAGCACCAATCCGACCGGCC
+
@@@FBFB?BFDDF@GHGIIIFHIIECGGIGHGEEG<FAF<;C@FCDEHF8AA@>/38=@BBBBBCBBB+9@BCCBBBBBBBB39BC<?#############
Other really similar reads with the old pipeline perform really well.
Thanks in advance for any suggestion you may have