non-standard fasta header format

21 views
Skip to first unread message

Matteo

unread,
Jul 22, 2016, 12:16:41 PM7/22/16
to ABySS
Hi there!

I've tried googling my problem, but I really couldn't find any proper answer or meaningful explanations to that. 

The vast majority of my sequences in the abyss contig fasta files has the standard header format (eg. >217 1452 43433)

A few of them show a non standard header looking as the following: >23892451 612 30983 145290-,2958386+,6879596+ or >23914100 434 17555 1186186+,...,8272178+

I would be really grateful if you could help me on this matter and tell me how I should interpret this.

Many thanks in advance!

Shaun Jackman

unread,
Jul 22, 2016, 12:20:53 PM7/22/16
to ABySS
Hi, Matteo.

>217 1452 43433
is a unitig, that was created early on using only k-mers and did not involve paired-end reads.

>23892451 612 30983 145290-,2958386+,6879596+
is a paired-end contig that resulted from merging three unitigs, listed above.

>23914100 434 17555 1186186+,...,8272178+
is a paired-end contig that resulted from merging four or more unitigs, listed in the associated .path file.

Cheers,
Shaun
Reply all
Reply to author
Forward
0 new messages