Raw sequencing data as input for Cortex

52 views

Skip to first unread message

Joe Colgan

unread,

Dec 15, 2015, 8:00:35 PM12/15/15

to cortex_var

Hi,

I was wondering about the appropriateness of using raw sequencing (i.e. non-filtered) data as input for a Cortex run?

As Cortex is involved in the analysis of genetic variation through de novo assembly, and with the built-in filtering steps (e.g. --quality score threshold), would it be better to incorporate all sequences as opposed to a filtered dataset that comprises 80-90% of the original raw data?

I am just concerned about missing SNPs through stringent filtering and therefore, would it be advisable to run on raw sequencing data to reduce potential gaps during the assembly process?

Kind regards,

Joe

Zamin Iqbal

unread,

Dec 15, 2015, 8:29:47 PM12/15/15

to cortex_var

Hi Joe

As Cortex is involved in the analysis of genetic variation through de novo assembly, and with the built-in filtering steps (e.g. --quality score threshold), would it be better to incorporate all sequences

Yes

as opposed to a filtered dataset that comprises 80-90% of the original raw data?

Basically, the only filtering I recommend is removal of adapters

I am just concerned about missing SNPs through stringent filtering

Me too

and therefore, would it be advisable to run on raw sequencing data to reduce potential gaps during the assembly process?

Yes :-)

regards

Zam

Reply all

Reply to author

Forward

0 new messages