human_genome_database needs to be updated

170 views
Skip to first unread message

Drishti Kaul

unread,
Apr 19, 2019, 2:32:17 PM4/19/19
to kneaddata users
Hi KneadData users!

I have some skin microbiome data and as a first step, ran KneadData in order to filter out the human host contaminant reads. I looked into the *kneaddata_paired_*.fastq files, and after some downstream species profiling, it seems like even after the host decontamination, there were still a few reads that were human. These were probably chromosomes, mitochondrial genome, etc that aren't present in the scaffolded reference per say.

I ran the same analysis with the most recent human genome ref (GCA_000001405.15_GRCh38_no_alt_analysis_set.fna and built the db using bowtie2-build), and it seems like KneadData had filtered out more contaminant reads, and was completely devoid of the host.

It'd be great if the human_genome reference file could be updated to the most recent version for complete host removal.

Thanks,
Drishti

Josh L. Espinoza

unread,
May 30, 2019, 3:06:32 PM5/30/19
to kneaddata users
Reply all
Reply to author
Forward
0 new messages