NCBI taxonomy data file missing

425 views
Skip to first unread message

Josiah Wagner

unread,
Jan 18, 2018, 3:09:20 PM1/18/18
to PhyloSift
Hi all,

Yesterday I obtained PhyloSift from Git and manually installed the Markers database from the updated link (https://figshare.com/articles/PhyloSift_markers_database/5755404/1).

However, I can't figure out how to manually install the NCBI taxonomy data since the http://edhar.genomecenter.ucdavis.edu/~koadman/ncbi.tgz is unavailable.  

So after running:

./phylosift all --paired tutorial_data/HMP_1.fastq.gz tutorial_data/HMP_2.fastq.gz

I get the error:

NCBI taxonomy data not found and unable to connect to update server, please check your phylosift configuration and internet connection!

On another post here, it was suggested taxonomy data could be obtained from ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/ but I am unsure which of those files PhyloSift requires in the share/phylosift/ncbi folder.

Thank you for any help!

Sincerely,
Josiah

Guillaume Jospin

unread,
Jan 18, 2018, 3:21:03 PM1/18/18
to phyl...@googlegroups.com
I did forget to add the ncbi part... I'll see about fixing that. Thanks for pointing it out.

You need the following files in the ncbi directory you mention.
delnodes.dmp
merged.dmp
nodes.dmp
names.dmp

Sorry about the misleading information.

--
You received this message because you are subscribed to the Google Groups "PhyloSift" group.
To unsubscribe from this group and stop receiving emails from it, send an email to phylosift+unsubscribe@googlegroups.com.
To post to this group, send email to phyl...@googlegroups.com.
Visit this group at https://groups.google.com/group/phylosift.
For more options, visit https://groups.google.com/d/optout.

Guillaume Jospin

unread,
Jan 18, 2018, 3:25:44 PM1/18/18
to phyl...@googlegroups.com
For some reason the link you share only shows the markers archive.
The ncbi archive is showing when using the following link:



Josiah Wagner

unread,
Jan 18, 2018, 7:22:43 PM1/18/18
to PhyloSift
Fixed, thanks so much.

Josiah
To unsubscribe from this group and stop receiving emails from it, send an email to phylosift+...@googlegroups.com.

Morgan Sobol

unread,
Oct 14, 2018, 4:51:16 PM10/14/18
to PhyloSift
Hello, 
I followed the guidance below but I am still receiving the error "NCBI taxonomy data not found and unable to connect to update server, please check your phylosift configuration and internet connection!" error.  
The NCBI folder is stored in my phylosift_v1.0.1 directory and contains delnodes.dmp, merged.dmp, nodes.dmp, names.dmp.
I also changed the ncbi directory path in the phylosiftrc file to this directory:

$marker_path="";

# $ncbi_path="";

# $marker_dir="/work/Geomicrobiology/msobol/bin/phylosift_v1.0.1/markers/";

# $markers_extended_dir="";

# $ncbi_dir = "work/Geomicrobiology/msobol/bin/phylosift_v1.0.1/ncbi/";

$marker_base_url = "";

$ncbi_url = "";


Can you please help me?

Thanks,
Morgan S. 

On Thursday, January 18, 2018 at 2:21:03 PM UTC-6, Guillaume Jospin wrote:
I did forget to add the ncbi part... I'll see about fixing that. Thanks for pointing it out.

You need the following files in the ncbi directory you mention.
delnodes.dmp
merged.dmp
nodes.dmp
names.dmp

Sorry about the misleading information.
On Thu, Jan 18, 2018 at 12:09 PM, Josiah Wagner <jtw...@gmail.com> wrote:
Hi all,

Yesterday I obtained PhyloSift from Git and manually installed the Markers database from the updated link (https://figshare.com/articles/PhyloSift_markers_database/5755404/1).

However, I can't figure out how to manually install the NCBI taxonomy data since the http://edhar.genomecenter.ucdavis.edu/~koadman/ncbi.tgz is unavailable.  

So after running:

./phylosift all --paired tutorial_data/HMP_1.fastq.gz tutorial_data/HMP_2.fastq.gz

I get the error:

NCBI taxonomy data not found and unable to connect to update server, please check your phylosift configuration and internet connection!

On another post here, it was suggested taxonomy data could be obtained from ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/ but I am unsure which of those files PhyloSift requires in the share/phylosift/ncbi folder.

Thank you for any help!

Sincerely,
Josiah

--
You received this message because you are subscribed to the Google Groups "PhyloSift" group.
To unsubscribe from this group and stop receiving emails from it, send an email to phylosift+...@googlegroups.com.

Guillaume Jospin

unread,
Oct 14, 2018, 4:58:46 PM10/14/18
to phyl...@googlegroups.com
Can you try removing the # at the beginning of the lines of interest please?
So in front of $marker_dir and $ncbi_dir

Morgan Sobol

unread,
Oct 14, 2018, 5:09:00 PM10/14/18
to PhyloSift
Thanks for the quick reply! I feel dumb for not seeing that earlier. Thank you!
It fixed the NCBI error message, but now I am getting this one:

Use of uninitialized value $timestamp in scalar chomp at /work/Geomicrobiology/msobol/bin/phylosift_v1.0.1/bin/../lib/Phylosift/Utilities.pm line 365, <$FH> line 1.

Usage: my $f = File::NFSLock->new('/pathtofile/file',

'BLOCKING|EXCLUSIVE|NONBLOCKING|SHARED', [blocking_timeout, stale_lock_timeout]);

(You passed "" and "2") at /work/Geomicrobiology/msobol/bin/phylosift_v1.0.1/bin/../lib/Phylosift/Utilities.pm line 632.



I can open a new post if need be.

Guillaume Jospin

unread,
Oct 14, 2018, 5:16:37 PM10/14/18
to phyl...@googlegroups.com
Ah yes, it might be due to the failed earlier PS runs. You might need to manually remove files with the word LOCK in them.
I assume you have write permission to the database directory? Sometimes if you are using a shared DB it can be a bit problematic.

So from your earlier post, I would look in /work/Geomicrobiology/msobol/bin/phylosift_v1.0.1/
Maybe inside markers as well.

Morgan Sobol

unread,
Oct 14, 2018, 9:00:49 PM10/14/18
to PhyloSift
Would lock be in the file name? If so, I am not finding anything like that. 

Morgan Sobol

unread,
Oct 16, 2018, 1:26:00 AM10/16/18
to PhyloSift
I fixed it by uninstalling and reinstalling the program.
Thanks!

Guillaume Jospin

unread,
Oct 16, 2018, 8:27:45 AM10/16/18
to phyl...@googlegroups.com
Oh good. Sorry I forgot to get back to this thread yesterday. 
Thanks for sharing the fix. 

Sent from my iPhone
Reply all
Reply to author
Forward
0 new messages