Updating to new Greengenes

211 views
Skip to first unread message

Paul R.

unread,
Oct 31, 2012, 3:06:58 PM10/31/12
to qiime...@googlegroups.com
Hello,

I am having some issues updating Greengenes in a QIIME Virtual Box.  Is there a tutorial or list of instructions somewhere that I can use to replace the old version of Greengenes with the new one, and have QIIME automatically use it whenever it assigns taxonomy?

Thanks for any help in advance!

Daniel McDonald

unread,
Oct 31, 2012, 3:36:47 PM10/31/12
to qiime...@googlegroups.com
Hey Paul,

I don't believe a tutorial was put together... so lets do that now :)

1) open a terminal, and change to your home directory (cd ~) or to a
directory of your choosing
2) download the gg_12_10_otus.tar.gz (curl -O
ftp://greengenes.microbio.me/greengenes_release/gg_12_10/gg_12_10_otus.tar.gz)
3) unpack the file (tar xzf gg_12_10_otus.tar.gz)
4) if you're using QIIME 1.5.0, patch the taxonomy for RDP support
(see https://groups.google.com/d/msg/qiime-forum/t74u1H4Y-Ns/vV05O0SBOEcJ)
5) open your QIIME config file located at ~/.qiime_config
6) update the "assign_taxonomy_id_to_taxonomy_fp" key to point to the
patched RDP taxonomy file
7) update the "assign_taxonomy_reference_seqs_fp" key to point to the
corresponding rep_set file

Let me know if you have any issues!
Daniel
> --
>
>
>

Paul R.

unread,
Oct 31, 2012, 3:53:44 PM10/31/12
to qiime...@googlegroups.com
Hi Daniel,

Thank you for your help and creating the tutorial!  I am having an issue running step 4.  I get an error back that says "couldn't connect to host" when I run the curl -O command for the patch.  I also tried to download the file outside of the virtual box, and I am having the same issue.  Is this link down, or is it something to do with my internet connection or computer?

Thanks again!

Paul R.

unread,
Oct 31, 2012, 3:55:33 PM10/31/12
to qiime...@googlegroups.com
Update:  I was able to get it downloaded using your command from later on that thread:  curl -O ftp://thebeast.colorado.edu/pub/gg_12_10_otus.rdp22_patch.tar.gz 

Thanks again for your help!

Paul R.

unread,
Oct 31, 2012, 4:29:07 PM10/31/12
to qiime...@googlegroups.com
Hi Daniel,

I am having another issue with the patch.  I have it downloaded and unzipped, and I'm trying to re-train the RDP.  I run the following command:

assign_taxonomy.py -i otus/rep_set/seqs_rep_set.fasta -t /home/qiime/gg_12_10_otus/taxonomy/97_otu_taxonomy.rdp22_train.txt -r /home/qiime/gg_12_10_otus/rep_set/97_otus.fasta -o otus/rdp_assigned_taxonomy_gg/

However, I get an error message back that says:

assign_taxonomy.py: error: option -i: file does not exist: 'otus/rep_set/seqs_rep_set.fasta'

How would I go about updating this command so that it will re-train correctly?

Thank you!

Daniel McDonald

unread,
Nov 1, 2012, 2:19:55 PM11/1/12
to qiime...@googlegroups.com
Hey Paul,

That error indicates it cannot find your representative sequences from
your study on the file system. You'll need to switch back to your
analysis directory for that command to work.
-Daniel
> --
>
>
>

Paul Rosenthal

unread,
Nov 1, 2012, 2:39:00 PM11/1/12
to qiime...@googlegroups.com
Thanks again for your help, Daniel.  I don't currently have any data for my study, I anticipate that in the next few months.  Is there a way I can proceed with updating the Greengenes database if I don't have data for the study yet?

--






--
Paul Rosenthal
Senior Computer and Technical Specialist
PHR Technologies, SP.
 

If urgent technical assistance is needed: Please send a short description of the issue and your name to paulro...@vtext.com.  I will respond as soon as possible.


Daniel McDonald

unread,
Nov 1, 2012, 2:42:54 PM11/1/12
to qiime...@googlegroups.com
Simply obtaining and unpacking the gg_12_10_otus.tar.gz is sufficient
-Daniel
> --
>
>
>

Paul Rosenthal

unread,
Nov 1, 2012, 2:46:22 PM11/1/12
to qiime...@googlegroups.com
Great, thank you for your help.  When I do have study data (I am currently using sample data we have), will I need to change any of the commands when processing the data?

Daniel McDonald

unread,
Nov 1, 2012, 2:50:42 PM11/1/12
to qiime...@googlegroups.com
I suggest updating your qiime_config to have the
"assign_taxonomy_id_to_taxonomy_fp" key and the
"assign_taxonomy_reference_seqs_fp" key to point to the gg_12_10 RDP
patch and the gg_12_10 rep seqs respectively
-Daniel

On Thu, Nov 1, 2012 at 12:46 PM, Paul Rosenthal
> --
>
>
>

Paul Rosenthal

unread,
Nov 1, 2012, 3:09:44 PM11/1/12
to qiime...@googlegroups.com
How do I go about modifying the qiime_config file?  Do I need to replace the keys already in it, or just add yours?  I also see this when I open the file: 

# WARNING: DO NOT EDIT OR DELETE Qiime/qiime_config
# To overwrite defaults, copy this file to $HOME/.qiime_config or a full path

Daniel McDonald

unread,
Nov 1, 2012, 3:13:15 PM11/1/12
to qiime...@googlegroups.com
Copy the file:

cp path/to/original/qiime_config $HOME/.qiime_config

Then open the file in an editor. I don't recall what graphical editors
are available within the VM, but I suspect kedit is:

kedit $HOME/.qiime_config

The paths you place into the qiime_config need to be correct for your
environment. The paths in the step-by-step instructions should be
correct AFAIK
-Daniel
> --
>
>
>
Reply all
Reply to author
Forward
0 new messages