download taxon tree?


Rick Levy

Jan 26, 2018, 2:28:02 PM
to Symbiota
Is there a place where I can download the taxon tree? I'm looking for SEINet's, preferably as SQL or .csv.

Thanks

Rick

Mary Barkworth

Jan 27, 2018, 1:15:22 AM
to symbio...@googlegroups.com

Would love to be able to download, preferably as a csv file (about all I can handle). Perhaps separate files for accepted names and synonyms (to assist those of us who are IT challenged).

Mary

Robert Pace

Jan 27, 2018, 3:32:13 PM
to Symbiota
In May of 2017 I systematically exported the entire taxonomic dataset from itis.gov. I used this data to build a taxon table for my Symbiota test portal.

Here is a link to an archive of the ITIS data that I downloaded.

Mary Barkworth

Jan 28, 2018, 8:05:35 AM
to symbio...@googlegroups.com

My interest is in downloading trees, or at least names, from one Symbiota network for use in another. I invested considerable effort in looking up names and authors for several insect names not in SCAN. I do not want to have to repeat that work in order to be able to use them in another network. Nor do I want to overwrite names I have obtained from reputable sources (e.g., International Ornithological Union or whatever it is called) with names in ITIS. So I really want to be able to download at least the names from one Symbiota network to use in another, possibly using a different taxonomy.

Mary   

Robert Pace

Jan 28, 2018, 12:36:20 PM
to Symbiota
As far as I know, Symbiota doesn't have a feature to export specific taxonomic tree data for import into another instance of Symbiota. One can copy all of a Symbiota installation's taxonomic data to another installation by using mysqldump to back up the specific Symbiota database tables and then importing them into the other installation with the mysql command, but this is something only a person with server admin privileges can do. This sounds like a fantastic new feature request for Symbiota. I personally wish that Symbiota could load its taxonomy data from a local installation of the ITIS database instead of through piecemeal exports from ITIS.gov.
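
For anyone with server access, the dump-and-restore route described in this post might look roughly like the sketch below. It only builds the two command lines and does not run them. The database names, user name, and file name are placeholders, and the table list is an assumption limited to the two tables named elsewhere in this thread (taxa and taxstatus); a real portal very likely needs more. Note that mysqldump output by default drops and recreates each dumped table on restore, so existing rows in those tables on the target would be replaced.

```python
import shlex

# Assumption: only the two tables named elsewhere in this thread. A real
# Symbiota install has further taxonomy-related tables not listed here.
TAXON_TABLES = ["taxa", "taxstatus"]


def dump_command(db_name, user, out_file, tables=TAXON_TABLES):
    """Build a mysqldump call that backs up only the given tables."""
    return [
        "mysqldump",
        f"--user={user}",
        "--password",                 # no value given, so mysqldump prompts
        f"--result-file={out_file}",
        db_name,
        *tables,
    ]


def restore_command(db_name, user):
    """Build the mysql call; the dump file is fed to it on standard input."""
    return ["mysql", f"--user={user}", "--password", db_name]


# Hypothetical names, for illustration only.
print(shlex.join(dump_command("symbiota_src", "admin", "taxa_dump.sql")))
print(shlex.join(restore_command("symbiota_dest", "admin")) + " < taxa_dump.sql")
```

Building the commands as argument lists keeps quoting predictable if they are later passed to subprocess.run instead of being pasted into a shell.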

Herrick Brown

Jan 28, 2018, 6:37:22 PM
to symbio...@googlegroups.com
There IS an option to toggle manual upload for ITIS (or other large) files that have been manually loaded onto the server.


Mary Barkworth

Jan 28, 2018, 9:43:32 PM
to symbio...@googlegroups.com

Yes. But the new feature I want is the ability to download from a Symbiota site. Seriously, having added names, including author names, for many taxa that were not available in ITIS, I really want to avoid redoing that work. ITIS is very America-centric (North, Central, and South).

Herrick Brown

Jan 28, 2018, 9:48:03 PM
to symbio...@googlegroups.com
Right. What I meant was that a dump file, even from a Symbiota portal, might be too large for upload on the front end. ITIS's focus is North America, an obvious limitation, as you say.


Mary Barkworth

Jan 28, 2018, 9:54:46 PM
to symbio...@googlegroups.com

Good point. How about the ability to download, for example, scarab names from SCAN?

That might be a more reasonable size. Then I could download other insect groups as needed. Not ideal, but better than redoing all that work.

Mary

Robert Pace

Jan 29, 2018, 8:58:19 AM
to Symbiota
One of the most time-consuming steps in establishing a new Symbiota portal is loading the taxonomic data. This process is further complicated by the fact that ITIS limits the size of data that you can export from their database. It can take almost a day simply to download their dataset in relatively small chunks, each with only a serial number for a filename. ITIS does offer the entirety of their dataset in a few database formats such as .sql, but I haven't found any way of importing SQL data into Symbiota's taxonomic tables.

I think a few things would make working with the taxonomy tables a bit easier in Symbiota.

1.  It would be nice if we could easily bulk load ITIS data (or any other taxonomic dataset) into Symbiota from a .sql file.

2.  It would also be nice if we could export Symbiota's taxonomic data based on a query by kingdom/class/order/genus/etc., with some toggles for the specific data types you would like to export (e.g., taxon author, references, common names).

These are a few things that could make setting up a portal a bit easier.
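
The second wish, exporting only the names under one chosen taxon (Mary's scarab example), comes down to walking parent links. The following is a minimal sketch of that idea, not an existing Symbiota feature. It assumes the hierarchy is available as (tid, parenttid) pairs, the two column names that appear in the taxstatus table elsewhere in this thread; the function name and the sample tids are made up.

```python
from collections import defaultdict


def subtree_tids(rows, root_tid):
    """Return root_tid plus the tids of all its descendants.

    rows: iterable of (tid, parenttid) pairs.
    """
    children = defaultdict(list)
    for tid, parenttid in rows:
        if tid != parenttid:          # skip a root row that is its own parent
            children[parenttid].append(tid)

    found, stack = set(), [root_tid]
    while stack:
        tid = stack.pop()
        if tid in found:
            continue                  # already visited; also guards against cycles
        found.add(tid)
        stack.extend(children[tid])
    return found


# Made-up tids: 1 is the root, 2 is the group of interest,
# 3 and 4 sit below 2, and 5 is an unrelated sibling of 2.
rows = [(1, 1), (2, 1), (3, 2), (4, 3), (5, 1)]
print(sorted(subtree_tids(rows, 2)))  # prints [2, 3, 4]
```

The resulting set of tids could then be used to filter an export so that only the chosen group is written out.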

egbiodi...@gmail.com

Jan 30, 2018, 11:29:11 AM
to symbio...@googlegroups.com

Portal managers can pass you a flat CSV download of the taxa data using the SQL statement below. They can also run a MySQL dump of the taxa tables, but installing the dump files will overwrite any other taxa data previously loaded into your system. You will need to contact them directly for this info.

 

Not all portals want to make their taxonomic data available as a public download. There are numerous taxonomic authorities that are better resourced to play that role. I only know of 2 Symbiota portals that are being used in an attempt to establish a taxonomic authority. Most just use their internal thesaurus as a method to quickly resolve taxonomic discrepancies commonly associated with specimen data, which is the original primary purpose of the internal Symbiota thesaurus. Most thesauri are not maintained regularly enough to merit authoritative status. For this reason, using one portal’s thesaurus to create another might not be the best option (e.g., it can propagate errors).

 

For various reasons, the ITIS import has been problematic to maintain and thus is being replaced with some new tools (note: ITIS imports will still be possible through the generic flat-file taxonomic import tool). An alternative method of building your thesaurus is to load your collection data (which does not initially require a taxonomic thesaurus to exist) and then use the taxonomic cleaning tools (made available within the collection management menu as of Sept 2017) to evaluate the names associated with specimen data. This tool makes use of web services available from Catalogue of Life, WoRMS, TROPICOS, and/or EOL to validate and import scientific names, authors, acceptance status, and full taxonomic hierarchy. These resources tend to be more actively maintained, and thus are often a better taxonomic data source for setting up a new portal. The preferred taxonomic authorities can be activated within the portal’s configuration file ($TAXONOMIC_AUTHORITIES variable). TROPICOS web services require an API key that needs to be requested from them. Once the tool has been fully tested, it will be integrated into the checklist module and the taxonomic import tool. We are also planning on soon adding Index Fungorum or MycoBank as an authority option.

 

 

SELECT t.tid, t.kingdomName, ts.family, t.sciname, t.author,
       t.unitname1, t.UnitInd1, t.unitname2, t.UnitInd2, t.unitind3, t.unitname3,
       t.rankid, t.SecurityStatus, ts.parenttid, t2.sciname AS parentStr,
       ts.tidaccepted, t3.sciname AS acceptedStr
FROM taxa t
INNER JOIN taxstatus ts ON t.tid = ts.tid
INNER JOIN taxa t2 ON ts.parenttid = t2.tid
INNER JOIN taxa t3 ON ts.tidaccepted = t3.tid
WHERE ts.taxauthid = 1;
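
Earlier in the thread Mary asked for separate CSV files for accepted names and synonyms. As a rough sketch of how a portal manager might produce those from a query like the one above, the Python below runs a trimmed version of it (fewer columns) and writes two CSV texts. It uses an in-memory SQLite database as a stand-in for the portal's MySQL database, and the sample rows are invented. It also assumes that a name is accepted when its tid equals its tidaccepted, which should be checked against the portal's data before relying on it.

```python
import csv
import io
import sqlite3

# Trimmed version of the query above; column names are taken from it.
QUERY = """
SELECT t.tid, t.sciname, t.author, ts.parenttid, ts.tidaccepted,
       t3.sciname AS acceptedStr
FROM taxa t
INNER JOIN taxstatus ts ON t.tid = ts.tid
INNER JOIN taxa t3 ON ts.tidaccepted = t3.tid
WHERE ts.taxauthid = 1
ORDER BY t.tid
"""


def split_accepted_and_synonyms(conn):
    """Return (accepted_csv, synonyms_csv) as text.

    Assumption: a row is an accepted name when tid == tidaccepted.
    """
    cur = conn.execute(QUERY)
    header = [col[0] for col in cur.description]
    accepted, synonyms = io.StringIO(), io.StringIO()
    writers = {True: csv.writer(accepted), False: csv.writer(synonyms)}
    for writer in writers.values():
        writer.writerow(header)
    for row in cur:
        tid, tidaccepted = row[0], row[4]
        writers[tid == tidaccepted].writerow(row)
    return accepted.getvalue(), synonyms.getvalue()


# Stand-in database with invented rows: taxon 2 is a synonym of taxon 1.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE taxa (tid INTEGER, sciname TEXT, author TEXT);
CREATE TABLE taxstatus (tid INTEGER, parenttid INTEGER,
                        tidaccepted INTEGER, taxauthid INTEGER);
INSERT INTO taxa VALUES (1, 'Genus alpha', 'Auth. A'), (2, 'Genus beta', 'Auth. B');
INSERT INTO taxstatus VALUES (1, 0, 1, 1), (2, 0, 1, 1);
""")
accepted_csv, synonyms_csv = split_accepted_and_synonyms(conn)
print(accepted_csv)
print(synonyms_csv)
```

Against a real portal the same function would need a MySQL connection object in place of the SQLite one, and the two strings would be written to files.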

 

 

-Ed

Mary Barkworth

Jan 30, 2018, 5:48:47 PM
to symbio...@googlegroups.com

The problem I had in the early days of creating a network was that uploading a new taxonomy did not overwrite the existing one. Instead, it often created a second usage of a name that was then in conflict with the previous usage, and both remained in the system. In the end, I had Curtis delete all names that had two different parent taxa.
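
The conflict described here, one name filed under two different parents, can be detected before anything is deleted. This is a minimal sketch of such a check, assuming the names and their parents have been exported as (sciname, parenttid) pairs; the function name and sample rows are invented for illustration.

```python
from collections import defaultdict


def names_with_conflicting_parents(rows):
    """Map each sciname that has more than one distinct parent to its parents.

    rows: iterable of (sciname, parenttid) pairs.
    """
    parents = defaultdict(set)
    for sciname, parenttid in rows:
        parents[sciname].add(parenttid)
    return {name: sorted(p) for name, p in parents.items() if len(p) > 1}


# Invented rows: 'Genus alpha' was loaded twice under different parents.
rows = [
    ("Genus alpha", 10),
    ("Genus alpha", 20),
    ("Genus beta", 10),
    ("Genus beta", 10),   # an exact repeat is not a parent conflict
]
print(names_with_conflicting_parents(rows))  # prints {'Genus alpha': [10, 20]}
```

Reviewing the flagged names by hand first would make it possible to keep the preferred usage rather than deleting both.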

As I noted earlier, it is subsets of taxa that I want to be able to copy: those that I have added to one portal by consulting other sources such as those suggested by Ed. I would add one other source, Wikipedia, particularly for authors of higher-level taxa of animals. The times that I checked, it was reliable.

Mary

Rick Levy

Feb 5, 2018, 12:01:59 PM
to Symbiota
Thank you Ed

-Rick