RobI agree with Stan.I think the idea MUST be pursued if we expect to make progress on interconnecting distributed data in Biology. I believe we now have the proof of concept applications that allow us to say that we have the right infrastructure in place, and will make good progress with further investment in bringing content together and in improving the customizability of serrvices. That is, I think this is a very timely idea, and has the capacity to deliver good results in the near future.Global names has many dimensions, but overcoming the challenges associated with variation in names for the same taxon motivated much of the work at MBL.You can see and example of our work at http://demo.globalnames.org/reconciler?token=wN6HUIvWQuG1JdEVdaJ7cw. This demonstrates name finding and resolution (pink highlights). We are also in the final stages of finishing off a new indexing service for Biodiversity Heritage Library, a service that not only finds names, but reconciles what it finds and offers links to other sources - i.e. has to option of 'resolving' the name (assuming resolution = provide me with the name that is preferred by an aiuthoritative source of your choosing). You will see that the system can break the back of the problems, but we still have some issues to address.I understand this space reasonably well, but our code has been build by David Mozzherin and Dave Shorthouse (copied here). There is some very time-costly stuff in this area (collection of the taxonomic synonyms being one of them) that I do not believe it makes any sense to try to build reconciliation and resolution services de nova. Rather, our best best is to have a common and open foundation that can be expanded by ohers to meet their needs.We would be delighted to collaborate with enthusiastic users. Depending on how many people might come together, I'd recommend at least Dima and myself attend. Dima worked closely with the iPlant TNRS group.In addition, I have recently send an email to some of the NESCENT guys about collaborating on a proposal to act as a taxonomic learing house for the OToL project. Support for that would ensure we have the funds to collect a lot of taxonomic information.I think it would be very valuable for all for me to add some text to the http://www.evoio.org/wiki/TNRS_Meeting_Proposal page as it makes no mention of the Global Names work and progress to date. I have been reticent to promote GN too much before we have some robust services out there, but it is important to ensure that we don't try to reinvent wheels. I have tried to log in under various names but always end with an error. I presume that I need to be assigned some rights to contribute. Can you arrange this and I will put some information about GN onto that page.As these issues have been central to our thinking for many years, we should be able to contribute a lot to a catalysis proposal.ThanksPaddy--On Sun, Nov 18, 2012 at 6:39 AM, Blum, Stan <SB...@calacademy.org> wrote:
Rob, this is relevant to the work all three of us are doing (or vice versa). I'm sure everyone in GNA would like to be there.I'm traveling for another 48 hours or so, and I'm sure I'm going to walk in under a huge mound of stuff that will fall on me. If you guys schedule a conference call, please count me in.-StanFrom: Robert Guralnick <Robert.G...@colorado.edu>
Date: Sunday, November 18, 2012 4:11 AM
To: Richard Pyle <deep...@bishopmuseum.org>, David Patterson <dpatt...@eol.org>, Stan Blum <sb...@calacademy.org>
Cc: Arlin Stoltzfus <ar...@umd.edu>
Subject: possible Catalysis meeting on names
Hey Stan, Paddy, Rich --- I wanted to touch base with you guys really quickly about the idea for a Catalysis meeting proposal to be held at NESCent on the topic of Taxonomic Name Resolution and how to build services across projects. Arlin Stoltzfus has been trying to get a lot folks involved in such activities in the same place to discuss how to avoid duplication of effort both in terms of assembling resources and in terms of how these are compiled and used by other projects/consumers. The proposal is due in just a couple weeks but I personally think its an exciting and timely topic and likely to yield significant value especially if we can leverage the time together and maybe have the meeting and outcomes also associated with help on key aspects of the project. As it turns out, my student Gaurav Vaidya will be at NESCent on a graduate fellowship next semester, so the timing could work particularly well.I am not sure when we will next talk as a group, but hopefully Arlin can chime in and maybe we can pull this off? Are you interested? I absolutely am. For more details on where we are, check here: http://www.evoio.org/wiki/TNRS_Meeting_ProposalBest, Rob
___________________________________
David J Patterson
Senior Scientist, Marine Biological Laboratory
7 MBL Street, Woods Hole, MASS 02543, USA.
Research Professor
School of Life Sciences, Arizona State University
Tempe, AZ 85287-4501
Professor (MBL) Ecology and Evolutionary Biology
Brown University, Providence, Rhode Island
Life Sciences Lead, Data Conservancy dataconservancy.org
globalnames.org
On Nov 19, 2012, at 3:20 PM, Robert Guralnick wrote:My guess is that much of the content needed to write up the above exists in grant proposals and papers on which many of you have worked. The big challenge will be focusing on synthesis, but I think all of us have a pretty good idea about the next obvious steps forward.
One note post-call here - I sensed some hesitation along the lines of "I'll put my stuff in once I see a skeleton and where this is going". My recommendation is that everyone who has a paragraph (or more) motivating and arguing what might turn into proposal objectives throw it up there on a Google Doc. It's much easier then to go through as a group to prioritize, and subsequently for a few people to edit into a more coherent whole. The earlier it's up there for everyone to look at even it it's a hodgepodge, the more time for everyone to stew over it and ideas / thoughts for streamlining to flow from that.
--
--
--