|
Mark Diggory (Schedule a Meeting) 2888 Loker Avenue East, Suite 305, Carlsbad, CA. 92010 Esperantolaan 4, Heverlee 3001, Belgium http://www.atmire.com |
--
You received this message because you are subscribed to the Google Groups "dryad-dev" group.
To post to this group, send email to drya...@googlegroups.com.
To unsubscribe from this group, send email to dryad-dev+...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/dryad-dev?hl=en.
Actually, both. And, because these are early days in my exploration,
I'm looking for ways to ensure that name-crawling procedures can be as
simple as possible, either upon ingestion or throughout the lifecycle
of the data package. As you know, name-crawling and indexing cannot be
a one-off event for any one data package. Rather, taxonomy is a moving
target so packages will have to be re-crawled for names.
After having examined many data sets already accessible via Dryad, I
see all manner of idiosyncratic ways taxonomic names have been
expressed, which is not overly suprising. Crawling will work well in
some instances (i.e. if near code-compliant), but not so well in
others (e.g. genera lowercase, underscores between genus and epithet,
etc.). So, crawling for names will have to hit both the metadata and
the data packages (and ideally, the published works as well) in an
attempt to get as much coverage as possible.
> Programmatic access to the actual data files is possible, but not quite as
> straightforward and clean as we want to be. The progress and aims are
> documented here: http://wiki.datadryad.org/Data_Access
Excellent. This is a great start.
> I think the DataONE data access API will be live within the week or two,
> if it isn't already since with the just-released v1.11. (Ryan can update us
> on that.)
>
> -hilmar
The very early GN APIs are accessible at: http://gnrd.globalnames.org
and http://resolver.globalnames.org. The latter is *very* early days,
but will give you a sense for what it can do. It's rudimentary API doc
is here: https://github.com/GlobalNamesArchitecture/gni/wiki/Name-resolution-API-(early-public-ALPHA).
There's also a chance we might deprecate the former and roll it into
the latter, which would eliminate an API call. Needless to say, we've
been concentrating more on the backend robustness, algorithms, and
names list acquisition/ingestion.
Cheers,
David P. Shorthouse
> The very early GN APIs are accessible at: http://gnrd.globalnames.org
Do I understand correctly that in contrast to the other one this interface doesn't allow the choosing of data sources?
And BTW if I feed it a data package URL (http://datadryad.org/resource/doi:10.5061/dryad.743) it (erroneously) complains in the response that "That URL was inaccessible". Any thoughts on what might be happening?
> and http://resolver.globalnames.org. The latter is *very* early days, but will give you a sense for what it can do. It's rudimentary API doc is here: https://github.com/GlobalNamesArchitecture/gni/wiki/Name-resolution-API-(early-public-ALPHA).
BTW the HTML output isn't working for that (it doesn't show any metadata for the list datasources example). Aside from that, the documentation seems to be suggesting that you need to present it with a list of name string candidates already, rather than with a URL to a page or a PDF that may contain such name strings. Is that true, or is the documentation behind? Also, there is no good example for how a resolution query would actually look like.