--
You received this message because you are subscribed to the Google Groups "Taxonomic Literature" group.________________________________________
From: "Peter A. Schäfer" [Peter....@univ-montp2.fr]
Sent: Friday, 18 December 2009 10:09 p.m.
To: Stephen Thorpe; taxo...@googlegroups.com
Subject: Re: [TaxonLit] serials series journals
Hi,
an other nice example of difficult open access:
I have searched for some time for the following protologue
> Polygonum flagelliforme Loisel. (1827) Mémoires Soc. Linn. Paris vol. 6 page 409
without success, but since several month/years the relevant article is
on-line at> http://bibdigital.rjb.csic.es/ing/index.php
unfortunately not in the list of periodic publications but under titles:
> Nouvelle notice Sur les plantes à ajouter à la Flore de France... ,
which of course is not mentioned on IPNI
and fortunately under authors:
>
> Loiseleur-Deslongchamps, Jean-Louis-Auguste
> Nouvelle notice Sur les plantes à ajouter à la Flore de France..., 1827
I am also lucky as this is not a reprint with different pagination but
apparently an extract from the journal with normal pagination (396-432)
and my protologue starts at the bottom of 409 and continues on 410.
I guess the explanation of this different citations are reprints.
Formerly taxonomists would get reprints in their speciality and would
cite them as "Author, title" and not necessarily mention the journal but
modern standards are "Author, journal".
Good luck in linking all that together!
Best wishes
Peter
Peter A. Schäfer (MPU)
1. Bibliographic references are really more than just pointers to the
original pages. They contain information about those original
publications that goes beyond just a way to get the page.
2. Any definitive bibliographic system for taxonomy will need to
retain the information about all changes to the reference content,
including the person who made the change and why it was made.
In this instance, just correcting the page numbers wouldn't help. The
next researcher who (unknowingly) was looking at an original of the
reprint would justifiably "correct" the page numbers back again,
unless there was a change record containing this explanation for the
corrected page numbers.
Why haven't I added links to relevant references at Wikispecies?
Because I have thousands of them. Unless someone were to pay me for
the 5 minutes apiece it would take to make the entries, I can't do it.
Even if someone _did_ offer to pay me to do it, I'd be damned
reluctant to do it because I've already done that linking work once.
My time would be better spent working toward a Grand Unified Taxonomic
Reference & Paper system into which I could deposit my
references+PDFs, along with everyone else's.
That's a core problem I think we're trying to solve here. Trying to
create a major component for taxonomic work (covering references and
papers) so that no one has to do that kind of crappy clerical work
more than once, and even better, only one poor bugger per reference
has to do it at all. No more "everyone makes their own reference
collection" and no more "oh, won't you just take a few minutes of your
time [times 3000] to update just one more slightly-different online
repository with your valuable data?".
One central place for papers and their metadata, then everyone gets to
feed from that.
-Dean
--
Dean Pentcheff
pent...@gmail.com
2009/12/18 Stephen Thorpe <s.th...@auckland.ac.nz>:
This is *exactly* my vision as well! Right now, we're starting with the
metadata (essentially the Card Catalog of Biodiversity literature +
literature-like stuff). I'd love to see the PDF repository built around it,
but the BHL page images of pre-copyright literature is a HUGE step in the
right direction -- which is why it's so important that big initiatives that
rely on literature citations (I'm thinking right now of GNUB and BHL -- but
it would be great to include wikispecies and any number of other
taxon-literature initiatives) are built upon the same infrastructure (rather
than built as separate but cross-linked databases), and thus are
automatically cross-linked.
Rich
-Dean
--
Dean Pentcheff
pent...@gmail.com
2009/12/19 Richard Pyle <deep...@bishopmuseum.org>:
Why haven't I added links to relevant references at Wikispecies?
Because I have thousands of them. Unless someone were to pay me for
the 5 minutes apiece it would take to make the entries, I can't do it.
Even if someone _did_ offer to pay me to do it, I'd be damned
reluctant to do it because I've already done that linking work once.
My time would be better spent working toward a Grand Unified Taxonomic
Reference & Paper system into which I could deposit my
references+PDFs, along with everyone else's.
That's a core problem I think we're trying to solve here. Trying to
create a major component for taxonomic work (covering references and
papers) so that no one has to do that kind of crappy clerical work
more than once, and even better, only one poor bugger per reference
has to do it at all. No more "everyone makes their own reference
collection" and no more "oh, won't you just take a few minutes of your
time [times 3000] to update just one more slightly-different online
repository with your valuable data?".
There is a fallacy of reasoning here! Say you have 3000 references. Sure, if YOU were to add all 3000 refs to Wikispecies, it would be a major undertaking, but presumably at least hundreds of other people have overlap with you on those 3000 refs? If 300 people each put 10 refs on Wikispecies, it would take approx 0.5-1 hr of their time for each person, and we would have 3000 refs added! Not having time to add all 3000 refs isn't really a good reason for not adding any! Wikispecies is the closest existing thing to a GUTR & P system. Make max use of what we already have ... Besides, with Wikispecies, you build up the taxonomic database simultaneously with the reference library, which saves time in the long run, and makes what is there more useful as soon as it is added ...
Stephen
________________________________________
From: taxo...@googlegroups.com [taxo...@googlegroups.com] On Behalf Of Dean Pentcheff [pent...@gmail.com]
Sent: Sunday, 20 December 2009 4:30 a.m.
To: taxonlit
OK. Let me try again.
I am currently the curator of the reference list for the
genus-and-above taxa of the Decapoda. It took me, with the active
collaboration of numerous highly knowledgeable systematists, years to
amass and error-check that list.
Yes, it overlaps with the reference lists of many other workers. Many
of them donated their reference lists to the project. We spent
unreasonable amounts of time resolving the trivial but important
differences between all those overlapping versions of the same
references. We (between us all) spent equally unreasonable amounts of
time finding the originals of those references to verify the
bibliographic information.
There is probably only a handful of references in the curated list
that are identical to what was submitted to us originally by
well-intentioned donors.
But here's the point: the list is now of very high quality. It is not
a dog's breakfast of some verified and some
fourth-generation-reference-list copies from someone's master's
thesis.
Am I about to claim that we now have the definitively correct version
of every reference? Absolutely not. But will I claim that we have the
best-verified and most likely to be correct collection of references
for that particular taxonomic slice of the pie? Yes.
That's why decapod biologists now use our list. That's why decapod
biologists now invest time in sending us corrections to the list.
Because they know that the reference list they download from us is of
higher quality than the one they could develop on their own.
Much though I'd love to claim that's because of the generosity of the
community in contributing to this wonderful central resource, that's
untrue. It's because the NSF, courtesy of the Decapoda AToL, paid us
to spend the time to do it. I don't have a salaried job where I could
choose to spend part of my time contributing to the community. I
worked as a consultant to (a) do the decapod literature; and (b) pay
my rent.
Now. You are asking me to find each taxon on Wikispecies for which we
have an authority reference and then patch in the bibliographic data.
You estimated five minutes per taxon. I think that's a reasonable
estimate. Some of them will be quick cut-and-pastes, but inevitably
some will end up being time-consuming investigations of possible
synonymies, corrections, confirmations, etc. Average five minutes per
update sounds reasonable.
When I've finished doing that, Wikispecies will be blessed with a
snapshot of today's version of our bibliographic work.
Do you think that when I get a correction to one of those references
I'll correct our database, then think "Oh. Right. Must trot off to
Wikispecies and fix that up, too."? No.
Before you suggest it: No, I also don't think it's a good idea for me
to go to Wikispecies and put in a link to our reference database or
website for each taxon or reference. I'm sitting on a stupid little
home made reference database running on a shoebox in a museum lab.
What happens when I quit? What happens every week when the museum
network goes wonky?
I will spend time on a centralized, curatable, change-tracking
taxonomic reference database because then I can put our references
there and quit being a half-assed database manager.
We need a centralized resource that can absorb and maintain the
scholarship and curation that it takes to make a professional quality
taxonomic bibliographic database. Until that time, you (and anyone
else) is welcome to download from or link to the work we do, but don't
ask us to plug it, piece by piece, into a system where we cannot
coherently maintain it. And honestly, I wouldn't advise any long-term
links to our resource: we're working as hard as we can to get someone
else to host it.
But to host it where it can be coherently maintained. That's critical
to get community buy-in. Wikipedia and Wikispecies are extraordinary
wonderful tools whose strongest aspect is their ability to hoover in
unstructured contributions from all over the place. Where they're not
well suited is to curate a rigidly systematic and focused set of data
(e.g. references and associated papers) curated and maintained by
taxonomic fanatics.
-Dean
--
Dean Pentcheff
pent...@gmail.com
2009/12/19 Stephen Thorpe <s.th...@auckland.ac.nz>:
Stephen
________________________________________
From: Dean Pentcheff [pent...@gmail.com]
Sent: Sunday, 20 December 2009 5:10 p.m.
To: Stephen Thorpe; taxonlit
The reason I don't think the present Wiki* resources are well suited
to hosting a bibliographic database for taxonomy is that Wiki* systems
(in their current form) are openly-structured, loosely specified data
containers. Don't get me wrong: that's not a problem, that's their
strength. That's why they're so wildly successful at receiving and
presenting information in very flexible ways.
There's an intrinsic tradeoff between tightly specified systems that
enable complete and efficient implementation of well-specified
functionality vs. permissively specified systems that permit a broad
variety of uses.
Because the data content and functionality of bibliographic systems
are so constrained and specifiable, they are on the tight end of the
spectrum, whereas Wiki* lives on the permissive end of the spectrum.
We will want to be able to do things with a bibliographic system that
just won't be possible with references embedded in Wiki* pages:
"Show me all the publications by Mary Rathbun between 1913 and 1916,
but exclude anything published in the Proceedings of the United States
National Museum."
"Give me a reference list (in Endnote format) of all of the references
tagged "decapoda" that were verified for correctness by Sammy De
Grave."
"Show me all the references from the two journals "Bulletin du Muséum
national d’Histoire naturelle" and "Bulletin du Muséum d’Histoire
naturelle", sorted by year, so that I can correct any that are
misattributed before and after the name changed in 1907"
It's not fair to ask a Wiki* system to enable those sorts of queries.
If it could, it would have such a tightly constrained data model that
it would lose the flexibility that is the hallmark of a wiki.
But it is fair to ask a bibliographic database to implement that
functionality. We're in the process of constraining the data model as
tightly as possible (but no tighter!) so that we get the best
functionality in the very limited data domain of taxonomic
bibliographic references.
Another example -- none of the query examples I gave included anything
to do with a species or taxon. That's deliberate. I fully agree that
what we're building is not a taxon-to-reference database. We don't
want that sort of additional information -- other systems are being
built for that.
What we want is a highly specified, highly functional bibliographic
repository to which other systems can point. If some system has a
taxon name list, then that other system would do well to point to an
entity in the bibliographic system we're developing for its authority
reference.
The first is that Wikis need not be unstructured, indeed Wikipedia has
considerable structure built in via templates, and Semantic Mediawiki
http://semantic-mediawiki.org/wiki/Semantic_MediaWiki has considerable
structure, and a query language built in as well. One strength
Semantic Mediawiki has is that the database schema in effect becomes
part of the wiki, and hence can be edited and modified as new
functionality becomes important.
There are practical issues (Semantic Mediawiki is a something of a
hack), but once one goes down the route of openly editable, versioned,
queryable data, then one pretty much ends up developing something like
Semantic Mediawiki. So I think the choice boils down to the economics
of developing something from scratch, versus repurposing existing
software. Other wiki-style options include the software underlying
http://openlibrary.org/.
I also think we should look closely at other existing tools for
storing bibliographic metadata. Projects such as Zotero (http://
www.zotero.org) and Mendeley (http://www.mendely.com) have a lot of
traction (and money), and are likely to outlast and outperform
anything our community puts together in this space. If everybody
dumped their bibliographies into a central space such as Zotero or
Mendeley, and shared it, we'd have a massive data set to play with.
Lastly, personally I want names + literature, if for no other reason
than this is one of the ultimate goals, and taxonomic databases are a
major source of bibliographic metadata. I want to be able to browse
literature collections by author, taxon, and geography, and I'd
suggest projects like "CiteBank" (whatever that is) would get a lot
more enthusiastic support if it made such tools available.
Bibliographies by themselves are pretty lifeless, and actually not
much use. It's the links between the publications, their authors, and
the subjects they deal with that makes it come alive.
Regards
Rod
> pentch...@gmail.com
>
> 2009/12/19 Stephen Thorpe <s.tho...@auckland.ac.nz>:
>
>
>
> > Well, I'm glad you got that out of your system! My slight rant in return is about people who rant about things but put the critical bit right at the end, in an all too quick and short sentence, without sufficiently fleshing it out! Namely: 'Where they're [Wikispecies, etc.] not well suited is to curate a rigidly systematic and focused set of data (e.g. references and associated papers) curated and maintained by taxonomic fanatics'.
> > Why are they not well suited? They can't host the PDFs (though, possibly PDFs could be uploaded to the Wikimedia Commons, being "media files"), but otherwise I don't see a problem ...
>
> > Stephen
>
> > ________________________________________
> > From: Dean Pentcheff [pentch...@gmail.com]
> > pentch...@gmail.com
>
> > 2009/12/19 Stephen Thorpe <s.tho...@auckland.ac.nz>:
> >> With reference to this bit:
>
> >> Why haven't I added links to relevant references at Wikispecies?
> >> Because I have thousands of them. Unless someone were to pay me for
> >> the 5 minutes apiece it would take to make the entries, I can't do it.
> >> Even if someone _did_ offer to pay me to do it, I'd be damned
> >> reluctant to do it because I've already done that linking work once.
> >> My time would be better spent working toward a Grand Unified Taxonomic
> >> Reference & Paper system into which I could deposit my
> >> references+PDFs, along with everyone else's.
> >> That's a core problem I think we're trying to solve here. Trying to
> >> create a major component for taxonomic work (covering references and
> >> papers) so that no one has to do that kind of crappy clerical work
> >> more than once, and even better, only one poor bugger per reference
> >> has to do it at all. No more "everyone makes their own reference
> >> collection" and no more "oh, won't you just take a few minutes of your
> >> time [times 3000] to update just one more slightly-different online
> >> repository with your valuable data?".
>
> >> There is a fallacy of reasoning here! Say you have 3000 references. Sure, if YOU were to add all 3000 refs to Wikispecies, it would be a major undertaking, but presumably at least hundreds of other people have overlap with you on those 3000 refs? If 300 people each put 10 refs on Wikispecies, it would take approx 0.5-1 hr of their time for each person, and we would have 3000 refs added! Not having time to add all 3000 refs isn't really a good reason for not adding any! Wikispecies is the closest existing thing to a GUTR & P system. Make max use of what we already have ... Besides, with Wikispecies, you build up the taxonomic database simultaneously with the reference library, which saves time in the long run, and makes what is there more useful
>
> ...
>
> read more »
But would we have the freedom & access to implement the algorithms and
cross-linking that we wish to do for our community data? And are the Zotero
identifiers reliably persistent? And would any institution be able to
maintain a local mirror copy of the entire database locally, with features
to maintain it?
> Lastly, personally I want names + literature, if for no other
> reason than this is one of the ultimate goals, and taxonomic
> databases are a major source of bibliographic metadata. I
> want to be able to browse literature collections by author,
> taxon, and geography, and I'd suggest projects like
> "CiteBank" (whatever that is) would get a lot more
> enthusiastic support if it made such tools available.
99% of my motivation in pushing CiteBank is as a cross-link for GNUB. In
other words, I'm in this game for *exactly* the reasons you state above --
to cross-link to taxon names.
Rich
________________________________________
From: taxo...@googlegroups.com [taxo...@googlegroups.com] On Behalf Of Rod Page [r.p...@bio.gla.ac.uk]
Sent: Monday, 21 December 2009 6:31 a.m.
To: Taxonomic Literature
Subject: [TaxonLit] Re: serials series journals
Regards
Rod
--
Rod
On Dec 20, 9:42 pm, Stephen Thorpe <s.tho...@auckland.ac.nz> wrote:
> I think you are getting your Wikis a little mixed up, Rod. Wikispecies is more structured than Wikipedia, due to templates...
>
> ________________________________________
> From: taxo...@googlegroups.com [taxo...@googlegroups.com] On Behalf Of Rod Page [r.p...@bio.gla.ac.uk]
> Sent: Monday, 21 December 2009 6:31 a.m.
> To: Taxonomic Literature
> Subject: [TaxonLit] Re: serials series journals
>
> A couple of quick thoughts.
>
> The first is that Wikis need not be unstructured, indeed Wikipedia has
> considerable structure built in via templates, and Semantic Mediawikihttp://semantic-mediawiki.org/wiki/Semantic_MediaWikihas considerable
> structure, and a query language built in as well. One strength
> Semantic Mediawiki has is that the database schema in effect becomes
> part of the wiki, and hence can be edited and modified as new
> functionality becomes important.
>
> There are practical issues (Semantic Mediawiki is a something of a
> hack), but once one goes down the route of openly editable, versioned,
> queryable data, then one pretty much ends up developing something like
> Semantic Mediawiki. So I think the choice boils down to the economics
> of developing something from scratch, versus repurposing existing
> software. Other wiki-style options include the software underlyinghttp://openlibrary.org/.
>
> I also think we should look closely at other existing tools for
> storing bibliographic metadata. Projects such as Zotero (http://www.zotero.org) and Mendeley (http://www.mendely.com) have a lot of
> ...
>
> read more »
Rod
I wonder if the funders of these projects fully appreciate that the core taxonomic information can be handled rather well by the already existing and free Wikispecies infrastructure, and that they are paying out millions just to be able to do a few fancy searches of no primary taxonomic significance ... no doubt you will disagree! :)
Stephen
________________________________________
From: Dean Pentcheff [pent...@gmail.com]
Sent: Monday, 21 December 2009 3:58 a.m.
I think we're all ultimately interested in the linked relationship
between taxa, taxon names, authority metainformation, and publication
text. None of those alone (except the taxa themselves, rooting around
happily in the mud) is interesting at all.
What we're trying to figure out is how best to do that. I tend to
agree with a viewpoint that we want targeted databases for each of the
components (names, references, documents, etc.) and infrastructure to
link them. The possible linkages are too diverse and too difficult to
specify in advance. If taxon names are part of a bibliographic
database, then we have to pick a name database schema (that's likely
to conflict with everyone else's schema).
It's much more powerful and future-proof to keep the homogeneous data
types together with their kin, and permit diversification and
innovation in the linking.
I do plead ignorance on the possibilities for Semantic Mediawiki. But
your argument, Rod, is on target -- once we've committed to a
versioned, schema-based database, all the rest is implementation
chaff. Important, difficult implementation chaff, but not fundamental
to the problem. The existence of the letters "W", "I", "K", and one
more "I" in the name of a possible platform shouldn't cause a rash.
Maybe we could hook into Zotero or other systems. But what I'd like to
see first is our list of needs and desires (as encoded in a schema or
textual description -- don't care which). THEN we can see what
existing platforms might work to host those needs, or decide to build
one ourselves.
As to the difference of taxon vs. bibliographic focus: yes, there are
numerous different perspectives out there! Practicing taxonomists,
though, tend to live down in the nuts and bolts of the data, below the
level of looking for taxa. They already know everything there is to
know about a taxon (yeah, exaggerated for impact), so they're looking
to comprehensively identify and hoover up the very low-level details.
Like complete, checked bibliographic information for authority
references (ideally with a direct link to the original work).
-Dean
--
Dean Pentcheff
pent...@gmail.com
2009/12/20 Stephen Thorpe <s.th...@auckland.ac.nz>:
So much to rant about in reply to one sentence!
>below the level of looking for taxa
You've strawmanned me! I wasn't suggesting at all that the primary purpose of the Wikispecies pages was simply "looking for taxa"! I said it was for seeing the current "state of play" for a taxon (which may be a family, a genus, a species, etc.) My experience of taxonomists is that they have to find out and follow the state of play - it doesn't just pop into their heads by magic. There are plenty of cases around of taxonomists "missing things". So, imagine, if you will, being able to instantly Google a solid treasure trove of current info on any given taxon. Besides, I don't see why Wikispecies can't provide solid bibliographic information to allow taxonomists to "hoover up" low level details - actually it is rather good for that, and it can also handle links to the original work when such exists ...
For example, I was just now making a start at tidying up Wikispecies Decapoda, and I immediately find a nomenclatural tangle, see:
http://species.wikimedia.org/wiki/Anaglyptus_Milne-Edwards
it may be more complex, with the beetle genus possibly with Anaclyptus as the correct original spelling, but Anaglyptus is currently in use, and forms the basis of tribe Anaglyptini ... A mess to be sorted out, but highlighted by trying to synthesise taxonomic info accross the board on Wikispecies ...
Stephen
________________________________________
From: Dean Pentcheff [pent...@gmail.com]
Sent: Monday, 21 December 2009 12:49 p.m.