So we just released a new section on the observation pages that
summarizes the "data quality" of the observation and introduces the
concept of data quality grades. Basically all observations are
"casual" grade by default, but if it meets a number of requirements,
we consider it "research" grade. To be research grade, an obs must
* have an ID agreed upon by the community
* have a date
* have coordinates
* have a photo
* be wild/naturalized
* have a reasonable-looking location
Some of those require explanation. For an ID to be "agreed upon by
the community," the majority of identifications must match the
observer's ID or be of a taxon that descends from the observer's
opinion. So if I add an observation of Rodentia and biosam identifies
it as Rattus rattus, that's an agreement.
Whether or not the obs is wild or in a reasonable location is up to
the community: everyone (including the observer) can vote yes or no on
these questions. If the majority thinks it's *not* wild, then it
won't reach research grade. These two conditions can be met with 0
votes, so they're really mostly there to let people demarcate stuff
from gardens or stuff with locations that are clearly errors or very
rough estimations.
Ok, that's the rather long summary. Now, some questions for you all:
1) in this system, a more conservative ID than the observer's will
count as a *disagreement*. So if biosam ID's my rodent as Mammalia,
that's a disagreement. But if I add an obs without an ID and biosam
ID's it as Mammalia and I later ID it as Rattus rattus, biosam's ID
will still count as a disagreement, even though he was probably just
trying to help me get to the right ID. He can still be outvoted by
others, or change his ID, but I feel like this is still uncomfortable.
What do you guys think? Most of the solutions I've thought of are
unsatisfactory for various reasons.
2) Do you guys want to be notified when others vote on whether their
obs are wild or if the location looks good? Right now there's no
notification, but I could make it send an email or generate an update
on the dashboard or something.
3) I considered adding the ability to opt out of this entirely, but
eventually decided it wasn't necessary. Was I wrong?
4) any bugs, typos, confusing language?
Ok, congrats if you made it this far.
-ken-ichi
--
You received this message because you are subscribed to the Google Groups "iNaturalist" group.
To post to this group, send email to inatu...@googlegroups.com.
To unsubscribe from this group, send email to inaturalist...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/inaturalist?hl=en.
--
Winged Wolf
My Stores:
http://www.EclipseMetaphysical.com
EclipseMeta on Twitter
http://www.EclipseExotics.com
EclipseExotics on Twitter
Matt:
a) I agree "more things to fill out" can be a turnoff for newbies,
though none of this quality stuff *needs* to be filled out. I
actually chose the term "casual" to try and emphasize that it's not
bad, just normal. Frankly, it's quite hard to gauge the impact of
these things on new users without polling them or running a/b tests on
them (people generally don't reply to surveys and we still don't have
enough people on the site for significant a/b tests). If anyone
reading this just signed up and has some input, please chime in!
b) "community supported" and "community unsupported" are ok, but I
really want to avoid negative labels, for exactly the reason you
pointed out above: As far as the word "research" goes, most of the
researchers who've expressed an interest in iNat data have been
interested in it for distribution modeling and basic monitoring of
wild/naturalized organisms, so I consider things like cattle or pets
to be edge cases (cases that we can pretty easily deal with if people
want the data by selecting only obs of cattle or cats or what have
you).
The fact that you're "from-the-hip" CA observations are considered
research-grade while your obs conducted during actual research are
considered casual-grade is ironic, but I think it's ok given the
social notion of data quality we're trying to embrace: we have a lot
of CA naturalists who can verify observations from CA, but less people
who know Africa.
barbarb: we've considered adding a way to gauge the confidence of your
IDs, but I'm not sure it addresses the issue of suggestions ("looks
like Vespidae") vs. corrections ("it's not Vespa cabro, Vespidae is
the better ID"), because in both cases, the person is probably 100%
confident of their conservative ID.
wingedwolfpsion: yup, definitely a possibility, but there's this
problem: I add an unknown, biosam ID's it as Rodentia, and I add an ID
of Rattus rattus. In the system you proposed, biosam's ID is *not* a
disagreement. However, what if biosam thinks I'm wrong? He can
remove and re-add his ID to make it a disagreement, but that seems a
bit silly to me.
What do people think of a "correction" checkbox on the IDs to indicate
that a more conservative ID is intended as a disagreement rather than
a tip in the right direction? To much additional complexity?
-ken-ichi
> wingedwolfpsion: yup, definitely a possibility, but there's this
> problem: I add an unknown, biosam ID's it as Rodentia, and I add an ID
> of Rattus rattus. In the system you proposed, biosam's ID is *not* a
> disagreement. However, what if biosam thinks I'm wrong? He can
> remove and re-add his ID to make it a disagreement, but that seems a
> bit silly to me.
Not really--editing an ID is extremely fast and easy, after all. I
don't think people would find it too cumbersome.
> What do people think of a "correction" checkbox on the IDs to indicate
> that a more conservative ID is intended as a disagreement rather than
> a tip in the right direction? To much additional complexity?
>
> -ken-ichi
That would work, too.
Nick
This is what I meant by a social conception of data quality: if sites
like Wikipedia have shown us anything, it's that large groups of
amateurs are often able to create more accurate and/or more complete
datasets than individuals or even small groups of experts.
Anyway, maybe we should just replace the word "research" with
"community" or something, so people don't feel like "casual"
observations are somehow not worthy of research.
Location accuracy is definitely serious and difficult issue. You
might have noticed that the iPhone app actually records positional
accuracy (which is often terrible, another issue). We can start
recording whether people entered coordinates manually or whether they
entered the name of a place and let Google look up the numbers. We
could also let people start manually editing the positional accuracy,
but we'd also have to start recording whether they did so. Even if we
did these things, though, I'm not sure any of it is really better than
the system we have now. They would basically allow observers to
voluntarily mark their locations as inaccurate (which they can already
do), but they would do nothing prevent people from exaggerating their
accuracy. We can't stop people from lying, so I think providing tools
for the community to call out obviously inaccurate stuff is the best
we can do. It might be comforting for consumers of the data to see an
accuracy number, but just because it's there doesn't really inspire
any more trust in the observer if you know they could have fudged it.
The flocking issue and the fact that unidentified stuff seems to get
more IDs than ID Please observations is super interesting, and
something that we can probably address with changes to the interface,
maybe doing a better job of visually indicating that an obs needs ID
help, as barbarb suggested.
flapack, we definitely want to start subscriptions for things you've
commented on / ID'd. That's a pretty conventional model from other
social networking sites, just haven't gotten there yet. Overhaul of
the update emails is long overdue, too.
Nick, you can add IDs of any taxon, so if you're only confident
calling something a bird, you can ID it as Aves. We might some day
start designating experts in certain places / taxa based on their ID
performance (some function of matching the observer's ID and time?),
but I'd say that's a ways off yet.
Whew! This is a really cool discussion. It's so great getting all
this feedback!
-ken-ichi
-ken-ichi
> --
> You received this message because you are subscribed to the Google Groups
> "iNaturalist" group.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/inaturalist/-/R0P3SFzXqIIJ.