Really really strange pdf identification

Andrew Brown

Apr 23, 2011, 9:06:20 AM
to zoter...@googlegroups.com
The publishers sent me a PDF copy of Pope Benedict XVI's latest book, "The light of the World". I added it to my Zotero library using the Firefox extension. While testing the standalone Zotero, I right-clicked the item and asked for it to be identified from the metadata. With very little hesitation, this result came back:

others. “Gene and protein expression profiles of Shewanella oneidensis during anaerobic growth with different electron acceptors.” Omics: a journal of integrative biology 6, no. 1 (2002): 39–60.

(In fact there are ten authors in the Info pane, but "others" is listed first.)

Is there any possible, obvious explanation for this result?


--
Andrew Brown
What I edit: www.guardian.co.uk/belief/
What I'm up to: www.thewormbook.com/helmintholog/
mobile: +44-(0)7968-945618

Avram Lyon

Apr 23, 2011, 10:14:54 AM
to zoter...@googlegroups.com
More details of what was looked for should be in the debug output, but
since the book doesn't have a DOI, the approach is to take a snippet
of text from the PDF and give it to Google Scholar.

The "others" issue is because Google Scholar gives pretty bad author data.

In general, the PDF identification is pretty unreliable for
non-academic or non-article PDFs, since Google Scholar doesn't do much
for them.

Avram

Sean Takats

Apr 23, 2011, 10:20:53 AM
to zoter...@googlegroups.com
And I guess you're placing all the blame on the Zotero side because of papal infallibility? The metadata lookup basically pulls a DOI from the text or searches Google Scholar for a snippet of extracted text. If you run Zotero from the terminal, you should be able to see what it's posting to Google. It's not a perfect system, but it does work most of the time. -Sean
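
For readers who want to see the two-step strategy described in this thread spelled out, here is a minimal sketch in Python. It is not Zotero's actual code: the function names, the DOI regular expression, and the rule for choosing a snippet are all assumptions made for illustration, and the sketch only decides what would be looked up, without contacting any service.

```python
import re

# Heuristic DOI pattern (an assumption, not Zotero's own):
# "10." + 4-9 digit prefix + "/" + a run of non-space characters.
DOI_RE = re.compile(r'\b10\.\d{4,9}/[^\s"<>]+')


def find_doi(text):
    """Return the first DOI-like string in the text, or None."""
    match = DOI_RE.search(text)
    if match is None:
        return None
    # Trim punctuation that commonly trails a DOI in running text.
    return match.group(0).rstrip(".,;)")


def pick_snippet(text, min_words=6, max_words=12):
    """Return the first line with enough words to serve as a search phrase."""
    for line in text.splitlines():
        words = line.split()
        if len(words) >= min_words:
            return " ".join(words[:max_words])
    return None


def plan_lookup(text):
    """Decide how extracted PDF text would be looked up.

    Returns ("doi", doi), ("scholar", quoted_phrase), or ("none", None).
    """
    doi = find_doi(text)
    if doi:
        return ("doi", doi)
    snippet = pick_snippet(text)
    if snippet:
        # Quote the phrase so a search engine treats it as an exact match.
        return ("scholar", '"' + snippet + '"')
    return ("none", None)
```

With no DOI in the text, everything depends on whether the chosen snippet happens to match something the search service has indexed, which is consistent with the unreliable results reported above for a non-academic book.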

Andrew Brown

Apr 23, 2011, 11:02:37 AM
to zotero-dev
Sean -- thanks. That made me laugh out loud. Avram, yes, this
explanation makes perfect sense.