This looks great and works well in my testing. I will commit it to the
repository soon.
I do have a question about your MARC changes, however -- are these due
to shortcomings in MARC and UNIMARC support that should be changed in
MARC.js? It looks like you have some expertise in MARC, so your
suggestions for improvements to the main MARC translator would be
welcome as well.
2010/10/3 ziche <zi...@noos.fr>:
> Added support for documents available in digital form. The appropriate
> Gallica (see http://gallica.bnf.fr) links will be stored in the item's
> url property.
Would it be possible to attach the original images from Gallica as
attachments to the Zotero item? Also, support for gallica.bnf.fr
search result and item pages would be nice to add in the future.
My suggestions above are just suggestions for future improvements --
the translator as it stands is very useful already and I look forward
to getting it to users.
- Avram
2010/10/3 ziche <zi...@noos.fr>:
> 1. mappings from the UNIMARC terminology to Zotero properties,
[..]
> much more than a reasonable starting point. We could, however, include
> things like my mapping of UNIMARC relator codes to Zotero creatorTypes
> in the next MARC translator.
>
> 2. the MARC translator currently does not cover or expose all the
> specifics of UNIMARC records. There is no comfortable way to access
> multiple subfields for a single identifier (meaning that you might get
> "London: SomePublisher" instead of "London, New York, Tokio:
> SomePublisher" when using high level record methods). Another (though
> rather esoteric) missing feature is support for embedded fields
> (potentially used in the 4## tags). These are shortcomings that could
> be fixed in a newer version. I will let you know if I can find the
> time to do this.
These are precisely the kinds of changes that would be good to address
at some point. As you note correctly, the MARC translator is rarely
sufficient on its own for a specific library, but it forms the core of
MARC processing for almost all the Zotero library translators, meaning
that improvements to its core functionality will immediately improve
Zotero's performance with thousands of library catalogs. Of course,
that also means that if we make a mistake in modifying MARC.js, we
will break thousands of catalogs. Oh well. :)
If you find the time to work on MARC/UNIMARC, your efforts would be
very welcome. Fortunately, it works sufficiently well already, so
improvements are not urgent.
> Concerning Gallica: there exists a translator for Gallica contents
> (Gallica.js, though it appears to be broken for search result pages).
> It would indeed be possible to use my BnF-MARC-processor instead of
> its current screen scraping, but I do not wish to interfere with
> somebody else's work. When you suggest to attach Gallica images to
> Zotero items: I suppose you are not thinking about actual snapshots
> (~60kB for a single page of text), but about adding one attachment per
> page, linking to the corresponding image URL?
If the Gallica site is similar enough to the rest of the BnF catalogs
to make including it fairly straightforward, please feel free to be
bold and handle it from the main BnF translator. Sylvain Machefert,
the author of the Gallica translator, is active on the mailing list
(he posted in September). I imagine he would be amiable to merging
them, so long as the functionality was not lost.
My suggestion for adding images was actually that you attach a PDF of
the digital object to the item
(http://www.pacifier.com/~tpope/Accessing_Manuscripts.htm#Downloading_with_Gallica),
as is done with many article databases, and as in the recently
committed Papers Past and National Archives of Australia translators.
In the case of Gallica, there is some risk that the document will be
overly large, so perhaps downloading shouldn't be automatic, but such
downloading can be incredibly useful. It appears that every, or at
least many, Gallica items have a download link, and it should be
possible to fetch the PDF automatically. Whether you decide to
implement this has to do with whether you see Gallica as similar to
Google Books, where downloading is not automatic, or to the National
Archives of Australia, where it is. For smaller items, automatic
downloading would be great. For larger ones, it might not be.
Regards and thanks for your contributions,
Avram
Avram
- Avram
2010/10/3 Avram Lyon <ajl...@gmail.com>:
Click on the box with an arrow pointing out of it, next to the (i)
information button at the top of the document. It is exposed, in a way
somewhat similar to some of the PDF download pages for journal
archives.
> [..] downloads are of a peculiar kind, anyway: the BnF server will create
> PDFs on request and store them for 2 days on a public FTP server (30MB
> of Galilei's Opera omnia are stll waiting for me). Immediate download
> would be our only option, then. Doing all this synchronously would
> make the saving of a single Zotero item impossibly slow [..]
Item saving in Zotero is asynchronous already-- Firefox and Zotero
should not block while the item is being saved. The question here is
whether users _want_ to be saving the full-text documents in their
personal collections. I don't know the answer to that question.
Avram
2010/10/8 ziche <zi...@noos.fr>:
> Uploaded a BnF.js version fixing a silly typo ("contibutor" instead of
> "contributor"...).
>
> --
> You received this message because you are subscribed to the Google Groups "zotero-dev" group.
> To post to this group, send email to zoter...@googlegroups.com.
> To unsubscribe from this group, send email to zotero-dev+...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/zotero-dev?hl=en.
>
>