Cool.
As far as eliminating dups between the catalog and SFX, I actually went
further/different, you may be interested in what I did, and in the tools
Umlaut gives you to do similar.
The problem I had, was sometimes SFX did not return any responses, but
that's because SFX KB knew that we didn't have coverage of the
particular article requested. (Say, we wanted 1950, but the KB knew our
full text coverage begins in 1960).
The Catalog MARC is not really capable of telling that -- it doesn't
have coverage info in machine-readable format.
So sometimes there'd be cases where SFX returned no response, and if we
just went by that to say, sure, include 856 fields, then we'd get 856's
from the catalog for something the SFX kb correctly determined we did
not have access for.
My solution? I actually sweep the SFX kb database and try to pull out
any URLs that belong to activated targets in SFX.
Then, if a 856 has a hostname matching one of those hostnames pulled
from SFX, we don't display it, regardless of whether SFX returned a
response or not. (We may only do this suppression for citations that
look like articles, not ebooks, I forget). We still get some dups and
false negatives, but it works out reasonably well.
The exact logic for doing this is probably going to end up needing to be
localized to your particular needs, I haven't tried to make it generic.
But if you want to go this route, there are some things built into
Umlaut to make it pretty easy --
If you have a direct (read-only, preferably) db connection to SFX set up
in your database.yml under "sfx_db"; then every time you run `rake
umlaut:nightly_maintenance` (which is recommended), Umlaut is already
pulling likely URLs into a local Umlaut database. In your own local
service code, you can just check `SfxUrl.sfx_controls?
"
http://whatever"`, and it'll return true if it thinks this is something
SFX should know about (basically just on hostname match).
Curious to know if you end up using any of that!
Jonathan
On 10/2/14 3:57 PM, Kevin Reiss wrote:
> Thanks for the information on how to work with the connection pool
> Jonathan. Your suggestion worked out perfectly. My algorithm for
> deciding what to do with 856 fields is pretty straightforward. If the
> SFX knowledge base returns a full-text match for the OpenURL and the
> MARC 856 field indictors are set properly I just don't add the OPAC link
> to the fulltext responses.
>
> We will definitely add ourselves to the Umlaut implementors group on the
> github wiki. I'm also planning to get approval to post our umlaut
> project as a public repo on github. Our beta system is located at
>
http://getit.princeton.edu.
>
> On Saturday, September 20, 2014 4:26:36 PM UTC-4, Jonathan Rochkind wrote:
>
> Hi Kevin!
>
> I'm so glad to hear you guys are experimenting with Umlaut --
> if/when you go live, can you make sure to let us know, and possibly
> list yourselves here:
>
https://github.com/team-umlaut/umlaut/wiki/Umlaut-installations
> <
https://www.google.com/url?q=https%3A%2F%2Fgithub.com%2Fteam-umlaut%2Fumlaut%2Fwiki%2FUmlaut-installations&sa=D&sntz=1&usg=AFQjCNEDAZWx_bRgzAcATS9SjsjSg4wwWw>
https://github.com/team-umlaut/umlaut/wiki/Umlaut-installations
> ------------------------------------------------------------------------
> *From:*
umlaut-...@googlegroups.com <javascript:>
> [
umlaut-...@googlegroups.com <javascript:>] on behalf of Kevin Reiss
> [
kevin...@gmail.com <javascript:>]
> *Sent:* Saturday, September 20, 2014 4:14 PM
> *To:*
umlaut-...@googlegroups.com <javascript:>
> *Subject:* [umlaut] Re: Umlaut 4.0 beta! Beta testers needed!
> --
> You received this message because you are subscribed to the Google
> Groups "Umlaut" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to
umlaut-softwa...@googlegroups.com
> <mailto:
umlaut-softwa...@googlegroups.com>.