> --
> You received this message because you are subscribed to the Google Groups "ResourceSpace" group.
> To post to this group, send email to resour...@googlegroups.com.
> To unsubscribe from this group, send email to resourcespac...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/resourcespace?hl=en.
>
>
--
Tom Gleason, PHP Developer
DBA Impressive Design
Exploring ResourceSpace at:
http://resourcespace.blogspot.com
I'd love to hear more about what you're planning -- we're a
library/archive/museum, and have done something similar with
ResourceSpace. In our case, though, because of the variability in
accession numbering/identifiers, as well as the fact that some images
can contain more than one object, we're doing the reverse. We actually
store the ResourceSpace resource ID in our cataloging records, and
this allows us to link the cataloging records back to specific
ResourceSpace resources. I've created a simple SOAP gateway that
allows users of the cataloging system (Minisis) to pull in images from
ResourceSpace via resource number.
It would probably also be possible to customize ResourceSpace to do
what you're looking for, although it would take a bit of thought on
the best way to approach it.
In any case, I'd be happy to talk further about how we've done this if
you're interested.
-David Dwiggins
Systems Librarian/Archivist, Historic New England
ddwiggins at historicnewengland dot org
617-994-5948
I wouldn't necessarily recommend direct to mysql.
This would be a prime example of the use of a new API and a custom API
method, ie, send search terms to a custom api method that uses
ResourceSpaces do_search method, and retrieve a customized search
based on all those results. However, the API doesn't exist yet!
David is probably the person for you to talk to since he's more
familiar with your kind of problem.
Interested to know if any other DAMs show potential for you and why.
Tom
> For more options, visit this group at http://groups.google.com/group/resourcespace?hl=en.
And if the MySQL database is still there, it's even easier -- just a
SQL query. Plus if the accession number was embedded in the IPTC of
the orginal file, it should still be there inside the ResourceSpace
filestore.
As part of our project, we also created a new unique numbering system
for everything in our collection. So in addition to the traditional
accession number (with all its quirks), we also have a "globally
unique serial number" GUSN that is generated by the collections
system. We can also assign these to items that wouldn't have
traditionally gotten an accession number, such as a reference book or
a single item in a large archival collection. You can see this with
records like
http://bit.ly/b9dWMs (museum object)
http://bit.ly/90p7G2 (archival object)
http://bit.ly/dyxKw3 (library book)
All three have different identifiers (accession number for the museum
object, reference code/collection code for the archival object, and
call number for the book.) But all also have a GUSN. And the ones that
have images each also have at least one Resource ID associated with
them, which is what's allowing the online catalog to pull an image.
We're still pondering how much data we want to pull back from our
cataloging system into ResourceSpace to flesh out the records there.
But generally for museum objects there is at least an accession number
and a title already.
In any case, I think it would be possible to make ResourceSpace do
what you want, but it would require some programming.
It sounds like maybe it would be a two step process:
1. Make it so that the accession numbers are always indexed as a unit.
We had problems with our accession numbers initially because
ResourceSpace wanted to pull them apart into separate words at the
periods. I believe I got around this by tweaking the indexing code so
that a period without a space after it would not be considered a word
break. But for your case, it sounds like what's really required is a
special flag on the field that tells the system not to attempt to
tokenize it at all, and instead to simply index the complete value. I
don't think off the top of my head there is a way to do this built in.
But I don't think it would be all that hard to implement. Ideally it
seems like the accession number would be pulled into a separate field
so that you could control this behavior independently of anything else
that might appear in the title field.
2. Once that was in place, the search code would also have to
understand that it should search this field as a unit rather than
breaking up the words.
I think there might be some value for this for other uses as well --
such as LC-formatted names.
Let me know your thoughts...
-David Dwiggins
# Configures separators to use when splitting keywords
# You must reindex after altering this if you have existing data in
the system (via pages/tools/reindex.php)
$config_separators=array("/","-",".","; ","(",")","\"","\\");
If you have A-1997, RS by default would split that into two keywords
since "-" is a config_separator.
If you remove "-" from the config_separators array, A-1997 would be
stored as a unit.
But it sounds like in this case what is really needed is the ability
to tell RS not to tokenize a field at all, and instead index it as a
unit. Doesn't seem like this would be too difficult to do.
It might also require a change on the search end, though, so that the
search parser doesn't try to break the accession number up into
individual words. So if searching a specific field that is configured
for complete indexing, it would not try to break up the search term.
Hmm.
-David Dwiggins
One approach to this would be to integrate an existing search
application into ResourceSpace. I have been doing a lot of work with
Apache Solr for the public-facing view of our catalog. This would
provide very advanced search capabilities. But it has the drawback of
forcing the end user to install and run a second (java-based) piece of
software, making it far more difficult for beginning users.
Interestingly, there does appear to be a PHP port of Lucene in the
Zend php framework. Lucene is the search engine underlying Solr, and
the PHP port claims to be 100% compatible with the original java
implementation. I wonder if this might be a path toward being able to
use more advanced search capabilities while still holding onto the
standalone/*AMP-only architecture of ResourceSpace. Hmm.
I'm not sure of the level of effort that would be involved in this,
but it might be something to explore. Maybe I'll try to poke around
the zend lucene implementation if I have a few minutes sometime
soon...
-David Dwiggins
> For more options, visit this group at http://groups.google.com/group/resourcespace?hl=en.
>
>