Here is our submission for the data integration challenge. Attached is a PDF file (with an additional image). References to the application itself and codebase are included below.
Cheers,
John Deck and Brian Stucky
BiSciCol Tracker Application
iEvoBio Data Integration Challenge
John Deck1, Brian Stucky2
1Berkeley Natural History Museums, University of California, Berkeley, CA 94720.
2Department of Ecology and Evolutionary Biology, Colorado University, Boulder, CO 80309.
The BiSciCol Tracker links and tracks scientific collection objects (specimens, sequences, images, etc.) and their digital metadata across multiple institutional collections, with each collection using their own information management system. The technical implementation is constructed from resource description framework (RDF) subject-predicate-object relations along with descriptions of subject and object types. By using web ontology language (OWL) transitive properties for object relationships, BiSciCol will make it easy to discover how objects from different collections are related to one another. Individual objects are built using globally unique identifiers (GUIDs) constructed from a combination of uniform resource identifiers (URIs) and institution specific identifiers that will link and resolve objects. The BiSciCol service will maintain an easily searchable and scalable index of object relationships, available via standard web browsers and representational state transfer (REST).
Links to the Prototype application and BiSciCol codebase plus a depiction of how the application works is available at http://biscicol.blogspot.com/p/biscicol-prototype.html. The BiSciCol code is licensed under the Modified Berkeley Standard Distribution (BSD) License.