Thanks so much for sharing this. Glad you were at the meeting, and I'm
looking forward to some more in-depth thoughts from others in the
coming days. So far, I've just been browsing twitter's #schemaorg,
which clearly doesn't allow much richness.
A few quick thoughts / comments:
* Great news re: alternative syntaxes / serializations will still be
considered by these SE's
* Also glad to hear about moving groups to w3c, and that Jeni
Tennison's involved
* Evan & Co. turned NewsArticle around very quickly, and it's nice to
see a somewhat more deeply modeled data-type in schema.org. Wondering
what it would take for some of us in the LAM world to do the same for
"after the slash" schemaorg types underneath CreativeWork, Book,
Sculpture, &c.
* Good relations creator is Martin Hepp
As a <editorial> postscript of my own, I still don't see what's so
hard about RDFa:
##Declare some namespaces for metadata properties
<div xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:rdv="http://rdvocab.info/Elements/"
xmlns:rdfrbr="http://rdvocab.info/uri/schema/FRBRentitiesRDA/"
xmlns:rdrel="http://rdvocab.info/RDARelationshipsWEMI/"
xmlns:bibo="http://purl.org/ontology/bibo/"
## Identify your thing as a kind of thing with a name:
about="#book" typeof="rdfrbr:Manifestation">
## Give some metadata about it:
<div property="dc:identifier" content="dedupmrg17257152"></div>
<div property="dc:title" content="A history of philosophy in
America."></div>
<div property="dcterms:alternative" content=""></div>
<div property="dcterms:extent" content="2v. cm.."></div>
<div property="dc:identifier" content="1958381"></div>
<div property="dc:identifier" content="75040254"></div>
<div property="bibo:isbn" content="0399116508"></div>
<div property="dc:publisher" content="Capricorn"></div>
<div property="dc:publisher" content=""></div>
<div property="dcterms:created" content="1977"></div>
<div property="dc:subject" content="Philosophy, American --
History"></div>
## relate it to other named & typed things:
<div rel="rdrel:workManifested">
<div typeof="rdfrbr:Work"
about="http://example.org/thing21827719#work>
<div property="dc:dentifier" content="21827719"></div>
</div>
</div>
Thanks again for sharing your notes on this important meeting!
Best,
-Corey
--
Corey A Harper
Metadata Services Librarian
New York University Libraries
20 Cooper Square, 3rd Floor
New York, NY 10003-7112
212.998.2479
corey....@nyu.edu
-c
1) Thank Brian for putting these notes together (Thanks!) and
2) Ask if others had anything to add to Brian's notes...
I know the attending group was rather small (relatively speaking) so, to anyone else who participated, it would be great to get your thoughts/notes etc, especially as it pertains to LAM orgs.
Cordially,
Kevin
p.s. I agree with Corey about the RDFa. It's really not *that* hard. My biggest issue (unknown) is the how much semantic noise will be in our HTML once we've done everything humanly possible to make our data available (RDFa, microdata, schema.org...).
I'm not sure which spec you were reading, but definitely check out the
RDFa Primer [1], Introduction to RDFa from A List Apart [2] or the
Open Graph Protocol docs if you want a more helpful and friendly
introduction than the RDFa Syntax and Processing spec [4].
I am definitely leaning toward using microdata more these days, since
it allows you to do most of what RDFa offers in a much more
straightforward (I think) syntax...and parsing it is *way* easier. But
RDFa and Microformats still have their benefits, and pre-existing
deployments, so I think it's important to know what's available and be
able to use the right tool for the job.
//Ed
[1] http://www.w3.org/TR/xhtml-rdfa-primer/
[2] http://www.alistapart.com/articles/introduction-to-rdfa/
[3] http://ogp.me/
[4] http://www.w3.org/TR/rdfa-syntax/
I think your point about knowing "what's available and be[ing] able to
use the right tool for the job" is the crux of the issue, and I'd love
for there to be some more discussion about which approach is best
suited to particular use cases.
Microdata seems like a very effective--and much simpler than
RDFa--approach to embedding structured data into web documents. It
also seems to lose some of the more useful features of the RDF data
model & graph-based approach to metadata. My primary use cases for RDF
are centered around relationships *between* resources, and about the
topics and entities (people, places, things) that are in our data.
In your experience, are schema.org's enumerations & cannonical
references [1] sufficient for capturing this kind of information? I
suspect I'm blindered by years of studying semantic web technologies,
but it seems to me that these constructs aren't quite the same as a
generic syntax for saying resource A has relationship B to resource
C...
-Corey
[1] http://schema.org/docs/gs.html#advanced_enum
--
The interesting thing is that you can use microdata to describe a
graph of related resources. Take a look at the example at schema.org
for a Person [1]. I snipped a bit out here:
--
<div itemscope itemtype="http://schema.org/Person">
<span itemprop="name">Jane Doe</span>
Graduate students:
<a href="http://www.xyz.edu/students/alicejones.html" itemprop="colleagues">
Alice Jones</a>
<a href="http://www.xyz.edu/students/bobsmith.html" itemprop="colleagues">
Bob Smith</a>
</div>
--
If this was sitting on the web somewhere at http://example.com/jane it
would be basically asserting:
--
@prefix person <http://schema.org/Person#>.
<http://example.com/jane>
a person:Person,
person:name "Jane Doe",
person:colleagues (<http://www.xyz.edu/students/alicejones.html>,
<http://www.xyz.edu/students/bobsmith.html>) .
--
Kinda neat right?
//Ed
I think this is part of why I find the various http://rdfs.schema.org/
efforts to be significant.
Thanks for posting this concrete example.
-Corey
--
<div xmlns:schemaorg="http://schema.org/" typeof="schemaorg:Person">
<span property="schemaorg:name">Jane Doe</span>
Graduate students:
<a href="http://www.xyz.edu/students/alicejones.html" rel="schemaorg:colleagues">Alice Jones</a>
<a href="http://www.xyz.edu/students/bobsmith.html" rel="schemaorg:colleagues">Bob Smith</a>
</div>
That's not hugely different. Now, because RDFa can accommodate more complicated data, creating RDFa can get a lot more elaborate very quickly.
But, to speak to Brian's points (a few of which were expressed by the Google people), I would also say that the RDFa spec is *not* designed for the average web developer. The Schema.org vocabulary, which is distinct from, but nevertheless effectively acts as another authoritative source of information for, the microdata spec [1], is far more accessible. For starters - and this is a personal gripe - with microdata, I do not have to work as hard to determine the correct attribute to use, in part because the microdata attributes follow a naming pattern to some degree and in part because I feel they're a little more intuitive. On the other hand, I like being able to declare something to be multiple types and, if you have complex data, xmlns prefixes are handy. Above all, I think the most powerful thing schema.org has going for it presently is its website, which is replete with example after example about how to *implement* the schema.org vocabulary using the microdata HTML5 format. We shouldn't underestimate how such a basic thing as clear documentation and examples can be for individuals trying to figure this stuff out. (And, if a web developer starts using schema.org, Google, Yahoo! and Bing are, presumably, already in position to start using the embedded data, which is more than we - LODLAMers - can generally say about the data we've published and made available.)
About schema.org's documentation, I wonder precisely how much that level of commitment to documentation and examples has played a role in schema.org's (and microdata's) rise in the last few months (putting aside schema.org's backers for a moment). That's probably a study in itself. And, considering Ed's point about "using the right tool for the job," I'd be interested whether the LODLAM community believes
1) whether our data are sufficiently expressible in microdata
2) if not, is that a problem, otherwise what needs to change and
3) Is there a distinct role for microdata or RDFa within LAM organizations, depending on the use case
FWIW, a number of people have written about the positives and negatives of RDFa over microdata (and vice-versa). Despite the author's very, very close association with RDFa, I do think Manu Sporny's post comparing RDFa and microformats is very well done [2]. A quick Google search will return the thoughts of many more commentators.
Warmly,
Kevin
[1] http://dev.w3.org/html5/md/Overview.html
[2] http://manu.sporny.org/2011/uber-comparison-rdfa-md-uf/
Totally. This is incredibly important, especially if the point is to
encourage adoption. A friendly validator or inspector that helps
people quietly determine if they are doing things right can help quite
a bit too.
> 1) whether our data are sufficiently expressible in microdata
> 2) if not, is that a problem, otherwise what needs to change and
> 3) Is there a distinct role for microdata or RDFa within LAM organizations, depending on the use case
I guess it's kind of dodging the question, but I use "right tool for
the job" as a subjective measure. I know I have my own tool
preferences, and often the rightness of a tool is how well it fits how
I think, or a team I'm on thinks, and what we happen to be doing. I
think the only way we'll see commonality in serialization formats and
vocabularies is by actually using them, and seeing what works what
doesn't, and collectively arriving at the same conclusions.
At the moment it seems we are in the unfortunate position of having to
use RDFa to communicate metadata to Facebook, and Microdata to do the
same with Google, Bing, Yahoo, et al. Although you'd think it would be
in the interests of all involved to look for both and Microformats.
But this is a big step forward from where we were a few years ago,
when there weren't any widely deployed consumer grade tools looking
for structured data on the Web.
I think it can help a lot to stay focused on what you are trying to
enable by putting metadata in HTML, and choosing appropriately. So
that kind of begs your question of what the LOD-LAM use cases are. One
could do worse than look at the draft Use Case Report from the W3C
Library Linked Data Incubator Group:
http://www.w3.org/2005/Incubator/lld/wiki/UseCaseReport
But maybe (just maybe) it's more in the spirit of LOD-LAM to build
some apps that make library, archives and museum data available on the
web, and tools to use it, and see what works...
//Ed
//Ed
Linked Data: Hands on How-To
http://www.diglib.org/forums/2011forum/schedule/linked-data-hands-on-how-to/
We propose to facilitate a ‘hands-on’ workshop in which participants
can gain direct experience working with their collection data as
linked data sets.
Bring an Excel spreadsheet or other file containing your collection
data and a laptop prepared to jump in elbow to elbow with colleagues
who want to learn more about LOD in a fun, low key, collaborative way.
Don’t have collection data you can bring? That’s ok! We’ll supply 2-3
sample data sets to choose from to experiment on locally.
We will attempt to package everything in a way that if connectivity is
sparse the workshop can proceed on your laptop and you will go home
with material for future experimentation.
Targeted Learning outcomes: Understand how institutions can…
* express collection data as Linked Open Data according to LOD
ontologies/schemas from schema.org
* use and/or create schemas on schema.org
* process LOD collections in Refine and index them in Freebase
* consume LOD via Exhibit and/or Omeka/other tool of choice
This is designed to be an ~3 hour introductory workshop.
Can we seen an example?
//Ed
Ed, for an example, just check out the source of pretty much any
calisphere item. The first one I hit has loads of embedded DC (as well
as XTF) metadata. [1]
There was the start of a conversation on the DC General list about how
to best to work DC Metadata in to emerging HTML5 best practice, which
would happen alongside a process to map dcterms properties / classes
to the schema.org vocab. [2] If that discussion gets revived, I may
reach out to the two of you and others on this list for input.
Thanks,
-Corey
[1] http://content.cdlib.org/ark:/13030/tf387008q3/
[2] http://wiki.dublincore.org/index.php/Schema.org_Alignment
--