Fwd: [gcd-contact] Comic Web Schema

Lionel English

unread,

Feb 7, 2012, 7:57:58 PM2/7/12

to GCD tech

Forwarded from the contact list

---------- Forwarded message ----------
From: Olson, Peter <pol...@marvel.com>
Date: Tue, Feb 7, 2012 at 4:35 PM
Subject: [gcd-contact] Comic Web Schema
To: con...@comics.org

Hi -

Over the past month or so, Marvel has been working on a draft microdata schema for comics for the W3C (the web's governing standards body) and schema.org (a consortium of search engines working on the semantic web). Microdata is a way of inserting semantic information into HTML, so that the specific data points can be available for search engines and other aggregators without obstructing the presentation of pages. The goal of the specification is to make comics easier to find on the web and to encourage cool new application development with comic data. The success of this initiative depends on widespread adoption, so right now we're reaching out to publishers, retailers, creators and fans to get feedback on the spec, and we'd love to get comics.org’s perspective on it. Once the spec is finalized and adopted by the W3C and schema.org, we plan to evangelize its use in order to help both print and digital comics become more consistent and discoverable across the web.

The draft spec is housed at http://www.w3.org/wiki/PeriodicalsComics and ongoing discussion can be found here: http://lists.w3.org/Archives/Public/public-vocabs/. There's nothing in the spec that's going to be very new to anyone who has dealt with comic data.

If you have any questions, feel free to reach out to me directly or join in the discussion over email.

- peter

Peter Olson | VP, Web and Application Development | Marvel Entertainment | e: pol...@marvel.com | ph: 212-576-4028

******************************************************************************

Nothing contained in this e-mail shall (a) be considered a legally binding agreement, amendment or modification of any agreement with Marvel, each of which requires a fully executed agreement to be received by Marvel or (b) be deemed approval of any product, packaging, advertising or promotion material, which may only come from Marvel's Legal Department.

******************************************************************************

THINK GREEN - SAVE PAPER - THINK BEFORE YOU PRINT!

--
GCD-Contact mailing list - gcd-c...@googlegroups.com
To unsubscribe send email to gcd-contact...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/gcd-contact

--
Lionel English

San Diego, CA
lio...@beanmar.net

Donald Dale Milne

unread,

Feb 7, 2012, 9:26:17 PM2/7/12

to GCD Tech List, GCD Main List, GCD Chat List

This may be of interest to many of us so I'm forwarding it to various
lists. It came in to the Contact list.

- Don Milne

On 2/7/2012 7:35 PM, Olson, Peter wrote:
>
> Hi -
>
> Over the past month or so, Marvel has been working on a draft
> microdata schema for comics for the W3C (the web's governing standards
> body) and schema.org (a consortium of search engines working on the
> semantic web). Microdata is a way of inserting semantic information
> into HTML, so that the specific data points can be available for
> search engines and other aggregators without obstructing the
> presentation of pages. The goal of the specification is to make comics
> easier to find on the web and to encourage cool new application
> development with comic data. The success of this initiative depends on
> widespread adoption, so right now we're reaching out to publishers,
> retailers, creators and fans to get feedback on the spec, and we'd

> love to getcomics.orgï¿½sperspective on it. Once the spec is finalized

> and adopted by the W3C and schema.org, we plan to evangelize its use
> in order to help both print and digital comics become more consistent
> and discoverable across the web.
>
> The draft spec is housed

> at_http://www.w3.org/wiki/PeriodicalsComics_and ongoing discussion can
> be found here:_http://lists.w3.org/Archives/Public/public-vocabs/_.

Lionel English

unread,

Feb 7, 2012, 11:24:59 PM2/7/12

to Peter Olson, GCD tech

Hi Peter,

From context, it would appear that Graphic Novel extends Book, and Periodical Series extends Creative Work. Can you tell me where those are defined?

On Tue, Feb 7, 2012 at 4:35 PM, Olson, Peter <pol...@marvel.com> wrote:

Hi -

Over the past month or so, Marvel has been working on a draft microdata schema for comics for the W3C (the web's governing standards body) and schema.org (a consortium of search engines working on the semantic web). Microdata is a way of inserting semantic information into HTML, so that the specific data points can be available for search engines and other aggregators without obstructing the presentation of pages. The goal of the specification is to make comics easier to find on the web and to encourage cool new application development with comic data. The success of this initiative depends on widespread adoption, so right now we're reaching out to publishers, retailers, creators and fans to get feedback on the spec, and we'd love to get comics.org’s perspective on it. Once the spec is finalized and adopted by the W3C and schema.org, we plan to evangelize its use in order to help both print and digital comics become more consistent and discoverable across the web.

The draft spec is housed at http://www.w3.org/wiki/PeriodicalsComics and ongoing discussion can be found here: http://lists.w3.org/Archives/Public/public-vocabs/. There's nothing in the spec that's going to be very new to anyone who has dealt with comic data.

If you have any questions, feel free to reach out to me directly or join in the discussion over email.

- peter

Peter Olson | VP, Web and Application Development | Marvel Entertainment | e: pol...@marvel.com | ph: 212-576-4028

******************************************************************************

Nothing contained in this e-mail shall (a) be considered a legally binding agreement, amendment or modification of any agreement with Marvel, each of which requires a fully executed agreement to be received by Marvel or (b) be deemed approval of any product, packaging, advertising or promotion material, which may only come from Marvel's Legal Department.

******************************************************************************

THINK GREEN - SAVE PAPER - THINK BEFORE YOU PRINT!

--

GCD-Contact mailing list - gcd-c...@googlegroups.com
To unsubscribe send email to gcd-contact...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/gcd-contact

--
Lionel English

Chair, Board of Directors

Grand Comics Database

http://www.comics.org/

Lionel English

unread,

Feb 8, 2012, 12:02:46 AM2/8/12

to GCD tech

He cc:ed the tech list on his reply to me, but he's not a member, so it probably bounced. I'm forwarding his answer here in case others are interested.

---------- Forwarded message ----------
From: Olson, Peter <pol...@marvel.com>

Date: Tue, Feb 7, 2012 at 8:31 PM
Subject: RE: [gcd-contact] Comic Web Schema
To: Lionel English <lionel....@comics.org>
Cc: GCD tech <gcd-...@googlegroups.com>

Hi Lionel -

They're defined on schma.org - http://www.schema.org/CreativeWork and http://www.schema.org/Book. There's not a centralized place for storing and discussing these unfortunately. The current discussion on the thread is that Perodical Series is better modeled as a child of the "Intangible" schema, so that might move.

-peter

-----Original Message-----
From: Lionel English [mailto:lionel....@comics.org]
Sent: Tue 2/7/2012 11:24 PM
To: Olson, Peter
Cc: GCD tech
Subject: Re: [gcd-contact] Comic Web Schema

Hi Peter,

From context, it would appear that Graphic Novel extends Book, and
Periodical Series extends Creative Work. Can you tell me where those are
defined?

On Tue, Feb 7, 2012 at 4:35 PM, Olson, Peter <pol...@marvel.com> wrote:

> **

>
> Hi -
>
> Over the past month or so, Marvel has been working on a draft microdata
> schema for comics for the W3C (the web's governing standards body) and
> schema.org (a consortium of search engines working on the semantic web).
> Microdata is a way of inserting semantic information into HTML, so that the
> specific data points can be available for search engines and other
> aggregators without obstructing the presentation of pages. The goal of the
> specification is to make comics easier to find on the web and to encourage
> cool new application development with comic data. The success of this
> initiative depends on widespread adoption, so right now we're reaching out
> to publishers, retailers, creators and fans to get feedback on the spec,
> and we'd love to get comics.org's perspective on it. Once the spec is
> finalized and adopted by the W3C and schema.org, we plan to evangelize
> its use in order to help both print and digital comics become more
> consistent and discoverable across the web.
>

> The draft spec is housed at *http://www.w3.org/wiki/PeriodicalsComics*<http://www.w3.org/wiki/PeriodicalsComics>and ongoing discussion can be found here:
> *http://lists.w3.org/Archives/Public/public-vocabs/*<http://lists.w3.org/Archives/Public/public-vocabs/>.

Jochen G.

unread,

Feb 8, 2012, 4:26:59 AM2/8/12

to pol...@marvel.com, gcd-...@googlegroups.com

Hi Peter,

Gave it a short read. Interesting idea and proposal.

Some thoughts and comments for discussion.

One problem I see is the distinction between periodocal series and
Graphic novel. At comics.org we had similar discussions that this is a
somewhat US-perspective and doesn't really cover for example European
album series which have aspects of both, a series and a book. Periodical
has two meanings, the substantive describes the physical object
(magazine, journal), the adjective describes the repeating and regular
aspect of publishing.

Not sure what the proposal is intended to cover in the end, but it has
to be understood that in this form (and this distinction of periodical
series vs. graphic novels) it covers only some aspects of international
comics. We at comics.org haven't really found THE solution for this but
are making steps in that direction. If a comic can be both, part of a
series and a graphic novel (this e.g. concerns book fields like ISBN)
the problem wouldn't be there. Does it need to be 'periodical series' or
can it just be 'series' ?

I see why one wants to have the Diamond code in there, but this only
applies to US comics and only in the last 20 (?) years. I doubt that it
can be in an international standard (as I understand the aim of
schema.org to be).

Similar is publisherId, these exist for books as well, but are not
present in the schema for books (probably because nowadays ISBNs will
that role). These numbers can only be known by the publisher, so I am
not sure if these can be used in the end.

The last two comments are trying to take an outside view.

Jochen

Am 08.02.2012 01:57, schrieb Lionel English:
> Forwarded from the contact list
>
> ---------- Forwarded message ----------
> From: *Olson, Peter* <pol...@marvel.com <mailto:pol...@marvel.com>>
> Date: Tue, Feb 7, 2012 at 4:35 PM
> Subject: [gcd-contact] Comic Web Schema

> To: con...@comics.org <mailto:con...@comics.org>
>
>
> __

>
> Hi -
>
> Over the past month or so, Marvel has been working on a draft microdata
> schema for comics for the W3C (the web's governing standards body) and

> schema.org <http://schema.org/> (a consortium of search engines working

> on the semantic web). Microdata is a way of inserting semantic
> information into HTML, so that the specific data points can be available
> for search engines and other aggregators without obstructing the
> presentation of pages. The goal of the specification is to make comics
> easier to find on the web and to encourage cool new application
> development with comic data. The success of this initiative depends on
> widespread adoption, so right now we're reaching out to publishers,
> retailers, creators and fans to get feedback on the spec, and we'd love

> to getcomics.org <http://comics.org/>ï¿½sperspective on it. Once the spec
> is finalized and adopted by the W3C and schema.org <http://schema.org/>,

> we plan to evangelize its use in order to help both print and digital
> comics become more consistent and discoverable across the web.
>

> The draft spec is housed at_http://www.w3.org/wiki/PeriodicalsComics_and

> ongoing discussion can be found

> here:_http://lists.w3.org/Archives/Public/public-vocabs/_. There's

> nothing in the spec that's going to be very new to anyone who has dealt
> with comic data.
>
> If you have any questions, feel free to reach out to me directly or join
> in the discussion over email.
>
> - peter
>
> Peter Olson | VP, Web and Application Development | Marvel Entertainment

> | e: pol...@marvel.com <mailto:pol...@marvel.com> | ph: 212-576-4028
> <tel:212-576-4028>

>
> ******************************************************************************
>
> Nothing contained in this e-mail shall (a) be considered a legally
> binding agreement, amendment or modification of any agreement with
> Marvel, each of which requires a fully executed agreement to be received
> by Marvel or (b) be deemed approval of any product, packaging,
> advertising or promotion material, which may only come from Marvel's
> Legal Department.
>
> ******************************************************************************
>
> THINK GREEN - SAVE PAPER - THINK BEFORE YOU PRINT!
>
>
>
>
>
> --
> GCD-Contact mailing list - gcd-c...@googlegroups.com

> <mailto:gcd-c...@googlegroups.com>

> To unsubscribe send email to gcd-contact...@googlegroups.com

> <mailto:gcd-contact%2Bunsu...@googlegroups.com>

> For more options, visit this group at
> http://groups.google.com/group/gcd-contact
>
>
>
> --
> Lionel English
> San Diego, CA

> lio...@beanmar.net <mailto:lio...@beanmar.net>
>
> --
> GCD-Tech mailing list - gcd-...@googlegroups.com
> To unsubscribe send email to gcd-tech+u...@googlegroups.com

> For more options, visit this group at

> http://groups.google.com/group/gcd-tech

Lionel English

unread,

Feb 8, 2012, 2:05:07 PM2/8/12

to gcd-...@googlegroups.com

Per my invitation, Peter has joined the tech list. He also suggests that those of us who are interested consider posting comments to the w3.org discussion board mentioned in his original post.

> to getcomics.org <http://comics.org/>’sperspective on it. Once the spec

Lionel English

unread,

Feb 8, 2012, 2:08:50 PM2/8/12

to gcd-...@googlegroups.com

I have another question about the schema, btw: do the schema attributes represent attributes an object *must* have, or that it *may* have? I'm guessing the latter, as presumably the schema offers standardized ways you *may* tag content on the web, but of course there's nothing forcing you to tag anything at all. But I'd just like to confirm that.

Olson, Peter

unread,

Feb 8, 2012, 2:19:35 PM2/8/12

to gcd-...@googlegroups.com

That is indeed the case. Obviously the more complete the tags are the better, but no guns are being held to anyone’s head to completely tag entities on the web.

-peter

Henry Andrews

unread,

Feb 8, 2012, 5:25:39 PM2/8/12

to gcd-...@googlegroups.com

Hi Peter- I'm thrilled to see this project and thank you so much for contacting us!

Glancing over the schema proposal, a few things spring to mind. I have not had time to go over the background discussion yet, so I apologize for any duplication of topics already discussed, and hope to revisit this all in more detail perhaps this weekend.

First, how complete are you trying to make this? Do you want to provide space for as many attributes as possible, or just gather some core data that's common to most publications? Or even just most publications within a particular sphere (current U.S. comics, which is what this schema seems to be geared towards).

Second, Jochen's mentioned that there are many non-U.S. comics that don't fit this schema particularly well. I'd like to add that there are many non-current comics that don't fit this schema either. Some of these concerns overlap. Distribution is, historically, a much more complex topic than just Diamond. Do you want to track which distributor distributed the comic book? We plan to track the national distributor at some point in the future (for U.S. books- the concept may be different for other countries, which is one reason we don't do this already).

Publication is also much more complex for older U.S. comics (pretty much everything I mention should be qualified as applying to older U.S. comics, which is my area of interest). I see you've attached the publisher ID to the issue rather than the series, which will definitely help things. But the question of which publishers are "the same" and what they should be called is one that continually vexes us. It's relatively easy for large publishers with well-documented histories, but even with Marvel and DC it's not at all hard to come up with problems in earlier phases of their histories. And once you get into smaller publishers, it's very, very murky. We currently use three separate fields to track publisher, and it's still very problematic.

Volume and Issue "numbers" can't really be numbers in the schema. At least not in general. There are all sorts of crazy things that serve as issue or volume numbers, plus you have to somehow deal with things that aren't actually numbered, but have some sort of convention when they're discussed. Some of that could go in subtitle instead, but I think you'll hit limitations.

There's also the question of *which* issue number. Often there's more than one- the most common being a volume number plus issue number in the indicia (formal publication data), with a whole number on the cover. i.e. volume 2 issue 1 in the indicia, but 13 on the cover (for a monthly book with one volume a year).

Which brings up another point- volume is not a series attribute, unless you're going to use a much more fine-grained notion of series. DC used annual volume numbers up through at least the 1970s, I think. They just didn't appear on the cover or anywhere prominent. For instance, Adventure Comics #430 is part of volume 39, an #431 begins volume 40.

All of this goes back to the question of how detailed you want to be. We're still refining our own schema to capture more of these sorts of things and handle more and more difficult situations.

A few other notes- a fixed set of comic creator roles will cause problems (it certainly has for us- we get around it with lots of notes). Also, have you given any thoughts to crediting editors, or people in other business roles?

And then the biggest content-related issue- you're only tracking credits on an issue basis. How do you plan to handle anthologies, which at one time were the dominant form of comic book (and may still be in some markets, I don't know).

thanks,

-henry

From: "Olson, Peter" <pol...@marvel.com>
To: gcd-...@googlegroups.com
Sent: Wednesday, February 8, 2012 11:19 AM
Subject: RE: [gcd-tech] Fwd: [gcd-contact] Comic Web Schema

Lionel English

unread,

Feb 8, 2012, 6:02:17 PM2/8/12

to gcd-...@googlegroups.com

I'll have more comments when I have more time, but I want to add something to what Henry said. I think you need to create at least one other object, such as Comic Story, which is derived from Creative Work. A creative work has an author and may have an editor. Comic Stories could be where you add attributes like artist/illustrator and other creative roles, and perhaps page counts. Or you may want to assign the writer and artist to the creator roles, and then all creative personnel to the author role. You may also wish to create a Comic Stip entity, since that's slightly different than a Comic Story created for a Comics Periodical.

At any rate, both a graphic novel and a comics periodical would then be containers that could hold one or more Comic Stories and/or Comic Strip installments (if you want to draw a distinction), just as, more generally, a Book may contain many Stories (text narratives). In the current publishing environment, it is the norm to have a single story in a book, and a single comics story in a comics periodical, but as Henry notes those aren't the only possible configurations and the schema should probably address that somehow.

Olson, Peter

unread,

Feb 8, 2012, 7:35:19 PM2/8/12

to gcd-...@googlegroups.com

Hi Henry & Lionel –

We thought very hard about adding stories to this schema but ultimately decided against including them. (We use the series > issue > story model internally - indeed we adopted it after guidance from you guys some years ago. It’s integral to how our comic data model works at Marvel.)

The main reason we decided against including stories (and housing creator and character assignments therein) is ease of implementation. In order to be most effective, the schema needs to be adopted widely and in order to get wide adoption it needs to be easy to understand and implement. Most retailer and fan sites don’t really think about comics as granularly as our groups do. Retailers and fans don’t encounter loose comic stories (the same way that particle physicists don’t encounter loose quarks) – they’re dealing with issues. Fans are writing about them and retailers are selling them in print and digital formats. It’s unlikely, I think, that a fan or retailer (or even many publishers) will go to the length of teasing out the collection of stories in an issue and then attribute creation credits to them. (Additionally, a lot of the schema.org direction is about modeling saleable products - loose stories aren’t for sale). Ultimately we decided it’s best to aggregate talent assignments at the issue level, even if that doesn’t perfectly match the “standard model” for comics as we understand them.

In any case even if the spec is adopted, is likely to be iterative and we can maybe introduce stories to the spec at a later date.

Re some of your other points –

See my other email about the distributor code – It can be any distributor (with Diamond being the obvious candidate in the US) or no distributor. In a lot of cases many of the identifiers for comics aren’t going to be present at all. Not every data point enumerated here has to be present for every comic on the web.

Volume – I’ll change volume from a number to a string in the spec. Your qualms are EXACTLY why we use the start year as the volume on marvel.com. No one f$%&-ing knows which volume number is which.

Creator roles – Editor is part of the Creative Work schema, as are a number of other roles. The schema.org design pattern kind of stipulates that each creative work type has a pre-defined set of creative roles. It’s problematic when other creative types arise but it’s kind of their call.

Anthologies – under this schema, creative credits will be aggregated at the issue level (for reasons outlined above).

Comic strips – I feel like comic strips are kind of their own beast. Even though they share some elements of storytelling with comic books, they are a distinct art form and are distributed and enumerated quite differently than comic books. I agree they probably deserve their own spec distinct from comic issues – and I’m probably not very qualified to write it. Maybe someone on your side can pick up the baton.

“Which issue number” – that type of question is a bit beyond the scope of what we can do with the schema. The schema only gives a structure to fill in – it doesn’t stipulate what should occupy any data point

Henry Andrews

unread,

Feb 8, 2012, 8:47:13 PM2/8/12

to gcd-...@googlegroups.com

Thanks for the replies, Peter. In particular, the direction of describing "salable units" is helpful in giving an overal framework for what is and isn't important in this initial effort. Likewise the notion that the schema doesn't try to enforce what is put in each field is helpful- when you get right down to it, that enforcement is the main preoccupation of the GCD. We usually either agree on the fields but not the exact contents, or agree on the broad principle of a set of fields but can't work out specific fields that capture the exact data that we want.

So, revising my earlier feedback:

* Issue numbers- as you note, the "which source" concern goes away. But I think it makes it even more important that it be a string rather than a number, so folks can put in unusual numbers. I don't even have to go far to come up with examples- Marvel ran dual-numbered issues for a while before they returned their post-Heroes Reborn series back to the numbering they would have had if not interrupted. DC's recent brief revival of Adventure Comics was dual numbered on both the cover and the indicia.

* Publishers- The difficulties are probably sidestepped in terms of schema definition, although when you get to older U.S. comics from more obscure publishers the usefulness of this field may be dubious given the varied (and often hotly contested) opinions as to who the "real" publisher of any given issue is. I suppose that's more of a content quality issue than a schema definition issue, although I guess you could make it a multi-valued field and then all of the possible publishers could be added. Come to think of it, you'll need to handle joint DC/Marvel and similar publications as well (we don't handle those well currently either). Hmm.. Are the books with DC/Vertigo on the DC or Vertigo? I guess that gets back into policing content, so it's not your concern. (can you tell that publication history is one of my primary interests?)

* Credits- I would definitely hope for an editor role. Some people might want to see, for instance, whether a given Silver Age DC title was edited by Julius Schwartz or not. Beyond that, if you're stuck with a fixed set you're stuck with a fixed set, and this is a reasonable set.

On the topic of distribution, I think there's some confusion (although your reply remains valid and useful). In earlier times there was no (visible or known) distribution code, but there sometimes was a distribution mark indicating who did the distribution (A.N.C., I.N.D., P.D.C., the Atlas globe, etc.) However, I'm not sure this would be of interest for your purposes. It's of interest to researchers, but doesn't apply much to current commercial concerns- even most collectors don't pay attention to it.

Looking at your open issues, I would advise against enumerating comic book formats. While there are several common ones, there are endless variations, especially globally. If you do attempt it, I advise including a catch-all for things that don't fit.

I would also advise treating comic strips as a separate but related project. The credits largely apply to both, but the unit of publication is so different- you want to go strip by strip, not newspaper by newspaper. The biggest problem is handling reprints of strips in comic books, but if you are not tracking reprints then you can more easily punt this.

Finally, since I don't have a formal position in the GCD these days, I feel free enough to be gauche and suggest our own issue id as a field. An increasing number of commercial sites are using GCD data, and it could also be viewed as a way to access extended information beyond the scope of the schema.org project, which is pretty much what commercial sites use it for now. But I would not consider it a slight if this suggestion were ignored :-)

thanks,

-henry

From: "Olson, Peter" <pol...@marvel.com>
To: gcd-...@googlegroups.com

Sent: Wednesday, February 8, 2012 4:35 PM

Donald Dale Milne

unread,

Feb 8, 2012, 10:03:01 PM2/8/12

to gcd-...@googlegroups.com

Just one minor thought on "loose stories arenï¿½t for sale"...YET. They
could very be sold digitally in the future, as individual songs from an
album are now.

- Don Milne

On 2/8/2012 7:35 PM, Olson, Peter wrote:
>
> Hi Henry & Lionel ï¿½

>
> We thought very hard about adding stories to this schema but
> ultimately decided against including them. (We use the series > issue
> > story model internally - indeed we adopted it after guidance from

> you guys some years ago. Itï¿½s integral to how our comic data model

> works at Marvel.)
>
> The main reason we decided against including stories (and housing
> creator and character assignments therein) is ease of implementation.
> In order to be most effective, the schema needs to be adopted widely
> and in order to get wide adoption it needs to be easy to understand

> and implement. Most retailer and fan sites donï¿½t really think about
> comics as granularly as our groups do. Retailers and fans donï¿½t

> encounter loose comic stories (the same way that particle physicists

> donï¿½t encounter loose quarks) ï¿½ theyï¿½re dealing with issues. Fans are

> writing about them and retailers are selling them in print and digital

> formats. Itï¿½s unlikely, I think, that a fan or retailer (or even many

> publishers) will go to the length of teasing out the collection of
> stories in an issue and then attribute creation credits to them.
> (Additionally, a lot of the schema.org direction is about modeling

> saleable products - loose stories arenï¿½t for sale). Ultimately we
> decided itï¿½s best to aggregate talent assignments at the issue level,
> even if that doesnï¿½t perfectly match the ï¿½standard modelï¿½ for comics

> as we understand them.
>
> In any case even if the spec is adopted, is likely to be iterative and
> we can maybe introduce stories to the spec at a later date.
>

> Re some of your other points ï¿½
>
> See my other email about the distributor code ï¿½ It can be any

> distributor (with Diamond being the obvious candidate in the US) or no
> distributor. In a lot of cases many of the identifiers for comics

> arenï¿½t going to be present at all. Not every data point enumerated

> here has to be present for every comic on the web.
>

> Volume ï¿½ Iï¿½ll change volume from a number to a string in the spec.

> Your qualms are EXACTLY why we use the start year as the volume on
> marvel.com. No one f$%&-ing knows which volume number is which.
>

> Creator roles ï¿½ Editor is part of the Creative Work schema, as are a

> number of other roles. The schema.org design pattern kind of
> stipulates that each creative work type has a pre-defined set of

> creative roles. Itï¿½s problematic when other creative types arise but
> itï¿½s kind of their call.
>
> Anthologies ï¿½ under this schema, creative credits will be aggregated

> at the issue level (for reasons outlined above).
>

> Comic strips ï¿½ I feel like comic strips are kind of their own beast.

> Even though they share some elements of storytelling with comic books,
> they are a distinct art form and are distributed and enumerated quite
> differently than comic books. I agree they probably deserve their own

> spec distinct from comic issues ï¿½ and Iï¿½m probably not very qualified

> to write it. Maybe someone on your side can pick up the baton.
>

> ï¿½Which issue numberï¿½ ï¿½ that type of question is a bit beyond the scope

> of what we can do with the schema. The schema only gives a structure

> to fill in ï¿½ it doesnï¿½t stipulate what should occupy any data point
>
> -peter
>
> *From:*gcd-...@googlegroups.com [mailto:gcd-...@googlegroups.com]
> *On Behalf Of *Lionel English
> *Sent:* Wednesday, February 08, 2012 6:02 PM
> *To:* gcd-...@googlegroups.com
> *Subject:* Re: [gcd-tech] Fwd: [gcd-contact] Comic Web Schema

> ------------------------------------------------------------------------
>
> *From:*"Olson, Peter" <pol...@marvel.com <mailto:pol...@marvel.com>>
> *To:* gcd-...@googlegroups.com <mailto:gcd-...@googlegroups.com>
> *Sent:* Wednesday, February 8, 2012 11:19 AM
> *Subject:* RE: [gcd-tech] Fwd: [gcd-contact] Comic Web Schema

>
> That is indeed the case. Obviously the more complete the tags are

> the better, but no guns are being held to anyoneï¿½s head to

> completely tag entities on the web.
>
> -peter
>

> *From:*gcd-...@googlegroups.com
> <mailto:gcd-...@googlegroups.com>
> [mailto:gcd-...@googlegroups.com
> <mailto:gcd-...@googlegroups.com>] *On Behalf Of *Lionel English
> *Sent:* Wednesday, February 08, 2012 2:09 PM
> *To:* gcd-...@googlegroups.com <mailto:gcd-...@googlegroups.com>
> *Subject:* Re: [gcd-tech] Fwd: [gcd-contact] Comic Web Schema

>
> I have another question about the schema, btw: do the schema
> attributes represent attributes an object *must* have, or that it
> *may* have? I'm guessing the latter, as presumably the schema
> offers standardized ways you *may* tag content on the web, but of
> course there's nothing forcing you to tag anything at all. But I'd
> just like to confirm that.
>
> On Wed, Feb 8, 2012 at 11:05 AM, Lionel English
> <lio...@beanmar.net <mailto:lio...@beanmar.net>> wrote:
>
> Per my invitation, Peter has joined the tech list. He also
> suggests that those of us who are interested consider posting

> comments to the w3.org <http://w3.org/> discussion board mentioned

> in his original post.
>
> On Wed, Feb 8, 2012 at 1:26 AM, Jochen G. <gcd...@garcke.de
> <mailto:gcd...@garcke.de>> wrote:
>
> Hi Peter,
>
> Gave it a short read. Interesting idea and proposal.
>
> Some thoughts and comments for discussion.
>
> One problem I see is the distinction between periodocal series and

> Graphic novel. At comics.org <http://comics.org/> we had similar

> discussions that this is a
> somewhat US-perspective and doesn't really cover for example European
> album series which have aspects of both, a series and a book.
> Periodical
> has two meanings, the substantive describes the physical object
> (magazine, journal), the adjective describes the repeating and regular
> aspect of publishing.
>
> Not sure what the proposal is intended to cover in the end, but it has
> to be understood that in this form (and this distinction of periodical
> series vs. graphic novels) it covers only some aspects of
> international

> comics. We at comics.org <http://comics.org/> haven't really found

> THE solution for this but
> are making steps in that direction. If a comic can be both, part of a
> series and a graphic novel (this e.g. concerns book fields like ISBN)
> the problem wouldn't be there. Does it need to be 'periodical
> series' or
> can it just be 'series' ?
>
> I see why one wants to have the Diamond code in there, but this only
> applies to US comics and only in the last 20 (?) years. I doubt
> that it
> can be in an international standard (as I understand the aim of

> schema.org <http://schema.org/> to be).

>
> Similar is publisherId, these exist for books as well, but are not
> present in the schema for books (probably because nowadays ISBNs will
> that role). These numbers can only be known by the publisher, so I am
> not sure if these can be used in the end.
>
> The last two comments are trying to take an outside view.
>
> Jochen
>
> Am 08.02.2012 01:57, schrieb Lionel English:
>
> > Forwarded from the contact list
> >
> > ---------- Forwarded message ----------
>
> > From: *Olson, Peter* <pol...@marvel.com
> <mailto:pol...@marvel.com> <mailto:pol...@marvel.com
> <mailto:pol...@marvel.com>>>
> > Date: Tue, Feb 7, 2012 at 4:35 PM
> > Subject: [gcd-contact] Comic Web Schema
>
> > To: con...@comics.org <mailto:con...@comics.org>

> <mailto:con...@comics.org <mailto:con...@comics.org>>

> >
> >
> > __
>
> >
> > Hi -
> >
> > Over the past month or so, Marvel has been working on a draft
> microdata
> > schema for comics for the W3C (the web's governing standards
> body) and
>

> > schema.org <http://schema.org/> <http://schema.org/> (a

> consortium of search engines working
>
> > on the semantic web). Microdata is a way of inserting semantic
> > information into HTML, so that the specific data points can be
> available
> > for search engines and other aggregators without obstructing the
> > presentation of pages. The goal of the specification is to make
> comics
> > easier to find on the web and to encourage cool new application
> > development with comic data. The success of this initiative
> depends on
> > widespread adoption, so right now we're reaching out to publishers,
> > retailers, creators and fans to get feedback on the spec, and
> we'd love
>

> > to getcomics.org <http://getcomics.org/>
> <http://comics.org/>ï¿½sperspective on it. Once the spec

> > is finalized and adopted by the W3C and schema.org

> <http://schema.org/> <http://schema.org/>,

>
> > we plan to evangelize its use in order to help both print and
> digital
> > comics become more consistent and discoverable across the web.
> >
>
> > The draft spec is housed
> at_http://www.w3.org/wiki/PeriodicalsComics_and
>
> > ongoing discussion can be found
>
> > here:_http://lists.w3.org/Archives/Public/public-vocabs/_. There's
>
> > nothing in the spec that's going to be very new to anyone who
> has dealt
> > with comic data.
> >
> > If you have any questions, feel free to reach out to me directly
> or join
> > in the discussion over email.
> >
> > - peter
> >
> > Peter Olson | VP, Web and Application Development | Marvel
> Entertainment
>
> > | e: pol...@marvel.com <mailto:pol...@marvel.com>

> <mailto:pol...@marvel.com <mailto:pol...@marvel.com>> | ph:

> 212-576-4028
> > <tel:212-576-4028>
>
> >
> >
> ******************************************************************************
> >
> > Nothing contained in this e-mail shall (a) be considered a legally
> > binding agreement, amendment or modification of any agreement with
> > Marvel, each of which requires a fully executed agreement to be
> received
> > by Marvel or (b) be deemed approval of any product, packaging,
> > advertising or promotion material, which may only come from Marvel's
> > Legal Department.
> >
> >
> ******************************************************************************
> >
> > THINK GREEN - SAVE PAPER - THINK BEFORE YOU PRINT!
> >
> >
> >
> >
> >
> > --
> > GCD-Contact mailing list - gcd-c...@googlegroups.com
> <mailto:gcd-c...@googlegroups.com>
>

> > <mailto:gcd-c...@googlegroups.com

> <mailto:gcd-c...@googlegroups.com>>
>
> > To unsubscribe send email to
> gcd-contact...@googlegroups.com
> <mailto:gcd-contact%2Bunsu...@googlegroups.com>
>

> > <mailto:gcd-contact%2Bunsu...@googlegroups.com
> <mailto:gcd-contact%252Buns...@googlegroups.com>>

>
> > For more options, visit this group at
> > http://groups.google.com/group/gcd-contact
> >
> >
> >
> > --
> > Lionel English
> > San Diego, CA
>
> > lio...@beanmar.net <mailto:lio...@beanmar.net>

> <mailto:lio...@beanmar.net <mailto:lio...@beanmar.net>>

>
> >
> > --
> > GCD-Tech mailing list - gcd-...@googlegroups.com

> <mailto:gcd-...@googlegroups.com>

>
> > To unsubscribe send email to
> gcd-tech+u...@googlegroups.com

> <mailto:gcd-tech%2Bunsu...@googlegroups.com>

>
> > For more options, visit this group at
>
> > http://groups.google.com/group/gcd-tech
>
>
> --
> GCD-Tech mailing list - gcd-...@googlegroups.com

> <mailto:gcd-...@googlegroups.com>

>
> To unsubscribe send email to gcd-tech+u...@googlegroups.com

> <mailto:gcd-tech%2Bunsu...@googlegroups.com>

> For more options, visit this group at
> http://groups.google.com/group/gcd-tech
>
>
>
>
> --
> Lionel English
>
> San Diego, CA

> lio...@beanmar.net <mailto:lio...@beanmar.net>

>
>
>
>
> --
> Lionel English
>
> San Diego, CA

> lio...@beanmar.net <mailto:lio...@beanmar.net>
>
> --
> GCD-Tech mailing list - gcd-...@googlegroups.com

> <mailto:gcd-...@googlegroups.com>

> To unsubscribe send email to gcd-tech+u...@googlegroups.com

> <mailto:gcd-tech+u...@googlegroups.com>

> For more options, visit this group at
> http://groups.google.com/group/gcd-tech
>
> --
> GCD-Tech mailing list - gcd-...@googlegroups.com

> <mailto:gcd-...@googlegroups.com>

> To unsubscribe send email to gcd-tech+u...@googlegroups.com

> <mailto:gcd-tech%2Bunsu...@googlegroups.com>

> For more options, visit this group at
> http://groups.google.com/group/gcd-tech
>
> --
> GCD-Tech mailing list - gcd-...@googlegroups.com

> <mailto:gcd-...@googlegroups.com>

> To unsubscribe send email to gcd-tech+u...@googlegroups.com

> <mailto:gcd-tech%2Bunsu...@googlegroups.com>

> For more options, visit this group at
> http://groups.google.com/group/gcd-tech
>
>
>
>
> --
> Lionel English
>
> San Diego, CA

> lio...@beanmar.net <mailto:lio...@beanmar.net>
>
> --
> GCD-Tech mailing list - gcd-...@googlegroups.com

> <mailto:gcd-...@googlegroups.com>

> To unsubscribe send email to gcd-tech+u...@googlegroups.com

> <mailto:gcd-tech+u...@googlegroups.com>

Olson, Peter

unread,

Feb 8, 2012, 10:03:51 PM2/8/12

to gcd-...@googlegroups.com

Hey Henry -

Editor is enumerated as part of the Creative Work schema, so it's implicitly part of the comic schema (you can see it in the sample code that I included). I'm not sure what the schema.org/W3C policy is around 3rd party ID systems - I'll see what they say about comics.org IDs.

- peter

__________________

Peter Olson
VP, Web and Application Development
Marvel Entertainment

212-576-4028
pol...@marvel.com

Nothing contained in this e-mail shall (a) be considered a legally binding agreement, amendment or modification of any agreement with Marvel, each of which requires a fully executed agreement to be received by Marvel or (b) be deemed approval of any product, packaging, advertising or promotion material, which can only come from Marvel's Legal Department.

-----Original Message-----
From: gcd-...@googlegroups.com on behalf of Henry Andrews
Sent: Wed 2/8/2012 8:47 PM
To: gcd-...@googlegroups.com
Subject: Re: [gcd-tech] Fwd: [gcd-contact] Comic Web Schema

Thanks for the replies, Peter. In particular, the direction of describing "salable units" is helpful in giving an overal framework for what is and isn't important in this initial effort. Likewise the notion that the schema doesn't try to enforce what is put in each field is helpful- when you get right down to it, that enforcement is the main preoccupation of the GCD. We usually either agree on the fields but not the exact contents, or agree on the broad principle of a set of fields but can't work out specific fields that capture the exact data that we want.

So, revising my earlier feedback:
* Issue numbers- as you note, the "which source" concern goes away. But I think it makes it even more important that it be a string rather than a number, so folks can put in unusual numbers. I don't even have to go far to come up with examples- Marvel ran dual-numbered issues for a while before they returned their post-Heroes Reborn series back to the numbering they would have had if not interrupted. DC's recent brief revival of Adventure Comics was dual numbered on both the cover and the indicia.

* Publishers- The difficulties are probably sidestepped in terms of schema definition, although when you get to older U.S. comics from more obscure publishers the usefulness of this field may be dubious given the varied (and often hotly contested) opinions as to who the "real" publisher of any given issue is. I suppose that's more of a content quality issue than a schema definition issue, although I guess you could make it a multi-valued field and then all of the possible publishers could be added. Come to think of it, you'll need to handle joint DC/Marvel and similar publications as well (we don't handle those well currently either). Hmm.. Are the books with DC/Vertigo on the DC or Vertigo? I guess that gets back into policing content, so it's not your concern. (can you tell that publication history is one of my primary interests?)

* Credits- I would definitely hope for an editor role. Some people might want to see, for instance, whether a given Silver Age DC title was edited by Julius Schwartz or not. Beyond that, if you're stuck with a fixed set you're stuck with a fixed set, and this is a reasonable set.

On the topic of distribution, I think there's some confusion (although your reply remains valid and useful). In earlier times there was no (visible or known) distribution code, but there sometimes was a distribution mark indicating who did the distribution (A.N.C., I.N.D., P.D.C., the Atlas globe, etc.) However, I'm not sure this would be of interest for your purposes. It's of interest to researchers, but doesn't apply much to current commercial concerns- even most collectors don't pay attention to it.

Looking at your open issues, I would advise against enumerating comic book formats. While there are several common ones, there are endless variations, especially globally. If you do attempt it, I advise including a catch-all for things that don't fit.

I would also advise treating comic strips as a separate but related project. The credits largely apply to both, but the unit of publication is so different- you want to go strip by strip, not newspaper by newspaper. The biggest problem is handling reprints of strips in comic books, but if you are not tracking reprints then you can more easily punt this.

Finally, since I don't have a formal position in the GCD these days, I feel free enough to be gauche and suggest our own issue id as a field. An increasing number of commercial sites are using GCD data, and it could also be viewed as a way to access extended information beyond the scope of the schema.org project, which is pretty much what commercial sites use it for now. But I would not consider it a slight if this suggestion were ignored :-)

thanks,
-henry

>________________________________
> From: "Olson, Peter" <pol...@marvel.com>
>To: gcd-...@googlegroups.com
>Sent: Wednesday, February 8, 2012 4:35 PM
>Subject: RE: [gcd-tech] Fwd: [gcd-contact] Comic Web Schema
>
>

>Hi Henry & Lionel -

>
>We thought very hard about adding stories to this schema but ultimately decided against including them. (We use the series > issue > story model internally - indeed we adopted it after guidance from you guys some years ago. It's integral to how our comic data model works at Marvel.)
>

>The main reason we decided against including stories (and housing creator and character assignments therein) is ease of implementation. In order to be most effective, the schema needs to be adopted widely and in order to get wide adoption it needs to be easy to understand and implement. Most retailer and fan sites don't really think about comics as granularly as our groups do. Retailers and fans don't encounter loose comic stories (the same way that particle physicists don't encounter loose quarks) - they're dealing with issues. Fans are writing about them and retailers are selling them in print and digital formats. It's unlikely, I think, that a fan or retailer (or even many publishers) will go to the length of teasing out the collection of stories in an issue and then attribute creation credits to them. (Additionally, a lot of the schema.org direction is about modeling saleable products - loose stories aren't for sale).

Ultimately we decided it's best to aggregate talent assignments at the issue level, even if that doesn't perfectly match the "standard model" for comics as we understand them.
>
>In any case even if the spec is adopted, is likely to be iterative and we can maybe introduce stories to the spec at a later date.
>

>Re some of your other points -
>See my other email about the distributor code - It can be any distributor (with Diamond being the obvious candidate in the US) or no distributor. In a lot of cases many of the identifiers for comics aren't going to be present at all. Not every data point enumerated here has to be present for every comic on the web.
>Volume - I'll change volume from a number to a string in the spec. Your qualms are EXACTLY why we use the start year as the volume on marvel.com. No one f$%&-ing knows which volume number is which.
>Creator roles - Editor is part of the Creative Work schema, as are a number of other roles. The schema.org design pattern kind of stipulates that each creative work type has a pre-defined set of creative roles. It's problematic when other creative types arise but it's kind of their call.
>Anthologies - under this schema, creative credits will be aggregated at the issue level (for reasons outlined above).
>Comic strips - I feel like comic strips are kind of their own beast. Even though they share some elements of storytelling with comic books, they are a distinct art form and are distributed and enumerated quite differently than comic books. I agree they probably deserve their own spec distinct from comic issues - and I'm probably not very qualified to write it. Maybe someone on your side can pick up the baton.
>"Which issue number" - that type of question is a bit beyond the scope of what we can do with the schema. The schema only gives a structure to fill in - it doesn't stipulate what should occupy any data point

Lionel English

unread,

Feb 9, 2012, 2:42:31 AM2/9/12

to gcd-...@googlegroups.com

If the goal is to keep it from getting too complicated, then I'd say don't try to shove characters, objects, etc in there. At most you might want to do something to identify a feature. E.g. Spider-Man, X-Men, etc. In a more general sense, you might think of these as licensed properties--something like G.I. Joe, for example, is a property that stretches across several mediums.

For books the enumerated formats appear to be paperback, hardback, and digital. Graphic novel is derived from book, so it should inherit those formats, and I don't see a need to get more detailed than that if you want to keep it international. For periodicals, you may just want to go with print vs digital. I'd avoid getting into more specific formats such as magazine vs newspaper vs US comic-book vs digest, etc. Especially once you consider indie publishers, who may not conform to any of the standard sizes, and standard sizes will vary by country anyway.

Can a given object have more than one of the same attribute? I.e. can a Creative Work have multiple author attributes, or just a single attribute with multiple values? Can an item have multiple publishers?

Do you see a need to add a page count attribute? Price?

Olson, Peter

unread,

Feb 9, 2012, 9:20:58 AM2/9/12

to gcd-...@googlegroups.com

I noticed page count was missing and it should be there as of the last iteration I did last night. All schemas can attributes with multiple values – I don’t know of a use case when there’s multiple publishers but the schema would theoretically allow it.

Price is handled through the Product/Offer schemas (for currently on-sale books). There might be some value to having an original printed price as well, but I don’t know if that adds confusion or not.

Martin Schlömer

unread,

Feb 10, 2012, 7:35:56 PM2/10/12

to gcd-tech

The way i understand it schema.org provides standardized HTML tags
that search engines can pick up to index sites.

So, who would be using these tags? I assume it's only the site (e.g.
Marvel.com) providing original content or maybe even online shops
selling the books, so Google et al. can link to the product more
precisely and maybe even assist in purchasing it. A database like GCD
will have little to gain from these markups because it would require a
web crawler to collect the data which is tedious and error prone. And
quality of markup data will go downhill with the number of sites
offering the same books with individually compiled info.
For GCD a direct data export of new releases from the publisher would
propably be prefered.

Let's focus on the two things a publisher or point of sale really
wants: be found for the right search terms and provide precise
ordering information.

Search terms:
Someone looking for "Cap. America" may be interested in reading Secret
Avengers as well. But as long as the meta data doesn't include "Cap.
America" the search engine can only guess if he's in the issue. So i
would make sure to keep character information. Same for a synopsis
with major story events. Info like page count is interesting too, but
it's hardly going to be included in a textual search.

Ordering information:
Multiple distributor names + distributor codes sound logical these
days since the same book is published in different media
simultaniously. When you publish a book you want to give the buyer a
choice of getting the print edition or the digital edition (or maybe
even a TPB containing the story - which would make all this more
difficult but is a great service to the customer).
So either you supply the information for all distributors yourselves
or each distributor will only markup their own items on their site
(i.e. Diamond's and Comixology's markup will be different even though
it is the same story).

I hope i didn't go off topic too far in my assumptions.

Lionel English

unread,

Feb 13, 2012, 5:12:29 PM2/13/12

to gcd-...@googlegroups.com

I don't really have any opinion on whether or not the GCD should adopt these proposed standards for enhancing our HTML. Most of it would repeat information already on our pages in plain text, so presumably it's already being indexed.

I disagree with Martin that there's a real benefit in adding tags for characters, as I commented earlier. I think it might make more sense to tag features--the licensed properties that are the selling point of most issues/books. Other characters can be mentioned in the synopsis depending on their importance.

I agree with Martin though about the distributor/distribution tags. At the moment, Diamond has a lock on the US direct market. But, however small there's still a newsstand market as well, and there are foreign markets with their own distributors. So a distributor code field is useless without also providing a distributor id to indicate *whose* code is being given. For books/graphic novels you have the ISBN, which is a standard data point that any retailer or wholesaler can use, and that has a publisher code built in. For periodicals, it's unlikely your typical newsstand agent will be selling issues on line, so it's more likely to be used by online comics retailers than large book retailers. In the US, at least. But it would still be better to provide some form of id that can uniquely identify a specific issue, without resorting to a particular distributors internal code. The standard UPC code on most periodicals only serves to identify the periodical, not the issue, but many comics publishers, including Marvel IIRC, have been using the option five digit UPC add on to identify issue numbers. Accordingly I think you should add a UPC tag at the periodical level and rename the one you have at the issue level and either remove the distributor code or add another entity (distributor) and expand the use of distributor code to more clearly allow for multiple distributors.

I also agree with Henry--the use of UPC codes as issue identifiers is fairly recent. The people most likely to use the web schema to identify periodical issues (as opposed to books) are comics retailers--and with comics retailers back issues are just as much a commodity as current issues. So a unique per issue id would be useful for those types of retailers, and the GCD *is* in a position to provide such ids. Ebay has been using our data for well over a year now to provide a standardized comics catalog, so there's a precedent if needed.

--
GCD-Tech mailing list - gcd-...@googlegroups.com
To unsubscribe send email to gcd-tech+u...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/gcd-tech

Olson, Peter

unread,

Feb 14, 2012, 10:20:34 AM2/14/12

to gcd-...@googlegroups.com

I think that there is likely to be some modest traffic benefit to your site if you do add the markup. It does make a lot of sense to me to add the comics.org ID to the schema (and obviously it’ll be an easier sell to add that if you use the markup on your pages).

I’ll try to amend the spec to handle multiple distributors – your arguments make a lot of sense there. I know in the specific case of comixology they display the diamond code for most books on their site as well as their own IDs.

-peter

From: gcd-...@googlegroups.com [mailto:gcd-...@googlegroups.com] On Behalf Of Lionel English

Sent: Monday, February 13, 2012 5:12 PM
To: gcd-...@googlegroups.com

******************************************************************************

Andres Jimenez

unread,

Feb 14, 2012, 10:33:13 AM2/14/12

to gcd-...@googlegroups.com, pol...@marvel.com

Peter,

Is there any intentions to try to make this schema an international standard?

Cheers,

Andrés

2012/2/14 Olson, Peter <pol...@marvel.com>:

--
Andres Jimenez

Henry Andrews

unread,

Feb 14, 2012, 11:14:38 AM2/14/12

to gcd-...@googlegroups.com

I just took a look at the schema.org intro pages and I think it would be in our best interest to add this markup- search engines would be able to better locate and rank our pages, which would increase the likelihood of external search engine users finding not only our site, but the most relevant pages on our site. We tend to show up pretty well already, but this would allow for more advanced interpretation of our pages which is a good thing.

I'd be willing to commit to putting some time into the technical work on the GCD side.

Some parts of the schema will initially be difficult for us to use properly- for instance, isolating individual creators in our credit fields can be very difficult if the credit is complex. However, this situation will improve over time, and there's still plenty of fields where adding the schema.org markup would be trivial.

thanks,

-henry

From: "Olson, Peter" <pol...@marvel.com>
To: gcd-...@googlegroups.com

Sent: Tuesday, February 14, 2012 7:20 AM
Subject: RE: [gcd-tech] Re: Fwd: [gcd-contact] Comic Web Schema

Olson, Peter

unread,

Feb 14, 2012, 1:38:46 PM2/14/12

to gcd-...@googlegroups.com

Let me know if you need any help in that regard. We were able to mock up the markup (say that 10 times fast) on Marvel.com pretty quickly. The idea again is that this is easy to implement.

-peter

Olson, Peter

unread,

Feb 14, 2012, 1:41:07 PM2/14/12

to Andres Jimenez, gcd-...@googlegroups.com

Ideally, yes. The w3c folks suggested I contact Japan's Digital Comics Association. I haven't reached out to any international groups as of yet, though.

-peter

Henry Andrews

unread,

Feb 14, 2012, 3:52:11 PM2/14/12

to gcd-...@googlegroups.com

Really, the only thing I think I'll have any problems with are the things that are due to our own weird internal formatting, which derives from the pre-Web days when this was all done in tab-separated text files :-P I'll have to come up with a really weird credit to illustrate what I mean.

Likewise, I'd be happy to help answer any questions about GCD ids, although they're fairly straightforward. You'd likely want to use the issue ids primarily, as we may disagree on definitions of "series", and will almost certainly have trouble lining up definitions of "publisher". Although I suppose you could either allow multiple values for those, or just not worry about the correlation being exact. Obviously current major publishers such as Marvel are easy to correlate.

But issue IDs are what most people end up using directly- you'll see them on several sites that use our data.

thanks,

-henry

From: "Olson, Peter" <pol...@marvel.com>
To: gcd-...@googlegroups.com

Sent: Tuesday, February 14, 2012 10:38 AM

Lionel English

unread,

Feb 14, 2012, 4:44:26 PM2/14/12

to gcd-...@googlegroups.com

Issues are the things that are sold, so that makes them of the most interest to retailers, wholesalers, and the secondary market.

Henry Andrews

unread,

Feb 14, 2012, 4:59:08 PM2/14/12

to gcd-...@googlegroups.com

Yeah, they're the only item that unquestionably maps to a physical, distinct *thing*. All the rest of it is conceptual. Issue ID is really the fundamental unit in this sort of use case (and story idea in most others- the fact that we root our hierarchy in publishers and series is really the cause of so many of our problems).

cheers,

-henry

From: Lionel English <lio...@beanmar.net>
To: gcd-...@googlegroups.com
Sent: Tuesday, February 14, 2012 1:44 PM

Fred Wilson

unread,

Feb 19, 2012, 6:58:15 PM2/19/12

to gcd-...@googlegroups.com

But thank goodness you do, or my personal tracking systems would be unusable!

Olson, Peter

unread,

Mar 7, 2012, 8:02:35 PM3/7/12

to gcd-...@googlegroups.com

Hey guys –

Just an update (sorry I’ve been a little slow in keeping you up-to-date in this). I’ve made a few updates to the schema based on yours and other people’s comments. Regarding the comics.org internal ID, some of the schema.org members had a good suggestion. Basically there’s a method to extend data points in the schema (http://www.w3.org/wiki/PeriodicalsComics#Local_ID_Spaces) in order to reference local ID spaces. This would allow your ID (or internal publisher IDs or distributor IDs) to be referenced when needed by web site owners without having to formally specify it within the schema. Basically this provides a lot of flexibility to reference different IDs without cluttering up the property definitions or giving a privileged position to any one organization.

The schema has been moved to candidate status as well, so thanks for all your help so far.

-peter

Jeff David Lowe

unread,

Mar 12, 2012, 10:40:31 PM3/12/12

to gcd-...@googlegroups.com

All,
I have a tangential question about MySQL and installers.
Does anyone know of a way to install MySQL such that:
1) don't use Rpms on Linux
2) put it in a created directory and not default directory
3) create root account/password
4) and do this all programmatically?

Even a push in the right direction is appreciated.
Thanks,
Jeff

Lionel English

unread,

Mar 13, 2012, 2:14:23 AM3/13/12

to gcd-...@googlegroups.com

http://dev.mysql.com/doc/refman/5.5/en/source-installation.html

--
GCD-Tech mailing list - gcd-...@googlegroups.com
To unsubscribe send email to gcd-tech+u...@googlegroups.com
For more options, visit this group at http://groups.google.com/group/gcd-tech

Reply all

Reply to author

Forward