Rough notes from my Tagonomy project, but you'll get the idea:
Taxonomy-tagging hybrid
AS our taxonomies get more varied and more robust, they become way too
big for traditional drop downs or even just loading into a page, but
tagging results is too much varierty based on misspellings and
inconsistent use of tags. One solution might be a hybrid. In this
solution, the user interface behaves largely like delicious tagging,
but the tags have structure to them that aids in disambiguating their
meaning.
For example, consider the following definition of a tag:
Tag:
name
parent
implies
metadata
With this structure, tags can be nested according to the existing
taxonomies with which we normally deal, but the tags can also imply
other tags. For instance, we often get into difficulties trying to
mix subject matter tags in the same hierarchy as geographic tags. We
may want to tag a story as news/local/bearden and
sports/football/highschool/bearden_high, but this structures mix
taxonomies in a way that make it difficult to manage. Should it be
sports/football/college/myteam or college/football/myteam or what?
And a lot of times, you want to make sure that certain items get
multiple tags in a consitent manner.
Ideally, we'd tag the story as bearden_high which would imply
geo/tn/east/knox/bearden and education/school/highschool and
scope/city. I'm throwing the scope taxonomy out there as a
replacement for the 'local' tag which generally means that the story
is of use to a small community that we serve, but local doesn't do a
very good job of defining the scale of that community.
scope/
city/
state/
national/
Also note that I'm rooting the taxonomies under abstract terms like
scope, subject, topic, person, geo. We can have roots for yahoo, ap,
etc and the implication mechanism can be used for mapping between them
when needed. For instance, classifieds/automotive might imply
yahoo/transportation.
ht
We will need to keep track of the source of each tag also -- they can
be assigned, implied, mined or discovered.
Assigned tags are tags that have been added by staff. These are
the official tags.
Implied tags are tags that have been added by the system based on
other tags.
Mined tags are tags that are mechanically determined through data
mining the content. A better term might be 'derived'
Discovered tags are tags that have been given by the community.
news/
sports/
goverment/
entertainment/
technology/
community/
person/
place/
event/
organization/
school/
church/
charity/
business/
ad/
banner/
classifieds/
employment/
automotive/
realestate/
pets/
general/
The metadata of a tag is used to provide information that lies outside
of the tagonomy like ties to actual content items. Perhaps the
org/business/scripps tag might have metadata along the lines of
places:<place_id>,stocks:SSP, etc
When adding these tags, we'd not want to drill down all of the time.
instead, you'd want to type bare words and if they were unique, the
system should expand them for you. If you were a power user you might
use some syntax to help like double slashes to mean 'at any depth'.
Ie. If Ford is a person, a business, a car make and a county any you
wanted the county, you might type place//ford
root/
/inprint/frontpage
/inprint/lifestyle
/site-nav/lifestye
/people/
/place/
/subjectmatter/
/event/