I have a few quick questions and observations to kick things off.
Is A9.com still the best location for OpenSearch namespace URI?
OpenSearch.org seems like a better choice.
What is the status of IANA registration of the OpenSearch MIME type
application/opensearchdescription+xml?
Should OpenSearch model its IANA considerations after Atom's example?
http://www.apps.ietf.org/rfc/rfc4287.html#sec-7
Why is AdultContent so liberal in its accepted values? 5 accepted
values for a negative result. Specifying a single desired behavior
will help clients and publishers meet spec.
I just checked and AtomPub uses "yes" and "no" as the two valid values for its one boolean attribute. Unfortunately, those aren't in the 5 that AdultContent currently accepts.
I added some thoughts below. This is the first time I have read through the spec (some really exciting things in there!) and I wasn’t involved in any prior discussions so some parts may be wayward (apologies up front), but maybe one or two useful things will come out of it.
Btw, the link to Plone on “OpenSearch_software” has an extra “http” at the front, breaking the link.
steven
OpenSearch Observations
------------------------
Namespace
=========
The namespace for a future draft could probably be moved to http://www.opensearch.org – this should affect any compatibility with previous version written against the old namespace.
Security
========
I know his is perhaps a bit early stage and certainly a major “add” but I wonder whether some security considerations could be considered. When advertising your OpenSearch description document you may wish to indicate that authentication may be required – perhaps against all or a subset of domains. In doing this, a client who is using the description document could use something like oAuth to get valid tokens that could be passed with the search and would then allow querying against backend sources as an authenticated user.
This would be useful on two fronts :
(1) that many sites don’t seem keen on handing out syndicated search results and perhaps would be more at ease with some pre-authentication and
(2) you could use other personalization features to provide contextually relevant results to the client.
Contact
=======
Just a thought, but I wonder whether content could be of multiple types such as “Email”, “XMPP” and so on. I have ideas on where that could go but a bit out of scope for just now I think.
Tags
====
I like tags, but often run into difficulties such as “New York” and so on. I wonder whether we could use a comma or semi-colon delimiter or give guidance on how to mark multiple words as a single logical work (such as syntax such as New_York).
The “Query” element
===================
I’d like to see this paragraph in the document expanded on a little (perhaps it will be a little more obvious when I read later on the in the document).
The "Developer" element
=======================
I’d be interested in where this came from, but I do wonder whether Name and Contact child elements would be valuable here – for example for technical support.
The "Attribution" element
=========================
I could see this extending for a host of reasons.
1. In a search feed I can imagine the number of sources could be quite large and many ask that you identify this. 256 characters wouldn’t be enough for some searches.
2. Going back to my security point above. If would could identify the sources of the data within the feeds, we could tie oAuth into this and allow users to query protected (and perhaps personalized) content.
Perhaps these two points could be added to a separate element and something like an IDREF to point at the appropriate source, but worth pointing out.
I do wonder whether this could be extended to add optional data about the sources such as URL, LastCache (because many real-time RSS sources don’t allow you to hit them all the time so the user may sometimes have to get data that is slightly out of data and this may be worth indicating), NextUpdate (the converse to LastCache).
The "SyndicationRight" element
==============================
How does “closed” work in the real world?
The "AdultContent" element
==========================
Per the adult content - I wonder whether we could make this polymorphic, or employ another element and use something like the Content Labels (the replacement for PICS).
I 100% agree with making the basic selection fairly simple - I wonder whether there is an option to allow users to extend this if necessary
The "Language" element
======================
Would this not have to be incorporated into the OpenSearchDescription ? So using the xml:lang attribute we could allow others to pull in a description in their language? I’m not sure how this would be implemented – by separate URL’s for requesting the description document for an appropriate language? Are there any sites that provide descriptions in multiple languages and how is that currently achieved?
OpenSearch 1.1 parameters
=========================
It may be useful to have a “GUID” as a parameter in here. This could be used to refer to state on the server associated with the user search. I am working with FAST just now which uses some facilities which are highly dependent on state (otherwise you need to re-execute multiple queries to get to the “starting position” of your current query).
State isn’t always great, but in some cases for performance reasons it is clearly needed and an optional GUID could be used to permit OpenSearch state (short or long term depending on implementation).
Suggested Template
==================
This is simply something that occurred to me based on some work I’ve done recently. Many sites that provide syndicated data interpret what is to go into the elements differently – many putting fully marked up data into the “Content” section. This ignores the fact that sites provide feeds in Atom and RSS (in my case I normalized to Atom for all feeds).
It may be nice if in the description of the OpenSearch if they could optionally specify one or more Xslt templates that could be used to display a row from a given source. This could be particularly useful when combining data from multiple sources into a single output. The output of a single Xslt for an Atom source is varied – some sites add media, others have different semantic interpretations of the elements and so on.
Finally
=======
Based on a client I am working with who wishes to syndicate their data to partners securely with a large FAST backend database, there are a number of other features such as categories, similarity searching and so on I could imagine being part of syndicated search interface, but at the moment this may only complicate things so I’ll ignore adding anything on this for now.
An errata to what I mentioned on Content Labels – POWDER is what it’s now termed and could be useful.
http://www.w3.org/TR/powder-formal/
Perhaps some of it could be useful – e.g. the tagset.
steven
Agree. A wiki would be nice to pull discussion together.
http://opensearch.wetpaint.com/
I have structured and cross-referenced according to the current discussion. Please feel free to hack away.
-steven
Ah crap – didn’t realize it was open to anyone to add to.
I put the content I had on that wetpaint wiki here:
http://www.opensearch.org/User_talk:Weblivz
Needs cleaning up a little as I lost all formatting. I will have the WetPaint site removed.
Yes, having a central location for this probably makes more sense (as does locking down the key pages!).
Anyone - feel free to copy, merge, change, obliterate what I have added as things progress.
yes please keep it in one place.
>
> -DeWitt
>
> On Thu, Oct 9, 2008 at 1:39 AM, Steven Livingstone-Perez
> <web...@hotmail.com> wrote:
> Agree. A wiki would be nice to pull discussion together.
>
>
>
> http://opensearch.wetpaint.com/
>
>
opensearch.org is the better place to discuss that, but if you have good
suggestions how to streamline it I am sure you will not meet much
resistance.
>
>
--
Best Regards,
Christian Ledermann