Modified:
trunk/feedparser/docs/xml/feedparser.xml
trunk/feedparser/feedparser.py
trunk/www/docs/changes-42.html
trunk/www/docs/index.html
trunk/www/docs/reference-bozo.html
trunk/www/docs/reference-bozo_exception.html
trunk/www/docs/reference-encoding.html
trunk/www/docs/reference-entry-summary.html
trunk/www/docs/reference-entry-tags.html
trunk/www/docs/reference-entry.html
trunk/www/docs/reference-etag.html
trunk/www/docs/reference-feed-rights.html
trunk/www/docs/reference-feed-tags.html
trunk/www/docs/reference-feed-textinput.html
trunk/www/docs/reference-feed.html
trunk/www/docs/reference-headers.html
trunk/www/docs/reference-href.html
trunk/www/docs/reference-modified.html
trunk/www/docs/reference-namespaces.html
trunk/www/docs/reference-status.html
trunk/www/docs/reference-version.html
trunk/www/docs/reference.html
Log:
Modified: trunk/feedparser/docs/xml/feedparser.xml
==============================================================================
--- trunk/feedparser/docs/xml/feedparser.xml (original)
+++ trunk/feedparser/docs/xml/feedparser.xml Tue Mar 11 22:48:14 2008
@@ -3198,7 +3198,6 @@
<listitem><para>Updated the whitelist of <link
linkend="advanced.sanitization.html">acceptable HTML elements and
attributes</link> based on the latest draft of the &html5; specification.</para></listitem>
<listitem><para><xref linkend="advanced.sanitization.css"/>.
(Previous versions of &feedparser; simply stripped all inline styles.)
Many thanks to Sam Ruby for implementing this, despite my insistence
that it was impossible.</para></listitem>
<listitem><para><xref linkend="advanced.sanitization.svg"/> and <xref
linkend="advanced.sanitization.mathml"/>. Many thanks to Jacques
Distler for patiently debugging these features.</para></listitem>
-<listitem><para>&feedparser; always outputs embedded markup as
wellformed &xhtml;, no matter how malformed the input. You can serve
the output as <literal>application/xhtml+xml</literal>, and it will
never trigger your browser's user-hostile draconian error handling.</para></listitem>
<listitem><para>&iri; support for every element that can contain a &uri;.</para></listitem>
<listitem><para>Ability to <link
linkend="advanced.base.disable">disable relative &uri; resolution</link>.</para></listitem>
<!--<listitem><para>Ability to <link
linkend="advanced.sanitization.disable">disable &html; sanitizing</link>.</para></listitem>-->
Modified: trunk/feedparser/feedparser.py
==============================================================================
--- trunk/feedparser/feedparser.py (original)
+++ trunk/feedparser/feedparser.py Tue Mar 11 22:48:14 2008
@@ -152,7 +152,7 @@
codepoint2name[ord(codepoint)]=name
# BeautifulSoup parser used for parsing microformats from embedded
HTML content
-# http://www.crummy.com/software/BeautifulSoup/.
+# http://www.crummy.com/software/BeautifulSoup/
# feedparser is tested with BeautifulSoup 3.0.x, but it might work
with the
# older 2.x series. If it doesn't, and you can figure out why, I'll
accept a
# patch and modify the compatibility statement accordingly.
Modified: trunk/www/docs/changes-42.html
==============================================================================
--- trunk/www/docs/changes-42.html (original)
+++ trunk/www/docs/changes-42.html Tue Mar 11 22:48:14 2008
@@ -31,8 +31,6 @@
<li>
<a href="html-sanitization.html#advanced.sanitization.svg" title="SVG
Sanitization">SVG Sanitization</a> and <a
href="html-sanitization.html#advanced.sanitization.mathml"
title="MathML Sanitization">MathML Sanitization</a>. Many thanks to
Jacques Distler for patiently debugging these features.</li>
<li>
-<span class="application">Universal Feed Parser</span> always outputs
embedded markup as wellformed <acronym title="Extensible HyperText
Markup Language">XHTML</acronym>, no matter how malformed the input.
You can serve the output as <tt
class="literal">application/xhtml+xml</tt>, and it will never trigger
your browser's user-hostile draconian error handling.</li>
-<li>
<acronym title="International Resource Identifier">IRI</acronym>
support for every element that can contain a <acronym title="Uniform
Resource Identifier">URI</acronym>.</li>
<li>Ability to <a
href="resolving-relative-links.html#advanced.base.disable"
title="Disabling Relative URIs Resolution">disable relative <acronym
title="Uniform Resource Identifier">URI</acronym> resolution</a>.</li>
<li>
Modified: trunk/www/docs/index.html
==============================================================================
--- trunk/www/docs/index.html (original)
+++ trunk/www/docs/index.html Tue Mar 11 22:48:14 2008
@@ -219,7 +219,7 @@
<li><span class="section"><a href="reference-feed-textinput.html#reference.feed.textinput.link">feed.textinput.link</a></span></li>
<li><span class="section"><a href="reference-feed-textinput.html#reference.feed.textinput.name">feed.textinput.name</a></span></li>
<li><span class="section"><a href="reference-feed-textinput.html#reference.feed.textinput.description">feed.textinput.description</a></span></li>
-<li><span class="section"><a
href="reference-feed-textinput.html#id307430">Annotated example</a></span></li>
+<li><span class="section"><a
href="reference-feed-textinput.html#id307408">Annotated example</a></span></li>
</ul>
</li>
<li>
Modified: trunk/www/docs/reference-bozo.html
==============================================================================
--- trunk/www/docs/reference-bozo.html (original)
+++ trunk/www/docs/reference-bozo.html Tue Mar 11 22:48:14 2008
@@ -28,7 +28,7 @@
<div></div>
</div>
<p>See <a href="bozo.html" title="Bozo Detection">Bozo Detection</a>
for more details on the <tt class="varname">bozo</tt> bit.</p>
-<a name="id315359"></a><table class="tip" border="0" summary="">
+<a name="id315337"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">bozo</tt> may not be present. Some platforms,
such as Mac OS X 10.2 and some versions of FreeBSD, do not include an
<acronym title="Extensible Markup Language">XML</acronym> parser in
their <span class="application">Python</span> distributions. <span
class="application">Universal Feed Parser</span> will still work on
these platforms, but it will not be able to detect whether a feed is
well-formed. However, it <span class="emphasis"><em>can</em></span>
detect whether a feed's character encoding is incorrectly declared.
(This is done in <span class="application">Python</span>, not by the
<acronym title="Extensible Markup Language">XML</acronym> parser.) See
<a href="character-encoding.html" title="Character Encoding
Detection">Character Encoding Detection</a> for details.</td></tr>
Modified: trunk/www/docs/reference-bozo_exception.html
==============================================================================
--- trunk/www/docs/reference-bozo_exception.html (original)
+++ trunk/www/docs/reference-bozo_exception.html Tue Mar 11 22:48:14 2008
@@ -28,7 +28,7 @@
<div></div>
</div>
<p>See <a href="bozo.html" title="Bozo Detection">Bozo Detection</a>
for more details.</p>
-<a name="id315447"></a><table class="tip" border="0" summary="">
+<a name="id315425"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">bozo_exception</tt> will only be present if <tt
class="varname">bozo</tt> is <tt class="constant">1</tt>.</td></tr>
Modified: trunk/www/docs/reference-encoding.html
==============================================================================
--- trunk/www/docs/reference-encoding.html (original)
+++ trunk/www/docs/reference-encoding.html Tue Mar 11 22:48:14 2008
@@ -27,11 +27,11 @@
</div>
<div></div>
</div>
-<a name="id314937"></a><table class="note" border="0" summary="">
+<a name="id314915"></a><table class="note" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/note.png" alt="Note" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">The process
by which <span class="application">Universal Feed Parser</span>
determines the character encoding of the feed is explained in <a
href="character-encoding.html" title="Character Encoding
Detection">Character Encoding Detection</a>.</td></tr>
</table>
-<a name="id314957"></a><table class="tip" border="0" summary="">
+<a name="id314935"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">This element
always exists, although it may be an empty string if the character
encoding can not be determined.</td></tr>
</table>
Modified: trunk/www/docs/reference-entry-summary.html
==============================================================================
--- trunk/www/docs/reference-entry-summary.html (original)
+++ trunk/www/docs/reference-entry-summary.html Tue Mar 11 22:48:14 2008
@@ -40,7 +40,7 @@
<h3>See also</h3>
<ul><li><a href="reference-entry-summary_detail.html" title="entries[i].summary_detail">entries[i].summary_detail</a></li></ul>
</div>
-<a name="id309594"></a><table class="note" border="0" summary="">
+<a name="id309572"></a><table class="note" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/note.png" alt="Note" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">Some feeds
include both a <tt class="literal">summary</tt> and <tt
class="literal">description</tt> element for each entry. In this case,
the first element will be available in <tt
class="varname">entry['summary']</tt> and the second will be available
in <tt class="varname">entry['content'][0]</tt>.</td></tr>
</table>
Modified: trunk/www/docs/reference-entry-tags.html
==============================================================================
--- trunk/www/docs/reference-entry-tags.html (original)
+++ trunk/www/docs/reference-entry-tags.html Tue Mar 11 22:48:14 2008
@@ -34,7 +34,7 @@
<li><tt class="literal">/rdf:RDF/rdf:channel/rdf:item/dc:subject</tt></li>
</ul>
</div>
-<a name="id312162"></a><table class="note" border="0" summary="">
+<a name="id312140"></a><table class="note" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/note.png" alt="Note" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">Prior to
version 4.0, <span class="application">Universal Feed Parser</span>
exposed categories in <tt class="varname">feed.category</tt> (the
primary category) and <tt class="varname">feed.categories</tt> (a list
of tuples containing the domain and term of each category). These uses
are still supported for backward compatibility, but you will not see
them in the parsed results unless you explicitly ask for them.</td></tr>
</table>
Modified: trunk/www/docs/reference-entry.html
==============================================================================
--- trunk/www/docs/reference-entry.html (original)
+++ trunk/www/docs/reference-entry.html Tue Mar 11 22:48:14 2008
@@ -31,7 +31,7 @@
<li><tt class="literal">/rdf:RDF/rdf:item</tt></li>
</ul>
</div>
-<a name="id308456"></a><table class="tip" border="0" summary="">
+<a name="id308434"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">This element
always exists, although it may be an empty list.</td></tr>
</table>
Modified: trunk/www/docs/reference-etag.html
==============================================================================
--- trunk/www/docs/reference-etag.html (original)
+++ trunk/www/docs/reference-etag.html Tue Mar 11 22:48:14 2008
@@ -28,7 +28,7 @@
<div></div>
</div>
<p>The purpose of <tt class="varname">etag</tt> is explained more
fully in <a href="http-etag.html" title="ETag and Last-Modified
Headers">ETag and Last-Modified Headers</a>.</p>
-<a name="id315162"></a><table class="tip" border="0" summary="">
+<a name="id315140"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">etag</tt> will only be present if the feed was
retrieved from a web server, and only if the web server provided an
ETag <acronym title="Hypertext Transfer Protocol">HTTP</acronym> header
for the feed. If the feed was parsed from a local file or from a
string in memory, <tt class="varname">etag</tt> will not be present.</td></tr>
Modified: trunk/www/docs/reference-feed-rights.html
==============================================================================
--- trunk/www/docs/reference-feed-rights.html (original)
+++ trunk/www/docs/reference-feed-rights.html Tue Mar 11 22:48:14 2008
@@ -24,7 +24,7 @@
<div class="abstract"><p>A human-readable copyright statement for the feed.</p></div>
<p>If this contains <acronym title="HyperText Markup
Language">HTML</acronym> or <acronym title="Extensible HyperText Markup
Language">XHTML</acronym>, it is <a href="html-sanitization.html"
title="Sanitization">sanitized</a> by default.</p>
<p>If this contains <acronym title="HyperText Markup
Language">HTML</acronym> or <acronym title="Extensible HyperText Markup
Language">XHTML</acronym>, certain (X)HTML elements within this value
may contain relative <acronym title="Uniform Resource
Identifier">URI</acronym>s. If so, they are <a
href="resolving-relative-links.html" title="Relative Link
Resolution">resolved according to a set of rules</a>.</p>
-<a name="id304929"></a><table class="note" border="0" summary="">
+<a name="id304906"></a><table class="note" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/note.png" alt="Note" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">For
machine-readable copyright information, see <a
href="reference-feed-license.html" title="feed.license">feed.license</a>.</td></tr>
</table>
Modified: trunk/www/docs/reference-feed-tags.html
==============================================================================
--- trunk/www/docs/reference-feed-tags.html (original)
+++ trunk/www/docs/reference-feed-tags.html Tue Mar 11 22:48:14 2008
@@ -34,7 +34,7 @@
<li><tt class="literal">/rdf:RDF/rdf:channel/dc:subject</tt></li>
</ul>
</div>
-<a name="id307985"></a><table class="note" border="0" summary="">
+<a name="id307963"></a><table class="note" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/note.png" alt="Note" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">Prior to
version 4.0, <span class="application">Universal Feed Parser</span>
exposed categories in <tt class="varname">feed.category</tt> (the
primary category) and <tt class="varname">feed.categories</tt> (a list
of tuples containing the domain and term of each category). These uses
are still supported for backward compatibility, but you will not see
them in the parsed results unless you explicitly ask for them.</td></tr>
</table>
Modified: trunk/www/docs/reference-feed-textinput.html
==============================================================================
--- trunk/www/docs/reference-feed-textinput.html (original)
+++ trunk/www/docs/reference-feed-textinput.html Tue Mar 11 22:48:14 2008
@@ -66,7 +66,7 @@
<div class="section" lang="en">
<div class="titlepage">
<div><div><h3 class="title">
-<a name="id307430" class="skip" href="#id307430" title="link to this
section"><img src="images/permalink.gif" alt="[link]" title="link to
this section" width="8" height="9"></a> Annotated example</h3></div></div>
+<a name="id307408" class="skip" href="#id307408" title="link to this
section"><img src="images/permalink.gif" alt="[link]" title="link to
this section" width="8" height="9"></a> Annotated example</h3></div></div>
<div></div>
</div>
<p>This is a text input in a feed:</p>
Modified: trunk/www/docs/reference-feed.html
==============================================================================
--- trunk/www/docs/reference-feed.html (original)
+++ trunk/www/docs/reference-feed.html Tue Mar 11 22:48:14 2008
@@ -31,7 +31,7 @@
<li><tt class="literal">/rdf:RDF/rdf:channel</tt></li>
</ul>
</div>
-<a name="id303346"></a><table class="tip" border="0" summary="">
+<a name="id303323"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">This element
always exists, although it may be an empty dictionary.</td></tr>
</table>
Modified: trunk/www/docs/reference-headers.html
==============================================================================
--- trunk/www/docs/reference-headers.html (original)
+++ trunk/www/docs/reference-headers.html Tue Mar 11 22:48:14 2008
@@ -27,7 +27,7 @@
</div>
<div></div>
</div>
-<a name="id315283"></a><table class="tip" border="0" summary="">
+<a name="id315261"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">headers</tt> will only be present if the feed was
retrieved from a web server. If the feed was parsed from a local file
or from a string in memory, <tt class="varname">headers</tt> will not
be present.</td></tr>
Modified: trunk/www/docs/reference-href.html
==============================================================================
--- trunk/www/docs/reference-href.html (original)
+++ trunk/www/docs/reference-href.html Tue Mar 11 22:48:14 2008
@@ -28,7 +28,7 @@
<div></div>
</div>
<p>If the feed was redirected from the original requested address, <tt
class="varname">href</tt> will contain the final (redirected) address.</p>
-<a name="id315102"></a><table class="tip" border="0" summary="">
+<a name="id315080"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">href</tt> will only be present if the feed was
retrieved from a web server. If the feed was parsed from a local file
or from a string in memory, <tt class="varname">href</tt> will not be present.</td></tr>
Modified: trunk/www/docs/reference-modified.html
==============================================================================
--- trunk/www/docs/reference-modified.html (original)
+++ trunk/www/docs/reference-modified.html Tue Mar 11 22:48:14 2008
@@ -28,7 +28,7 @@
<div></div>
</div>
<p>The purpose of <tt class="varname">modified</tt> is explained more
fully in <a href="http-etag.html" title="ETag and Last-Modified
Headers">ETag and Last-Modified Headers</a>.</p>
-<a name="id315229"></a><table class="tip" border="0" summary="">
+<a name="id315207"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">modified</tt> will only be present if the feed was
retrieved from a web server, and only if the web server provided a
Last-Modified <acronym title="Hypertext Transfer
Protocol">HTTP</acronym> header for the feed. If the feed was parsed
from a local file or from a string in memory, <tt
class="varname">modified</tt> will not be present.</td></tr>
Modified: trunk/www/docs/reference-namespaces.html
==============================================================================
--- trunk/www/docs/reference-namespaces.html (original)
+++ trunk/www/docs/reference-namespaces.html Tue Mar 11 22:48:14 2008
@@ -27,11 +27,11 @@
</div>
<div></div>
</div>
-<a name="id314877"></a><table class="note" border="0" summary="">
+<a name="id314854"></a><table class="note" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/note.png" alt="Note" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">The prefixes
listed in the <tt class="varname">namespaces</tt> dictionary may not
match the prefixes defined in the original feed. See <a
href="namespace-handling.html" title="Namespace Handling">Namespace
Handling</a> for more details.</td></tr>
</table>
-<a name="id314896"></a><table class="tip" border="0" summary="">
+<a name="id314873"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">This element
always exists, although it may be an empty dictionary if the feed does
not define any namespaces (such as an <acronym title="Rich Site
Summary">RSS</acronym> 2.0 feed with no extensions).</td></tr>
</table>
Modified: trunk/www/docs/reference-status.html
==============================================================================
--- trunk/www/docs/reference-status.html (original)
+++ trunk/www/docs/reference-status.html Tue Mar 11 22:48:14 2008
@@ -30,7 +30,7 @@
<p>If the feed was redirected from its original <acronym
title="Uniform Resource Locator">URL</acronym>, <tt
class="varname">status</tt> will contain the redirect status code, not
the final status code.</p>
<p>If <tt class="varname">status</tt> is <tt
class="constant">301</tt>, the feed was permanently redirected to a new
<acronym title="Uniform Resource Locator">URL</acronym>. Clients
should update their address book to request the new <acronym
title="Uniform Resource Locator">URL</acronym> from now on.</p>
<p>If <tt class="varname">status</tt> is <tt
class="constant">410</tt>, the feed is gone. Clients should stop
polling the feed.</p>
-<a name="id315047"></a><table class="tip" border="0" summary="">
+<a name="id315024"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">
<tt class="varname">status</tt> will only be present if the feed was
retrieved from a web server. If the feed was parsed from a local file
or from a string in memory, <tt class="varname">status</tt> will not be present.</td></tr>
Modified: trunk/www/docs/reference-version.html
==============================================================================
--- trunk/www/docs/reference-version.html (original)
+++ trunk/www/docs/reference-version.html Tue Mar 11 22:48:14 2008
@@ -66,7 +66,7 @@
<dd><a href="http://diveintomark.org/archives/2004/04/14/hot-rss">Hot
<acronym title="Rich Site Summary">RSS</acronym></a></dd>
</dl></div>
<p>If the feed type is completely unknown, <tt
class="varname">version</tt> will be an empty string.</p>
-<a name="id314831"></a><table class="tip" border="0" summary="">
+<a name="id314809"></a><table class="tip" border="0" summary="">
<tr><td rowspan="2" align="center" valign="top" width="1%"><img
src="images/tip.png" alt="Tip" title="" width="24" height="24"></td></tr>
<tr><td colspan="2" align="left" valign="top" width="99%">This element
always exists, although it may be an empty string if the version can
not be determined.</td></tr>
</table>
Modified: trunk/www/docs/reference.html
==============================================================================
--- trunk/www/docs/reference.html (original)
+++ trunk/www/docs/reference.html Tue Mar 11 22:48:14 2008
@@ -113,7 +113,7 @@
<li><span class="section"><a href="reference-feed-textinput.html#reference.feed.textinput.link">feed.textinput.link</a></span></li>
<li><span class="section"><a href="reference-feed-textinput.html#reference.feed.textinput.name">feed.textinput.name</a></span></li>
<li><span class="section"><a href="reference-feed-textinput.html#reference.feed.textinput.description">feed.textinput.description</a></span></li>
-<li><span class="section"><a
href="reference-feed-textinput.html#id307430">Annotated example</a></span></li>
+<li><span class="section"><a
href="reference-feed-textinput.html#id307408">Annotated example</a></span></li>
</ul>
</li>
<li>