[ANNOUNCE] TagSoup 1.2.1 released at long, long, LONG last

John Cowan

Jul 7, 2011, 8:26:07 PM
to tagsoup...@googlegroups.com
TagSoup 1.2.1 is a very belated bug fix release. Here are the changes:

* Match DOCTYPE case-blind
* Extend PushbackReader's size for oddball cases like & followed by CR
* Leo Sutic's 2x-4x speedup by precompiling HTMLScanner table
* Daniel Janus's fix for ]] in CDATA sections
* Remove bogus newline after printing children of the root element
* Allow noscript element anywhere, same as script
* Updated to 2011 edition of W3C character entities
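
To illustrate the PushbackReader item above: the sketch below is not TagSoup's
code. It is a minimal stand-alone demonstration, using only java.io, of why a
pushback buffer of the default size (one char) is too small when a scanner has
looked ahead by two characters, such as & followed by CR. The class and method
names (PushbackDemo, canUnread) are hypothetical.

```java
import java.io.IOException;
import java.io.PushbackReader;
import java.io.StringReader;

class PushbackDemo {
    // Returns true if `count` characters of lookahead can be pushed back
    // onto a PushbackReader whose pushback buffer holds `bufferSize` chars.
    static boolean canUnread(int bufferSize, int count) {
        try (PushbackReader r =
                 new PushbackReader(new StringReader("&\rx"), bufferSize)) {
            char[] seen = new char[count];
            int n = r.read(seen, 0, count); // consume the lookahead
            r.unread(seen, 0, n);           // try to push all of it back
            return true;
        } catch (IOException e) {
            // PushbackReader reports an overflow as an IOException
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(canUnread(1, 2)); // prints false
        System.out.println(canUnread(4, 2)); // prints true
    }
}
```

How large TagSoup's buffer actually is after this change is not stated here;
the sizes 1 and 4 are chosen only for the demonstration.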

I trust this makes all of you, at least the ones who haven't given up on
TagSoup and me completely, very happy. There will be a TagSoup 1.3 with
(I hope) the following improvements:

* Consider having elements that suppress reopening
* Handle namespace declarations semi-properly
* Squelch namespaces better in --nons mode
* Clean systemids as well as publicids
* Deal with zero or negative length ids in DOCTYPE declarations
* Fix duplicate attributes to use the last value, not the first
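
As a rough picture of the last item (duplicate attributes taking the last
value rather than the first), here is a stand-alone sketch. It is not
TagSoup's code and does not use TagSoup's attribute classes; DuplicateAttrs
and collect are hypothetical names, and a LinkedHashMap stands in for the
attribute list.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class DuplicateAttrs {
    // Collects name/value pairs in document order. When a name repeats,
    // lastWins=true keeps the later value; lastWins=false keeps the earlier.
    static Map<String, String> collect(String[][] pairs, boolean lastWins) {
        Map<String, String> attrs = new LinkedHashMap<>();
        for (String[] p : pairs) {
            if (lastWins) {
                attrs.put(p[0], p[1]);         // overwrite any earlier value
            } else {
                attrs.putIfAbsent(p[0], p[1]); // ignore later duplicates
            }
        }
        return attrs;
    }

    public static void main(String[] args) {
        String[][] pairs = { {"class", "a"}, {"id", "x"}, {"class", "b"} };
        System.out.println(collect(pairs, false).get("class")); // prints a
        System.out.println(collect(pairs, true).get("class"));  // prints b
    }
}
```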

But I'm not going to promise when. In the meantime, share and enjoy!

--
John Cowan co...@ccil.org
"Not to know The Smiths is not to know K.X.U." --K.X.U.

bryan hunt

Jul 8, 2011, 7:04:48 AM
to co...@ccil.org, tagsoup...@googlegroups.com
On Thu, 7 Jul 2011 20:26:07 -0400, John Cowan <co...@mercury.ccil.org> wrote:
> TagSoup 1.2.1 is a very belated bug fix release. Here are the changes:
> [...]
> * Handle namespace declarations semi-properly
> * Squelch namespaces better in --nons mode
> * Clean systemids as well as publicids
> * Deal with zero or negative length ids in DOCTYPE declarations
> * Fix duplicate attributes to use the last value, not the first
>
> But I'm not going to promise when. In the meantime, share and enjoy!

As a recent convert, I'm glad to see this project is alive and kicking -
a quick n dirty gumtree screensaver enabled me to find a new houseshare
and escape a really odd house. Thanks Dude!

Bryan Hunt

Jul 8, 2011, 8:21:05 AM
to co...@ccil.org, tagsoup...@googlegroups.com

s/saver/scraper/


Jukka Zitting

Jul 9, 2011, 5:09:56 AM
to tagsoup...@googlegroups.com
Hi,

On Fri, Jul 8, 2011 at 2:26 AM, John Cowan <co...@mercury.ccil.org> wrote:
> TagSoup 1.2.1 is a very belated bug fix release.

Nice work, thanks!

Are you or someone already working on getting this release up on Maven
Central? I can help with that if needed.

BR,

Jukka Zitting

John Cowan

Jul 9, 2011, 12:28:22 PM
to Jukka Zitting, tagsoup...@googlegroups.com
Jukka Zitting scripsit:

> Are you or someone already working on getting this release up on Maven
> Central? I can help with that if needed.

If anyone is, I don't know about it, so feel free to go ahead, and thanks.

--
John Cowan <co...@ccil.org> http://www.ccil.org/~cowan
Today an interactive brochure website, tomorrow a global content
management system that leverages collective synergy to drive "outside of
the box" thinking and formulate key objectives into a win-win game plan
with a quality-driven approach that focuses on empowering key players
to drive-up their core competencies and increase expectations with an
all-around initiative to drive up the bottom-line. --Alex Papadimoulis

Jukka Zitting

Aug 23, 2011, 5:17:37 AM
to tagsoup-friends
Hi,

On Sat, Jul 9, 2011 at 6:28 PM, John Cowan <co...@mercury.ccil.org> wrote:
> Jukka Zitting scripsit:
>> Are you or someone already working on getting this release up on Maven
>> Central? I can help with that if needed.
>
> If anyone is, I don't know about it, so feel free to go ahead, and thanks.

The 1.2.1 release is now up on Maven Central.

I now have the ability to also upload any future TagSoup releases to
Maven Central (see [1]), and would be happy to keep doing so for the
time being. See also [2] for the build.xml changes I made to simplify
the upload process.

[1] https://issues.sonatype.org/browse/OSSRH-2108
[2] https://github.com/jukka/tagsoup/commit/fb51e4ef8f227499867fce700b60b37d94df3e65

BR,

Jukka Zitting

John Cowan

Aug 29, 2011, 2:51:26 PM
to Jukka Zitting, tagsoup-friends
Jukka Zitting scripsit:

> [2] https://github.com/jukka/tagsoup/commit/fb51e4ef8f227499867fce700b60b37d94df3e65

Should I incorporate this patch into the next release?

--
John Cowan http://ccil.org/~cowan co...@ccil.org
Monday we watch-a Firefly's house, but he no come out. He wasn't home.
Tuesday we go to the ball game, but he fool us. He no show up. Wednesday he
go to the ball game, and we fool him. We no show up. Thursday was a
double-header. Nobody show up. Friday it rained all day. There was no ball
game, so we stayed home and we listened to it on-a the radio. --Chicolini

Jukka Zitting

Sep 17, 2011, 5:40:41 AM
to tagsoup-friends
Hi,

On Mon, Aug 29, 2011 at 8:51 PM, John Cowan <co...@mercury.ccil.org> wrote:
> Jukka Zitting scripsit:
>> [2] https://github.com/jukka/tagsoup/commit/fb51e4ef8f227499867fce700b60b37d94df3e65
>
> Should I incorporate this patch into the next release?

The change is basically only needed when you're uploading a release to
the Maven repository, so it's not that essential to the average user.
And it does come with the extra dependency to the Maven Ant tasks, so
it does complicate the default build setup a bit. Thus I think it's
best if I for now keep it simply as a patch against the official
source tree.

If you or someone else also wants to take care of the Maven repository
uploads then having the extra goals in the official build would of
course be useful.

BR,

Jukka Zitting

John Cowan

Sep 18, 2011, 2:30:30 PM
to Jukka Zitting, tagsoup-friends
Jukka Zitting scripsit:

> The change is basically only needed when you're uploading a release to
> the Maven repository, so it's not that essential to the average user.
> And it does come with the extra dependency to the Maven Ant tasks, so
> it does complicate the default build setup a bit.

Which is already complex enough given that you have to download the
proper XSLT engine and install it.

> Thus I think it's best if I for now keep it simply as a patch against
> the official source tree.

Sounds right. Thanks.

--
There are three kinds of people in the world:   John Cowan
those who can count,                            co...@ccil.org
and those who can't.
