Server ops can go ahead and upgrade. The release notes are
here.
The upgrade command is: sudo /opt/cartulary/bin/upgrade.sh
The primary focus of this release is on improvements to article content
extraction and workarounds for problem feeds in the aggregator. Since we
added a new article extraction engine last time, the number of articles
we are able to extract improved. But, even so, the quality was
sometimes poor. I spent a lot of time regex'ing and css'ing to get the
format of the articles as presentable as possible.
The goal, in general, has always been to take any document and pare it
down to it's most basic structure. That means, no DIV's, no html
attributes, no css, etc. Just the most vanilla, basic html you can imagine. Then we build it back up with just enough styling to make it easy to read. I think you'll be very happy with it.
Lots of time spent working around crappy XML feed shenanigans this time too. I was able to get the feed parsing error rate way down on my server. We're now in the low single digit percentages of feeds that still aren't parsing, out of well over 1000 feeds on my box. And most of those are just flat garbage, so no hope for consuming them.
I try to follow the "be liberal in what you consume, and conservative in what you produce" philosophy. In that vein we're going to make every effort to get something out of a feed if at all humanly (and programmatically) possible.
DJ