Most of my last works on Amara now are not about xml processing but
related to html (web scraping). Other libs are doing more helpers
about parsing html (lxml with cssselects, iterlinks, sourceline, ...)
and are other aproaches to extract info (pyparsing) and to access it
(mechanize, selenium). What is your experience?
Should we let amara as it is now, a very good generic xml parser and
use other libs for more specialized works? Or transform Amara into a
Big Data ready lib in order to extract / analize / visualize
unstructured data?
Saludos,
-- luismiguel (@lmorillas)
My only use has been to generate XML, it was the most sensible
Python library I found.
One mans view.
regards
--
Dave Pawson
XSLT XSL-FO FAQ.
Docbook FAQ.
http://www.dpawson.co.uk
Dear all,
Before responding to Luis's question, I would like to echo something said by Sylvain in the "What is the status of Amara?" thread: it's quite difficult to convince management that Amara is the right tool for the job when it's been in alpha since January 2009. To respond to Luis's concerns about Amara's uptake, I feel reasonably certain that other potential users and contributors are also hurt by the dynamic outlined above. I know we would have used another product if it did a similar job and wasn't in alpha.
From this experience, I strongly encourage providing stable Amara releases more often. If I could be so bold, I think that planning these releases on a product roadmap would be a great way to accomplish this. Following Luis's comments, I could imagine that Amara 2.0 will include the core engine and other stable code while Amara 2.1 would include these HTML libraries, and so on. Not only would this roadmap advertise what the product can and will be able to do, it will also encourage community feedback about where the product should go and give users stable software with which to start providing feedback, tests, and bugfixes in order to enhance the functionality.
Amara is an absolutely phenomenal tool and clearly fills a gap that is not filled by any other XML tool I have seen. It deserves to have a huge community behind it, and with a little bit of website and process tweaking, this project really could take off.
On Mon, Apr 23, 2012 at 11:56 AM, Pigneri, Rocco <RPig...@lavastorm.com> wrote:Dear all,
Before responding to Luis's question, I would like to echo something said by Sylvain in the "What is the status of Amara?" thread: it's quite difficult to convince management that Amara is the right tool for the job when it's been in alpha since January 2009. To respond to Luis's concerns about Amara's uptake, I feel reasonably certain that other potential users and contributors are also hurt by the dynamic outlined above. I know we would have used another product if it did a similar job and wasn't in alpha.
From this experience, I strongly encourage providing stable Amara releases more often. If I could be so bold, I think that planning these releases on a product roadmap would be a great way to accomplish this. Following Luis's comments, I could imagine that Amara 2.0 will include the core engine and other stable code while Amara 2.1 would include these HTML libraries, and so on. Not only would this roadmap advertise what the product can and will be able to do, it will also encourage community feedback about where the product should go and give users stable software with which to start providing feedback, tests, and bugfixes in order to enhance the functionality.
Amara is an absolutely phenomenal tool and clearly fills a gap that is not filled by any other XML tool I have seen. It deserves to have a huge community behind it, and with a little bit of website and process tweaking, this project really could take off.
You're certainly right about all that, and all the stuff you mention is stuff that I've always wanted to do, but I have so much going on that contemplating it all is nearly hopeless. I do appreciate that folks such as Luis and Werner are really digging in to help, because that puts a far more direct sort of pressure on me to find the time to support them. The general idea of growing Amara's community is a nice one, but the simple, awkward fact is that such an idea alone isn't quite enough for me. I use Amara all the time in work (nice to be my own manager ;) ), and it generally gives me an edge in getting stuff done, so I find myself needing additional motivation to push it to the next level in order to make *other* managers happy.I guess my point in saying the above is that maybe the current flurry of activity will help. Certainly I don't want to promise any more than a specific focus at a time. Test and docs are the biggest issue with Amara right now, and incidentally, if we can get tests and docs much improved, I think we can easily make it beta status. And then maybe we can look at the next in line of the problems you mention.