Thanks for your note. I'm pleasantly surprised you dug through the RA
paper and noticed TextTickle. While we never did release that, it was
mostly a toy example for our own understanding of how such a system
should work.
A lot of those ideas did make it into ScalaNLP in the stage package in
a possibly cleaner way, with some examples in the topic modeling
toolbox http://nlp.stanford.edu/software/tmt/. The pipelines for
turning a CSV or TSV file with text fields into a dataset are all part
of scalanlp, while the topic modeling bits are part of TMT. I haven't
hooked up other models to it, but it's a pretty straightforward task.
The stages in the pipeline store a signature of their history, which
allows for nice automatic caching and such.
Mallet is a great piece of software which includes lots of models.
ScalaNLP is designed to be more of a common platform for developers
who want to work with text. We'd love to include more text algorithms
as things mature, possibly in a bazaar type model with external
contributions, but that hasn't been a priority as of yet.
dan
> --
> You received this message because you are subscribed to the Google Groups "ScalaNLP" group.
> To post to this group, send email to scal...@googlegroups.com.
> To unsubscribe from this group, send email to scalanlp+u...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/scalanlp?hl=en.
>
>