--
You received this message because you are subscribed to the Google Groups "nltk-users" group.
I've had some success with the Megam option, although configuring Megam never seems smooth (perhaps that is just my own problem!).
MaxEnt is going to be slower than Naive Bayes to train: you are trading speed for a more sophisticated model that does not assume the features are independent of one another.
Richard (M)
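To make the speed trade-off concrete, here is a minimal sketch that trains both classifiers on the same data and times each one. The toy feature sets, the helper name compare_training_time, and the max_iter value are assumptions made up for illustration; they are not from the dataset discussed in this thread, and timings on six examples say nothing about a real corpus.

```python
import time

from nltk.classify import MaxentClassifier, NaiveBayesClassifier

# Hypothetical toy data: (featureset, label) pairs in the format NLTK expects.
TRAIN_SET = [
    ({"has_good": True, "has_bad": False}, "pos"),
    ({"has_good": True, "has_bad": False}, "pos"),
    ({"has_good": False, "has_bad": True}, "neg"),
    ({"has_good": False, "has_bad": True}, "neg"),
    ({"has_good": True, "has_bad": True}, "pos"),
    ({"has_good": False, "has_bad": False}, "neg"),
]


def compare_training_time(train_set, max_iter=10):
    """Train Naive Bayes and MaxEnt (GIS) on train_set and time each one."""
    timings = {}

    start = time.perf_counter()
    nb = NaiveBayesClassifier.train(train_set)
    timings["naive_bayes"] = time.perf_counter() - start

    # GIS is iterative, so the cost grows with max_iter; trace=0 silences
    # the per-iteration log output.
    start = time.perf_counter()
    maxent = MaxentClassifier.train(
        train_set, algorithm="gis", trace=0, max_iter=max_iter
    )
    timings["maxent_gis"] = time.perf_counter() - start

    return nb, maxent, timings


if __name__ == "__main__":
    nb, maxent, timings = compare_training_time(TRAIN_SET)
    for name, seconds in timings.items():
        print(f"{name}: {seconds:.4f} s")
```

Passing your own list of (featureset, label) pairs in place of TRAIN_SET should show how the gap changes with data size.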
On further investigation, I think Correa had it right. If I run
outside of Eclipse, I don't run out of memory even on the larger
dataset.
So I think we can close this as an Eclipse problem, not an NLTK problem.
It's actually still a bit puzzling. When I run via the command line,
Python takes up about 126 MB on the larger dataset using GIS.
Eclipse's footprint, in the meantime, is 438 MB, and it is supposedly
allocating a gigabyte of space, so there should be plenty of headroom.
Lesson learned: Eclipse is not the right environment for actually
running code. I'll have to find a better interactive environment than
the Windows shell, though.
Cheers,
Dave
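For anyone who wants to compare memory use between the command line and an IDE, here is a minimal sketch that uses the standard-library tracemalloc module to record the peak memory allocated while a function runs. The helper name peak_memory_mb is made up for illustration. Note that tracemalloc only counts allocations made through Python's allocators, so the number will generally be lower than the whole-process footprint that the operating system reports.

```python
import tracemalloc


def peak_memory_mb(func, *args, **kwargs):
    """Run func and return (result, peak traced memory in MB)."""
    tracemalloc.start()
    try:
        result = func(*args, **kwargs)
        # get_traced_memory() returns (current, peak) in bytes.
        _, peak_bytes = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak_bytes / (1024 * 1024)


if __name__ == "__main__":
    # Stand-in workload; replace the lambda with a call that trains a classifier.
    data, peak = peak_memory_mb(lambda: [0] * 1_000_000)
    print(f"peak traced memory: {peak:.1f} MB")
```

Running the same script from the shell and from inside the IDE should show whether the Python side of the workload differs at all, or whether the limit is being hit elsewhere.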