Hi,
Just wondering if any progress has been made on this post? I’ve built a model with similar code structure to Cory but only using MAXENT and SVM. The model builds fine and has a recall accuracy of 89%. When I try to classify new data I get poor or non-working results. The MAXENT model gives 2% recall accuracy and SVM doesn’t work, at all giving the following error:
Error in predict.svm(model, corpus@classification_matrix, prob = TRUE, :
test data does not match model !
My conclusion is that the problem stems from the model DTM having a different column structure (the list of words) to the new DTM. Do these column structures really need to be the same? This seems a bit restrictive if they do.
Does anyone have a working example of using RTextTools to classify a completely new set of data after having built a model?
Thanks for your help,
Mark
--
You received this message because you are subscribed to the Google Groups "rtexttools-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to rtexttools-he...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.