jiuren
Nov 2, 2010, 7:38:12 AM
to DigitalPebble
Hi all,
I'm working on a multilabel classification problem and have split it
into several smaller tasks. To build the corpus for each task, I need
to change the labels of some documents in an existing corpus rather
than start over. As you know, building a corpus from raw files is
time-consuming, especially for CJK languages, which have to be
segmented first.
However, there is no direct API for this. I've read part of the
source code and found a setLabel() method that is not public. Can I
simply make that method public and call it from user code? Would
that have any side effects?
Another question:
If I use multiFieldDocument, can I use some fields only to store
metadata, so that they do not affect the classification? For example,
in Lucene I can mark a field as unindexed.