Hi Ismail,
> What is the number of ham/spam messages for higher detection and lower FP?
More training should generally make for higher detection rates.
Roughly equivalent number of ham/spam trains should make for best
accuracy (YMMV). How much training is needed for acceptable results is
largely dependent on your environment (single-user environments are
least demanding; larger environments with more varied mail flow will
require more training). Default behaviour is not to try classify
messages until 200 learns are performed (it's a configurable setting).
> And when should we add message to fuzzy storage?
Fuzzy is useful for matching attachments & nearly-identical messages.
> contains less tokens than required for bayes classifier: 10 < 11
Message is too short to be classified by bayes. This is governed by
configuration - you can reduce number of required tokens but accuracy
will suffer.
> skipped for bayes classifier: already in class spam; probability 99.92
Message already has high spam probability so training was skipped to
avoid over-training.
Best,
-AL.