On article <
5ogrgg99d7l9nucvu...@4ax.com>, Marco Old
wrote (at least in part):
> So I train an email as Spam and those words get into the
> classificaiton for Spam messages and then on the next email, I train
> the email as not Spam and those words are removed from the
> classification. Then the next email is not considered Spam.
>
>
Maybe you're no "training" SpamHalter correctly. There's a big
difference between selecting one or more misclassified messages and
MOVING it to the Suspicious or junk mail folder, and picking
Spamhalter classification > Train message(s) as Spam from the menu.
The same applies the other way around, that is, MOVING message(s)
from the Suspicious or junk mail folder to any other folder is much
more effective than Train message(s) as Not-Spam. There's a technical
explanation for each method but in a nutshell it's how it works.
OTOH if it is not your case you may benefit of SpamHalter's database
cleaning which will remove deprecated data from corpus. Pick it from
Tools > Spam and content controls > Spamhalter... > Cleanup...
My current Spamhalter training strategy and settings:
(*) Train on classification errors only (smaller database)
( ) Train always (larger database, self-trained) <- no need if you're
run standalone
or on small LAN.
Spam level (%): 50 Not-spam boost: 1
SpamHalter has been running flawlessly here since version 1.0 with
these settings.
--
Kind regards,
Euler German
Please, reply preferably to the list.
Reply-To: partially ROT13, invalid=com
Due to spam I'm filtering-out GoogleGroups. Sorry. :(