Stocks and P-R-O-F-I-T
max at assuredata.com
Thu Nov 2 01:39:35 GMT 2006
> I had recently tried what I thought was a good technique, and created
> script that fed all email from every MailScanner white listed email
> address into sa-learn as ham nightly, without doing a check on the
> emails. This was obviously a bad choice as jokes and other spam like
> emails must have processed for months.
> Anyway, I scrapped the bayes database and started from scratch using
> a sample bayes db from FSL (I think it's called). From there I've been
> feeding quite a bit of spam into sa-learn for about a week or two. I'd
> say I've fed about 400 spam mails thus far. However, as of today I'm
> still getting the p-r-o-f-i-t and stock spasm with bayes scores of
> anywhere from 10% to 50%.
>>My question is what kind of stock spams are they?
>>Are they image based, or text based?
>>If it's image, bayes won't help you much, as bayes doesn't understand
> My question is how long or how many emails should it take bayes to
> figure out these spam emails? Is there a way of viewing the progress?
> With the other scores from DCC, Pyzor, Razor, the score is close to
> being tagged as spam, but sometimes it's not quite there because of
> bayes score.
>>for image spams, try adding the SARE stocks ruleset.
Yes, they are partially or sometimes full images.
I have the SARE Stock ruleset installed, the problem is that whether or
not this ruleset is being triggered, bayes is sometimes giving a
negative score. Are you saying bayes cannot be trained to score high on
messages that have images? I would think it would examine the fact that
it's an image and header information, but maybe I'm wrong.
More information about the MailScanner