How to tell if SpamAssassin Bayasian filtering is working
Philip Zeigler
philip at zeiglers.net
Tue Sep 26 17:50:30 IST 2006
Run the --lint test to see how many ham and spam messages it has.
Spamassassin needs to have at least 200 spam and 200 ham in bayes before it
considers the bayes to be trained. It ignores the bayes results until then.
Philip
-----Original Message-----
From: mailscanner-bounces at lists.mailscanner.info
[mailto:mailscanner-bounces at lists.mailscanner.info] On Behalf Of Jonathan B.
Bayer
Sent: Tuesday, September 26, 2006 11:15 AM
To: MailScanner discussion
Subject: How to tell if SpamAssassin Bayasian filtering is working
Hello MailScanner,
I've enabled Bayasian filtering with SpamAssassin. I've been feeding missed
spam to Spamassassin by hand, and I get reports that it is adding tokens
from the scanned files.
However, it doesn't seem to have affected the results I'm getting. I've
looked both at spam and non-spam, and don't see any reference in the headers
to any sort of Bayasian scanning.
So I'm wondering if it is really doing the Bayasian scanning or not.
I've reproduced all the relevent information below (I think). Any ideas or
suggestions are welcome.
Thanks in advance.
JBB
I have the following relevant lines in my MailScanner.conf file:
Spam Checks = yes
Use SpamAssassin = yes
Wait During Bayes Rebuild = no
SpamAssassin Site Rules Dir = /etc/mail/spamassassin
SpamAssassin Site Rules Dir = /etc/mail/spamassassin
SpamAssassin Local Rules Dir =
SpamAssassin Local State Dir = # /var/lib
SpamAssassin Default Rules Dir =
Debug SpamAssassin = no
There is a soft link in place:
/etc/mail/spamassassin/mailscanner.cf ->
/etc/MailScanner/spam.assassin.prefs.conf
The file spam.assassin.prefs.conf contains the following:
dns_available yes
use_bayes 1
bayes_path /var/MailScanner/bayes/bayes
score BAYES_00 -15.0
score BAYES_05 -5.0
score BAYES_95 5.0
score BAYES_99 15.0
bayes_auto_learn 1
bayes_ignore_header bayerfamily.net-MailScanner
bayes_ignore_header bayerfamily.net-MailScanner-SpamCheck
bayes_ignore_header bayerfamily.net-MailScanner-SpamScore
bayes_ignore_header bayerfamily.net-MailScanner-Information
bayes_auto_expire 0
lock_method flock
use_auto_whitelist 0
use_dcc 0
use_pyzor 0
envelope_sender_header X-MailScanner-From
header FRIEND_GREETINGS Subject =~ /you have an E-Card from/i
describe FRIEND_GREETINGS Nasty E-card from FriendGreetings.com
score FRIEND_GREETINGS 100.0
header FRIEND_GREETINGS2 Subject =~ /you have a greeting card from/i
describe FRIEND_GREETINGS2 Nasty E-card from FriendGreetings.com
score FRIEND_GREETINGS2 100.0
And /var/MailScanner/bayes contains:
-rw------- 1 root root 230 Sep 25 04:03 bayes.mutex
-rw------- 1 root root 49152 Sep 25 04:03 bayes_seen
-rw------- 1 root root 360448 Sep 25 04:03 bayes_toks
JBB
---
Jonathan B. Bayer
mailto:jbayer at bayerfamily.net
--
MailScanner mailing list
mailscanner at lists.mailscanner.info
http://lists.mailscanner.info/mailman/listinfo/mailscanner
Before posting, read http://wiki.mailscanner.info/posting
Support MailScanner development - buy the book off the website!
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
More information about the MailScanner
mailing list