Quick 'bayes' question

Scott Silva ssilva at SGVWATER.COM
Wed Jun 29 00:40:18 IST 2005


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "US-ASCII" character set.  ]
    [ Some characters may be displayed incorrectly. ]

Jason Williams spake the following on 6/28/2005 3:57 PM:
> Well im back. This time, a question on bayes.
> 
> I've been working to get bayes setup and running properly (and I don't
> think bayes has evern been setup to work properly to be honest).
> 
> First, in my spam.assassin.prefs.conf file, I have
> 
> use_bayes 1
> bayes_patch       /usr/local/etc/MailScanner/bayes/
> bayes_file_mode 0660
> 
> # Bump up SpamAssassin scores on the high and low end
> # score BAYES_00 -15.0
> # score BAYES_05 -5.0
> # score BAYES_95 5.0
> # score BAYES_99 15.0
> 
> # To disable bayes autolearn
> # bayes_auto_learn 0
> 
> Just trying to make sure I have the basics setup.
> 
> I ran --lint, it found the bayes DB no problem. However, when I look in
> the bayes directory, I see a bunch of files that look like this:
> 
> _toks.expire98xxx  different numbers at the end.
> 
> As I was reading over the site, it recommened to do a dump and look at
> the magic. Well here it is:
> 
> 0.000          0          3          0  non-token data: bayes db version
> 0.000          0          0          0  non-token data: nspam
> 0.000          0          2          0  non-token data: nham
> 0.000          0         43          0  non-token data: ntokens
> 0.000          0 1083442244          0  non-token data: oldest atime
> 0.000          0 1083446498          0  non-token data: newest atime
> 0.000          0          0          0  non-token data: last journal
> sync atime
> 0.000          0          0          0  non-token data: last expiry atime
> 0.000          0          0          0  non-token data: last expire
> atime delta
> 0.000          0          0          0  non-token data: last expire
> reduction count
> 
> Reading over the wiki site, there are a lot of things going on with the
> bayes system.
> First question I have is that if I want to train the bayesian learning
> system (or even to rebuild it) would I just point it to the quarantine
> directory? Seems logical.
> 
> I'm sure im missing something. Been rather long, mind numbing day.
> 
> I appreciate any feedback.
> 
> Jason
> 
Either you dumped the wrong database, or this one has very little in it.
Try sa-learn --dump magic --dbpath /path/to/bayes/bayes
Should be the bayes db path in spamassassin.prefs.conf.

Mine has much more data;

0.000          0          3          0  non-token data: bayes db version
0.000          0      29146          0  non-token data: nspam
0.000          0      81693          0  non-token data: nham
0.000          0     124702          0  non-token data: ntokens
0.000          0 1119230907          0  non-token data: oldest atime
0.000          0 1120001312          0  non-token data: newest atime
0.000          0 1119999608          0  non-token data: last journal
sync atime
0.000          0 1119929516          0  non-token data: last expiry atime
0.000          0     691200          0  non-token data: last expire
atime delta
0.000          0      28272          0  non-token data: last expire
reduction count


-- 

/-----------------------\           |~~\_____/~~\__  |
| MailScanner; The best |___________ \N1____====== )-+
| protection on the net!|                   ~~~|/~~  |
\-----------------------/                      ()

------------------------ MailScanner list ------------------------
To unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
'leave mailscanner' in the body of the email.
Before posting, read the Wiki (http://wiki.mailscanner.info/) and
the archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).

Support MailScanner development - buy the book off the website!




More information about the MailScanner mailing list