SLQ Bayes

Dhawal Doshy dhawal at NETMAGICSOLUTIONS.COM
Thu Jun 9 01:24:59 IST 2005


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "US-ASCII" character set.  ]
    [ Some characters may be displayed incorrectly. ]

Peter Russell wrote:
> Dhawal Doshy wrote:
> 
>> Peter Russell wrote:
>>
>>>>
>>>>
>>>> a mis-interpretation on my part, the BayesStore SQL.pm already uses 
>>>> persistent connections to the database (though user_prefs and 
>>>> probably AWL_SQL doesn't). I have removed this part from the wiki.
>>>>
>>>> - dhawal
>>>
>>>
>>>
>>> Thanks for that.
>>>
>>> Something i have noticed. All sa --lint tests worked fine. I see the 
>>> autolearn=spam/notspam but i dont see bayes getting used during 
>>> message processing :(
>>>
>>> IS there anything else i need to do to make it work?
>>>
>>
>> Did you comment out the following lines in spam.assassin.prefs.conf
>> bayes_path /etc/MailScanner/bayes/bayes
>> bayes_file_mode 0660
>>
>> also you mention --lint worked fine, but what exactly does it say? i 
>> would double-check every required change once again.
>>
>> - dhawal
> 
> 
> Yep added the other lines from your guide, commented out the baove 2 and 
> then did a lint using that config file and it appears to work. Nothing 
> in lint is marked as failed. I wonder how the username bit works, 
> because if i use the lint test in mailwatch i get a different result as 
> it appears as though is using the apache credentials to connect, which 
> have no access and therefore i get a <200 spams error. SHould the 
> username in the sql DB security be *@localhost or *@remotehost ?
> 
> debug: bayes: Using username: root
> debug: bayes: Database connection established
> debug: bayes: found bayes db version 3
> debug: bayes: Using userid: 6
> <snip>
> 
> debug: running uri tests; score so far=0.126
> debug: bayes corpus size: nspam = 19137, nham = 35279
> debug: tokenize: header tokens for *F = "U*ignore 
> D*compiling.spamassassin.taint.org D*spamassassin.taint.org D*taint.org 
> D*org"
> debug: tokenize: header tokens for *m = "  1118275789 lint_rules "
> debug: tokenize: header tokens for *RT = " "
> debug: tokenize: header tokens for *RU = " "
> debug: bayes: tok_get_all: Token Count: 20
> debug: bayes token 'somewhat' => 0.0798823885826036
> debug: bayes: score = 0.36410136776969
> debug: registering glue method for check_uridnsbl 
> (Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x9592ab8))
> debug: Razor2 is available
> <snip>
> 
> debug: 
> tests=BAYES_40,DCC_CHECK,MISSING_HEADERS,MISSING_SUBJECT,NO_REAL_NAME
> 

Here 'localhost' ought to be changed to the server_name that is hosting 
your database (make sure the server_name is resolvable either via dns or 
/etc/hosts), unless the database and SA are on the same server.

bayes_store_module Mail::SpamAssassin::BayesStore::SQL
bayes_sql_dsn DBI:mysql:sa_bayes:localhost <== change this parameter
bayes_sql_username sa_user
bayes_sql_password sa_password

Also what does a MailScanner debug for SA have to report?

- dhawal

------------------------ MailScanner list ------------------------
To unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
'leave mailscanner' in the body of the email.
Before posting, read the Wiki (http://wiki.mailscanner.info/) and
the archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).

Support MailScanner development - buy the book off the website!




More information about the MailScanner mailing list