bayes DB not growing

Martin Hepworth martinh at SOLID-STATE-LOGIC.COM
Wed May 18 09:24:20 IST 2005


Looks like the autolearn threshhold is too low to me.

the defaults in /usr/local/share/spamassassin/10_misc.cf are

bayes_auto_learn_threshold_nonspam   0.1
bayes_auto_learn_threshold_spam              12.0

I'd make sure you're not overriding these values in any of your site
specific rules in /etc/mail/spamassassin or spam.assassin.prefs.conf

--
Martin Hepworth
Snr Systems Administrator
Solid State Logic
Tel: +44 (0)1865 842300


Arif Malik wrote:
>  ok sorry, now that I am using the bayes starter kit thing, I do have
> something like this in my logs:
> BAYES_95 2.06
>
> so that part looks good to me - but now I am just worried about the
> autolearn=spam thing - from what I have read, messages tagged like this
> are fed to the bayes filter as spam... but EVERY one of my messages is
> showing this, even the ones that arent tagged as spam... so I am worried
> the ham messages are being fed to it as well.
>
> -----Original Message-----
> From: MailScanner mailing list [mailto:MAILSCANNER at JISCMAIL.AC.UK] On
> Behalf Of Arif Malik
> Sent: Tuesday, May 17, 2005 9:32 AM
> To: MAILSCANNER at JISCMAIL.AC.UK
> Subject: Re: bayes DB not growing
>
> Thanks, Billy.. I don't even get any mention of BAYES in the point
> scoring part like you do below... im not sure what im missing here, does
> anyone have any suggestions?? here is another log entry - notice it is
> marked as "not spam" but then it has "autolearn=spam". Also it doesn't
> show any Bayes probability entry either as Billy's does.
>
> May 17 08:17:38 filter MailScanner[4736]: Message 1DY3op-0001Fb-G3 from
> 65.249.2
> 45.178 (1-21388026-fitblog.com?reef at stderr.megadealz.net) to aergasg.com
> is not spam, SpamAssassin (score=3.007, required 4, autolearn=spam,
> DNS_FROM_AHBL_RHSBL  0.07, HTML_FONT_INVISIBLE 0.07, HTML_IMAGE_ONLY_24
> 1.00, HTML_MESSAGE 0.00, HTM L_TAG_EXIST_TBODY 0.23,
> RAZOR2_CF_RANGE_51_100 1.49, RAZOR2_CHECK 0.15)
>
> when I am in debug mode I see this also.. should it have some kind of
> entry like BAYES in it?? I used one of those bayes starter kits, so I
> don't get that <200 spam error any more.. but it still doesn't seem to
> be active!
>
> debug: tests=DCC_CHECK,MISSING_HEADERS,MISSING_SUBJECT,NO_REAL_NAME
> debug:
> subtests=__HAS_MSGID,__MSGID_OK_DIGITS,__MSGID_OK_HOST,__SANE_MSGID,__UN
> USABLE_MSGID
>
> -----Original Message-----
> From: MailScanner mailing list [mailto:MAILSCANNER at JISCMAIL.AC.UK] On
> Behalf Of Billy A. Pumphrey
> Sent: Tuesday, May 17, 2005 8:46 AM
> To: MAILSCANNER at JISCMAIL.AC.UK
> Subject: Re: bayes DB not growing
>
> I was going to say also that sometimes there is no autolearn statement,
> like this:
>
> -0.41 BAYES_05 Bayesian spam probability is 1 to 5%
> 0.01 NO_REAL_NAME From: does not include a real name
>
>
> Billy Pumphrey
> IT Manager
> Wooden & McLaughlin
>
>
>>-----Original Message-----
>>From: Arif Malik [mailto:Arifm at TOMASJEWELRY.COM]
>>Sent: Tuesday, May 17, 2005 10:39 AM
>>To: MAILSCANNER at JISCMAIL.AC.UK
>>Subject: Re: bayes DB not growing
>>
>>Hmm - what I am seeing in my logs, is EVERY message shows
>>"autolearn=spam" - even though my bayes DB isn't growing... even
>>messages that aren't tagged as spam are showing it - for example:
>>
>>May 15 07:05:42 filter MailScanner[11331]: Message 1DXJk9-00083F-Tv
>
> from
>
>>209.0.2
>>4.12 (bounce-flnl-45112503 at mx01.gamerival.com) to adggdwe.com is not
>>spam, SpamA ssassin (score=2.733, required 4, autolearn=spam, AWL
>>1.02,
>
> HTML_90_100
>
>>0.19, HT
>>ML_FONT_BIG 0.23, HTML_MESSAGE 0.00, MIME_HEADER_CTYPE_ONLY 0.48,
>>NO_REAL_NAME 0 .18, URIBL_SBL 0.63)
>>
>>shouldn't only messages that are considered spam be showing the
>>"autolearn=spam" ??? I still don't know why the bayes DB only has 1
>
> spam
>
>>in it still, but I will try using sa-learn to feed it 200 spams and
>
> hams
>
>>and see what happens... but I am still wondering about the autolearn
>>behavior, if anyone has any insight.. thanks!
>>
>>-----Original Message-----
>>From: MailScanner mailing list [mailto:MAILSCANNER at JISCMAIL.AC.UK] On
>>Behalf Of Billy A. Pumphrey
>>Sent: Tuesday, May 17, 2005 7:48 AM
>>To: MAILSCANNER at JISCMAIL.AC.UK
>>Subject: Re: bayes DB not growing
>>
>>That is a good link.  On my mailwatch, I look at the spam messages and
>
> I
>
>>do see this a lot:
>>Autolearn=spam
>>
>>I also see that some messages do not have a autolearn= I am guessing
>>that it was not autolearned because from the link, it said that a
>>message needs 3 points from the header and 3 points from the body to
>
> be
>
>>autolearned.
>>
>>When I do a spamassassin -D --lint.  I get:
>>debug: bayes: found bayes db version 3
>>debug: using "/root/.spamassassin" for user state dir
>>debug: bayes: Not available for scanning, only 0 spam(s) in Bayes DB <
>
>
>>200
>>
>>So mine says that there are only 0 spams.  Does this mean that I need
>
> to
>
>>fix something?
>>
>>
>>Billy Pumphrey
>>IT Manager
>>Wooden & McLaughlin
>>
>>>-----Original Message-----
>>>From: Raylund Lai [mailto:raylund.lai at KANKANWOO.COM]
>>>Sent: Tuesday, May 17, 2005 3:59 AM
>>>To: MAILSCANNER at JISCMAIL.AC.UK
>>>Subject: Re: bayes DB not growing
>>>
>>>have a look on this link to see whether it answer your question.
>>>http://wiki.apache.org/spamassassin/AutolearningNotWorking
>>>
>>>Cheers
>>>Raylund
>>>
>>>Arif Malik wrote:
>>>
>>>
>>>>I have a new installation of mailscanner, and for the last few
>
> days
>
>>I
>>
>>>>keep noticing the following message:
>>>>
>>>>debug: bayes: Not available for scanning, only 0 spam(s) in Bayes
>
> DB
>
>><
>>
>>>200
>>>
>>>>now today, it has finally changed to: debug: bayes: Not available
>>
>>for
>>
>>>>scanning, only 1 spam(s) in Bayes DB < 200 but there has been
>
> quite
>
>>>>a few spams that have gone through, and
>>
>>have
>>
>>>>been marked as spam, and i see "autolearn=spam" in the log.
>>
>>shouldn't
>>
>>>>these be added to the bayes DB?? here is the rest of that part of
>>
>>the
>>
>>>>log that deals with bayes:
>>>>
>>>>debug: bayes: 2357 tie-ing to DB file R/O
>>>>/home/exim/.spamassassin/bayes_toks
>>>>debug: bayes: 2357 tie-ing to DB file R/O
>>>>/home/exim/.spamassassin/bayes_seen
>>>>debug: bayes: found bayes db version 3
>>>>debug: bayes: Not available for scanning, only 1 spam(s) in Bayes
>
> DB
>
>><
>>
>>>200
>>>
>>>>debug: bayes: 2357 untie-ing
>>>>debug: bayes: 2357 untie-ing db_toks
>>>>debug: bayes: 2357 untie-ing db_seen
>>>>debug: Score set 1 chosen.
>>>>
>>>>any idea what i might be doing wrong ? it is odd to me that 1
>
> email
>
>>>>did finally end up in the bayes db... thanks for any help!!!
>>>>------------------------ MailScanner list ------------------------
>
>
>>>>To unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
>>>>'leave mailscanner' in the body of the email.
>>>>Before posting, read the Wiki (http://wiki.mailscanner.info/) and
>>>>the archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>>>>
>>>>*Support MailScanner development - buy the book off the website!*
>>>
>>>------------------------ MailScanner list ------------------------
>
> To
>
>>>unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
>>>'leave mailscanner' in the body of the email.
>>>Before posting, read the Wiki (http://wiki.mailscanner.info/) and
>
> the
>
>>>archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>>>
>>>Support MailScanner development - buy the book off the website!
>>
>>------------------------ MailScanner list ------------------------ To
>>unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
>>'leave mailscanner' in the body of the email.
>>Before posting, read the Wiki (http://wiki.mailscanner.info/) and the
>>archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>>
>>Support MailScanner development - buy the book off the website!
>>
>>------------------------ MailScanner list ------------------------ To
>>unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
>>'leave mailscanner' in the body of the email.
>>Before posting, read the Wiki (http://wiki.mailscanner.info/) and the
>>archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>>
>>Support MailScanner development - buy the book off the website!
>
>
> ------------------------ MailScanner list ------------------------ To
> unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
> 'leave mailscanner' in the body of the email.
> Before posting, read the Wiki (http://wiki.mailscanner.info/) and the
> archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>
> Support MailScanner development - buy the book off the website!
>
> ------------------------ MailScanner list ------------------------ To
> unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
> 'leave mailscanner' in the body of the email.
> Before posting, read the Wiki (http://wiki.mailscanner.info/) and the
> archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>
> Support MailScanner development - buy the book off the website!
>
> ------------------------ MailScanner list ------------------------
> To unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
> 'leave mailscanner' in the body of the email.
> Before posting, read the Wiki (http://wiki.mailscanner.info/) and
> the archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).
>
> Support MailScanner development - buy the book off the website!

**********************************************************************

This email and any files transmitted with it are confidential and
intended solely for the use of the individual or entity to whom they
are addressed. If you have received this email in error please notify
the system manager.

This footnote confirms that this email message has been swept
for the presence of computer viruses and is believed to be clean.

**********************************************************************

------------------------ MailScanner list ------------------------
To unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
'leave mailscanner' in the body of the email.
Before posting, read the Wiki (http://wiki.mailscanner.info/) and
the archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).

Support MailScanner development - buy the book off the website!




More information about the MailScanner mailing list