Stops after RCVD_IN_BL_SPAMCOP_NET

Kai Schaetzl maillists at conactive.com
Sat Jan 10 09:31:16 GMT 2009


Joe Garvey wrote on Fri, 9 Jan 2009 15:03:33 -0800:

> There are a few other
> rules that hit over 45,000 but it drops dramatically after that with
> most rules only being hit with an average of 5,000.

this is absolutely normal. If all hits where hitting each spam we could reduce the
number of SA rules to 20. If you are using extra rulesets you may assess them this
way and decide if they are (still) worth it.

With RCVD_IN_BL_SPAMCOP_NET
> having such as high hit count compared to everything else it really
> makes me wonder why no other rules are getting hit as much as it is.

because rules like spamcop and spamhaus are best used at MTA level to spare your 
MS/SA a lot of processing.

> 
> required     112,503   8,110   7.2   104,393   92.8
> DCC_CHECK  Listed in DCC (http://rhyolite.com/anti-spam/dcc)  86,708  1,066  1.2  85,642  98.8
> autolearn=spam   84,906  0  0  84,906  100
> RCVD_IN_BL_SPAMCOP_NET  Received via a relay in bl.spamcop.net  74,756  256  0.3  74,500  99.7
> BAYES_99  Bayesian spam probability is 99 to 100%  73,555  87  0.1  73,468  99.9
> URIBL_JP_SURBL  Contains an URL listed in the JP SURBL blocklist  66,847  40  0.1  66,807  99.9
> URIBL_SBL  Contains an URL listed in the SBL blocklist  64,011  15  0  63,996  100
> URIBL_SBLXBL  Contains a URL listed in the SBL/XBL blocklist  59,950  13  0  59,937  100
> URIBL_AB_SURBL  Contains an URL listed in the AB SURBL blocklist  57,969  72  0.1  57,897  99.9
> HTML_MESSAGE  HTML included in message  57,796  5,932  10.3  51,864  89.7
> URIBL_OB_SURBL  Contains an URL listed in the OB SURBL blocklist  54,305  28  0.1  54,277  99.9
> URIBL_WS_SURBL  Contains an URL listed in the WS SURBL blocklist  46,946  18  0  46,928  100
> RAZOR2_CHECK  Listed in Razor2 (http://razor.sf.net)  46,385  227  0.5  46,158  99.5
> RAZOR2_CF_RANGE_51_100  Razor2 gives confidence level above 50%  45,793  188  0.4  45,605  99.6
> RCVD_IN_XBL  Received via a relay in Spamhaus XBL  44,779  2  0  44,777  100
> DIGEST_MULTIPLE  Message hits more than one network digest check  40,121  50  0.1  40,071  99.9

This is all very well.

> Here is the values from sa-learn --dump magic
> 0.000          0          3          0  non-token data: bayes db version
> 0.000          0       6493          0  non-token data: nspam
> 0.000          0        847          0  non-token data: nham
> 0.000          0     207718          0  non-token data: ntokens
> 0.000          0 1231449300          0  non-token data: oldest atime
> 0.000          0 1231541795          0  non-token data: newest atime
> 0.000          0 1231541368          0  non-token data: last journal sync atime
> 0.000          0 1231519200          0  non-token data: last expiry atime
> 0.000          0      86400          0  non-token data: last expire atime delta
> 0.000          0       1792          0  non-token data: last expire reduction count

this is all very well, except that you are slashing your bayes db each day, your
latest token is from one day ago. I wouldn't that.

Kai

-- 
Kai Schätzl, Berlin, Germany
Get your web at Conactive Internet Services: http://www.conactive.com





More information about the MailScanner mailing list