Check which rules hit
Denis Beauchemin
Denis.Beauchemin at usherbrooke.ca
Tue Jun 30 12:18:49 UTC 2015
Peter, post some log lines and I will make it work for you.
Denis
De : MailScanner [mailto:mailscanner-bounces at lists.mailscanner.info] De la part de Peter Nitschke
Envoyé : 24 juin 2015 08:55
À : mailscanner at lists.mailscanner.info
Objet : RE: Check which rules hit
o;?
Scrap that, I deleted the line with "cached" as my logs don't have that and now I am getting screen output.
However it looks as though deleting the "cached" line is causing errors. Is there a better way to deal with that?
Peter
*********** REPLY SEPARATOR ***********
On 24/06/2015 at 10:13 PM Peter Nitschke wrote:
Is the output to screen or to a file?
Thanks.
*********** REPLY SEPARATOR ***********
On 22/06/2015 at 5:51 PM Denis Beauchemin wrote:
My log lines look like this:
Jun 22 13:26:16 10.32.103.21 smtps2 MailScanner[47071]: Message t5MHQFDv033375 from 10.32.106.21 (someone at usherbrooke.ca<mailto:someone at usherbrooke.ca>) to usherbrooke.ca is n'est pas un polluriel, SpamAssassin (not cached, score=-6.206, requis 6.5, autolearn=not spam, BAYES_00 -1.90, HTML_MESSAGE 0.00, RDNS_NONE 0.79, UDES_FROM01 -3.00, UDES_FROM02 -0.10, UDES_FROMTO01 -2.00)
Jun 22 13:27:23 10.32.103.28 smtpe1 MailScanner[61090]: Message t5MHR8mw063252 from 64.5.96.10 (someone at alliinclusive.space<mailto:someone at alliinclusive.space>) to usherbrooke.ca is est un polluriel, SpamAssassin (not cached, score=7.776, requis 6.5, autolearn=spam, BAYES_50 0.80, HTML_EXTRA_CLOSE 0.00, HTML_MESSAGE 0.00, HTML_TAG_BALANCE_BODY 1.16, MIME_HTML_ONLY 0.72, MIME_HTML_ONLY_MULTI 0.00, MPART_ALT_DIFF 0.79, RDNS_NONE 0.79, STYLE_GIBBERISH 3.50, T_REMOTE_IMAGE 0.01)
The script does a first grep for the strings in yellow. Thereb third string that matches all ham/spam: un polluriel, SpamAssassin
Then Perl is used to search for lines that match:
1- The yellow string followed by b
2- Then the green string (could also be b
3- Then the blue strings, including the digits, decimal point and minus sign
4- Then the dark grey strings, including the digits, decimal point and minus sign
5- Then the light gray string that could be omitted
6- Then all the rules names and their respective score (could be negative)
At the end the matching rules are printed sorted as requested.
Denis
-----Message d'origine-----
De : MailScanner [mailto:mailscanner-bounces at lists.mailscanner.info] De la part de Peter Nitschke
EnvoyC) : 22 juin 2015 13:21
C mailscanner at lists.mailscanner.info<mailto:mailscanner at lists.mailscanner.info>
Objet : RE: Check which rules hit
This looks interesting.
I edited to English for spam, not spam etc, but when I run it, it just says "processing /var/log/maillog" and is very fast, but I get nothing, no output to screen or file that I can find.
Any suggestions how I can understand it better?
Thanks,
Peter
*********** REPLY SEPARATOR ***********
On 19/06/2015 at 12:26 PM Denis Beauchemin wrote:
>This encoded message has been converted to an attachment.
>
>I created this script a while back just to do that:
>#!/usr/bin/perl -w
>#
># Script that looks through maillog to find all messages tagged as spam
># by MailScanner. It then tallies the different SpamAssassin rules
>that # fired.
># Denis Beauchemin, 20050516
>
>use Getopt::Long;
>
># Where some commands reside:
>my $GREP = "/bin/grep";
>my $GUNZIP = "/bin/gunzip";
>
># Value of "Spam =" in %report-dir%/languages.conf my $isSpamString =
>"est un polluriel, SpamAssassin";
>my $isHamString = "est pas un polluriel, SpamAssassin";
>my $allString = " un polluriel, SpamAssassin";
># Value of "score =" in %report-dir%/languages.conf
>my $scoreString = "score=";
># Value of "required =" in %report-dir%/languages.conf
>my $reqdString = "requis ";
>my $autoString = "autolearn=spam";
>my $cachedString = "cached, ";
>my $nCachedString = "not cached, ";
>
>my $maillog = "/var/log/maillog";
>@maillogs = ();
>
>my $sortByName = 0;
>my $sortByHits = 0;
>my $getHam = 0;
>my $getAll = 0;
>my $help = 0;
>
>GetOptions(
> 'sortbyname|byname' => \$sortByName,
> 'sortbyhits|byhits' => \$sortByHits,
> 'log=s' => \@maillogs,
> 'ham' => \$getHam,
> 'all' => \$getAll,
> 'help' => \$help,
>);
>
>if ( $help ) {
> print '
>This program tallies SpamAssassin\'s rules that were triggered when an
>email was detected as spam by MailScanner.
>
>You can search for ham with the --ham option.
>
>You can search for all SpamAssassin results with the --all option.
>
>By default it sorts the results by rule name. It can also sort them by
>number of hits if called with --sortbyhits (or --byhits).
>
>The option --sortbyname (or --byname) is the default one.
>
>If you don\'t want to use the current maillog, specify a different one
>with --log new-maillog.
>
>All unknown command line parameters will be treated as additional file
>names to process.
>
>It is OK for a log file to be gzipped.
>';
> exit;
>}
>
>push @maillogs, @ARGV;
>@maillogs = ( $maillog ) if ( @maillogs == 0 ); #print "Maillogs:
>@maillogs\n"; #my $searchString = $getHam ? $isHamString :
>$isSpamString; my $searchString; if ( $getAll ) {
> $searchString = "$allString";
>} elsif ( $getHam ) {
> $searchString = "$isHamString";
>} else {
> $searchString = "$isSpamString";
>}
>
>foreach my $maillog ( @maillogs ) {
> print "Processing $maillog...\n";
>
> $sortByName++ if ( ( $sortByName == 0 ) && ( $sortByHits == 0
) );
>
> my $openCmd = "LANG=C $GREP \"$searchString\" $maillog |";
> if ( $maillog =~ /\.gz$/ ) {
> $openCmd = "$GUNZIP -c $maillog | LANG=C $GREP
\"$searchString\"
>|";
> }
> open LOG, "$openCmd" || die "Cannot open $maillog";
>
> while ( <LOG> ) {
> next unless /$searchString
>\((?:$cachedString|$nCachedString)$scoreString[-\d.]+, $reqdStrin
>g[-\d.]+,(?: $autoString,)?(.*)$/;
> my $hits = $1;
> foreach my $hit ( $hits =~ / ([^\s]+) -?[\d.]+(?:,|\))/g ) {
> $hit{$hit}++;
> }
> }
>
> close LOG;
>}
>
>if ( $sortByName ) {
> foreach my $hit ( sort keys %hit ) {
> printf "%27s %5d\n", $hit, $hit{$hit};
> }
>} elsif ( $sortByHits ) {
> foreach my $hit ( sort {$hit{$b}<=>$hit{$a}} keys %hit ) {
> printf "%27s %5d\n", $hit, $hit{$hit};
> }
>}
>
>
>-----Message d'origine-----
>De : MailScanner [mailto:mailscanner-bounces at lists.mailscanner.info]
De
>la part de Peter Nitschke
>EnvoyC) : 19 juin 2015 02:21
>C href="mailto:mailscanner at lists.mailscanner.info">mailscanner at lists.mailscanner.info<mailto:mailscanner at lists.mailscanner.info>
>Objet : Check which rules hit
>
>I have built up a large number of rules for SA to use with MS and many
>are probably now obsolete.
>
>How can I monitor which rules are getting hits?
>
>Thanks.
>
>Peter
>
>
>
>
>--
>MailScanner mailing list
>mailscanner at lists.mailscanner.info<mailto:mailscanner at lists.mailscanner.info>
>http://lists.mailscanner.info/listinfo/mailscanner
>
>
--
MailScanner mailing
>list
mailscanner at lists.mailscanner.info<mailto:mailscanner at lists.mailscanner.info>
http://lists.mailscanner.info/listinfo/mailscanner
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mailscanner.info/pipermail/mailscanner/attachments/20150630/95ecf754/attachment.html>
More information about the MailScanner
mailing list