FW: MailWatch Spam Learn / Bayes DB SOLVED
Glenn Steen
glenn.steen at gmail.com
Tue May 13 10:19:05 IST 2008
2008/5/13 Brett Carruthers <bcarruthers at iii.net.au>:
>
>
> -----Original Message-----
> From: Brett Carruthers
> Sent: Tuesday, 13 May 2008 9:14 AM
> To: mailscanner at lists.mailscanner.info
> Subject: RE: MailWatch Spam Learn / Bayes DB
>
>
>
> -----Original Message-----
> From: Glenn Steen [mailto:glenn.steen at gmail.com]
> Sent: Monday, 12 May 2008 5:50 PM
> To: MailScanner discussion
> Subject: Re: MailWatch Spam Learn / Bayes DB
>
> 2008/5/12 Glenn Steen <glenn.steen at gmail.com>:
> > 2008/5/12 Mohammed Alli <malli at mcrirents.com>:
> >
> >
> > >
> > > ________________________________
> > >
> > > From: mailscanner-bounces at lists.mailscanner.info on behalf of
> Brett Carruthers
> > > Sent: Sun 5/11/2008 8:25 PM
> > > To: mailscanner at lists.mailscanner.info
> > > Subject: MailWatch Spam Learn / Bayes DB
> > >
> > >
> > >
> > >
> > >
> > > Hi,
> > >
> > > I have a few week old install of MailScanner / MailWatch / Scalix
> on CentOS 5.1 and my bayes DB is going OK but is still giving me 0%
> chances on some spam mail.
> > >
> > > So I want to train it a bit more so the bayes works even better.
> > >
> > > What do I have to do to get MailWatch to be able to manually
> spam/ham learn on its 'Message Operations' report?
> > >
> > > Currently, if I try and learn spam it gives me an error about the
> message not being in the quarantine eg.
> > >
> > > Message m4BK4Wg4005306 not found in quarantine
> > >
> > > Some settings from MailScanner.conf
> > >
> > > --
> > >
> > > Quarantine dir = /var/spool/MailScanner/quarantine
> > >
> > > Quarantine Infections = yes
> > >
> > > Quarantine Whole Messages As Queue Files = yes
> > >
> > > Quarantine Whole Message = yes
> > >
> > > Spam Actions = store-spam
> > >
> > > High Scoring Spam Actions = delete (don't need to worry about
> these as its already learnt these are Spam!)
> > >
> > > Non Spam Actions = deliver store header "X-Spam-Status: No"
> > >
> > > If anyone could lean me in the right direction I would appreciate
> it very much!
> > >
> > > Thanks,
> > >
> > > Brett
> > >
> > >
> > >
> > >
> > >
> > > Hi Brett,
> > >
> > >
> > >
> > > Try the following to fix your message operation error:
> > >
> > > 9.23 Fix for the Reporting Function in Message Operations
> > >
> > > Change the following in /var/www/mailscanner/do_message_ops.php
> file:
> > >
> > > $id = $Regs[1];
> > >
> > > to
> > >
> > > $id = str_replace("_", ".",$Regs[1]);
> > >
> > > Good Luck
> > >
> > > Mohammed
> > >
> > That IS a good fix, provided you use Postfix, which Brett doesn't
> seem to do.
> > Much more likely that there simply is nothing in the quarantine to
> > learn from. There is a simple one-line fix to message_ops.php that
> > "enhance the SQL to actually check that the message is quarantined
> > (fix by Dhawal Doshy, go search the MailWatch list archives...)... If
> > one want to be able to do this at all, one need to include "store" in
> > the Non Spam Actions ...
> > Brett, if you look at the details page for the message, do you have
> > the "learn/release" block at the bottom? I'm pretty certain you
> don't.
> >
> > Oh and BTW, this should've gone to the MailWatch list rather than the
> > MailScanner one... Slightly OT here:-).
> >
> > Cheers
>
> Oh, sorry... you do have "store" set. Hm. This tyraining, is it from
> the MessageOps page, or the details? What happens if you try train
> manually? Is the message content "there"? Do you have a
> clean_quarantine script in cron, and what have you set the cleani9ng
> period to be?
>
> Cheers
> --
> -- Glenn
> email: glenn < dot > steen < at > gmail < dot > com
> work: glenn < dot > steen < at > ap1 < dot > se
> --
>
> Hi Glen,
>
> Sorry for the topic being OT, I have recognized the problem being
> quarantine related and started at MailScanner... I'll go away and ask
> the MailWatch list if I can confirm my quarantine settings!
>
> I don't seem to see the quarantine messages anymore since I turned off
> 'Quarantine Whole Messages As Queue Files' yesterday, before that I
> would get the files in the quarantine albeit with a different queue
> identifier than the SQL logged eg qfm4C5WuYl031783 for queue, SQL wants
> a format of m4CKXUKw031678.
That is the queue file, which is what you get from having the wrong
setting in MailScanner.conf ... You should have a file called
"message" in there (for the virus quarantine...), which contain the
RFC822 message (the whole message, less the envelope info... which is
fine, since you have that in the database). In the non-spam and spam
quarantine you should have files named as the queue ID, containg the
same thing (the plain text RFC822 message).
MailWatch depends on this.
> I have been using the 'MessageOps' page in reports and my individual
> message's don't show learn spam/ham. I can't seem to find my messages in
>
> quarantine unless I have the full queue file logged. I think I need to
No, this is likely completely backwards:-). Likely due to your
permission issue (see the MW list post(s)).
> use just the message content if I were to train the bayes engine (not
> the full queue file), can you please confirm if I am right in this
> assumption? I would like to try and train manually but I haven't been
> able to store all my messages...
>
> I have the clean_quarantine script ready to go but don't have it
> currently running...
>
> Thanks for your help,
> Brett
>
> --
> MailScanner mailing list
> mailscanner at lists.mailscanner.info
> http://lists.mailscanner.info/mailman/listinfo/mailscanner
>
> Before posting, read http://wiki.mailscanner.info/posting
>
> Support MailScanner development - buy the book off the website!
>
>
>
> Good news, I solved the problem, it was all to do with permissions on
> the quarantine dirs and the bayes dir. I can now learn and train from
> Mailwatch successfully.
And likely have reset the quarantine as queue files setting to "no"?
> Thanks for your help,
> Brett
>
Cheers
--
-- Glenn
email: glenn < dot > steen < at > gmail < dot > com
work: glenn < dot > steen < at > ap1 < dot > se
More information about the MailScanner
mailing list