when is Bayes scoring used?
Dene Ulmschneider
dene at DATATECHIE.COM
Wed May 7 21:20:47 IST 2003
I have checked, double-checked and triple-checked the system.
Then I uninstalled and reinstalled SpamAssassin. I will see what happens
now with the number of messages reported when I run 'sa-learn -D --rebuild'
in a day or two.
One thing I did notice is that there are 2 directories with the bayes
files in them (bayes_seen, bayes_toks, and bayes_msgcount). Are those files
supposed to be in /root/.spamassassin as well as /.spamassassin?
I thought it was supposed to be one or the other?
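
As a rough way to compare the two locations, here is a minimal Python
sketch (not part of SpamAssassin or MailScanner) that lists every file
whose name starts with "bayes_" in each directory, with its size and
last-modified time. The function name find_bayes_files and the "bayes_"
prefix match are assumptions made for illustration; the two paths are the
ones mentioned above.

```python
import os
import time


def find_bayes_files(directories, prefix="bayes_"):
    """Return (path, size_bytes, mtime) for each Bayes-looking file found.

    Matching on a name prefix is an assumption; it is meant to catch both
    plain names (bayes_toks) and suffixed ones (bayes_toks.db).
    """
    found = []
    for directory in directories:
        if not os.path.isdir(directory):
            continue  # skip locations that do not exist on this system
        for name in sorted(os.listdir(directory)):
            path = os.path.join(directory, name)
            if name.startswith(prefix) and os.path.isfile(path):
                info = os.stat(path)
                found.append((path, info.st_size, info.st_mtime))
    return found


if __name__ == "__main__":
    # The two directories named in this message.
    for path, size, mtime in find_bayes_files(["/root/.spamassassin", "/.spamassassin"]):
        print(f"{path}\t{size} bytes\tmodified {time.ctime(mtime)}")
```

If only one directory shows recent timestamps, that is probably the one
being written to during scanning; the other may be a leftover.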
Dene
At 10:43 AM 5/7/2003 -0400, you wrote:
>Maybe you have different versions of SpamAssassin installed? One version
>using the files with .db at the end and the other without? Did you upgrade
>SpamAssassin on 3/28? If you have multiple versions (or older remnants of
>versions) of SpamAssassin, you could try uninstalling it. Then make sure it
>is uninstalled, and tools like sa-learn are not still around. Then
>reinstall and see if that helps.
>
>Jason
>
> > -----Original Message-----
> > From: Dene Ulmschneider [mailto:dene at DATATECHIE.COM]
> > Sent: Wednesday, May 07, 2003 10:07 AM
> > To: MAILSCANNER at JISCMAIL.AC.UK
> > Subject: Re: [MAILSCANNER] when is Bayes scoring used?
> >
> >
> > When I tried that - the output returned some errors about not being
> > able to parse some whitelist_from and blacklist_from so I commented
> > them out since I have the white and black lists in separate files
> > anyway. Other than that - the output was identical.
> >
> > As far as the bayes_path - I would think that it is set correctly
> > (even though I did not specify it in any file anywhere) because when
> > MS is scanning messages - I can see the lock file created and deleted.
> > Also, all of the relevant Bayes files are being modified many times
> > every day.
> >
> > If you feel that specifying the bayes_path will help - I will try it
> > - but the spam.assassin.prefs.conf says that you only need to do that
> > if you move it from the default location...
> >
> > thanks
> >
> > Dene
> >
> > At 02:48 PM 5/7/2003 +0100, you wrote:
> > >Dene,
> > >
> > >How about trying this:
> > >
> > >'sa-learn -D -p /etc/MailScanner/spam.assassin.prefs.conf --rebuild'
> > >
> > >Does this show anything different? Also, have you set 'bayes_path' in
> > >any of the prefs files?
> > >
> > >Kind regards,
> > >Steve
> > >--
> > >Steve Freegard
> > >Systems Manager
> > >Littlehampton Book Services Ltd.
> > >
> > > > ----------
> > > > From: Dene Ulmschneider
> > > > Reply To: MailScanner mailing list
> > > > Sent: Wednesday, May 7, 2003 2:41 PM
> > > > To: MAILSCANNER at jiscmail.ac.uk
> > > > Subject: Re: when is Bayes scoring used?
> > > >
> > > > Julian-
> > > >
> > > > there was a message that I sent yesterday (clipped it out of last
> > > > email to list) that showed the complete output of the command
> > > > "sa-learn -D --rebuild". That's how I know the system says there are
> > > > "Only 87 spam(s) in Bayes_db < 200".
> > > >
> > > > I cannot figure out why the learn.spam.log is always counting
> > > > "learned from" messages but it is not increasing the number when I
> > > > run the "sa-learn -D --rebuild".
> > > >
> > > > Any ideas?
> > > >
> > > > Dene
> > > >
> > > >
> > > > At 01:50 PM 5/7/2003 +0100, you wrote:
> > > > >At 13:32 07/05/2003, you wrote:
> > > > >>OK - this is really getting a little confusing...
> > > > >>
> > > > >>I checked my "learn.spam.log" this morning and found the following
> > > > >>entries:
> > > > >>
> > > > >>Wed May 7 00:01:01 EDT 2003
> > > > >>Learned from 4 messages.
> > > > >>Learned from 1 messages.
> > > > >>Wed May 7 01:01:01 EDT 2003
> > > > >>
> > > > >>The problem is that when I run sa-learn -D --rebuild I still get the
> > > > >>message that says:
> > > > >>Only 87 spam(s) in Bayes_db < 200
> > > > >>(it should be AT LEAST the 87 from yesterday plus the ones listed
> > > > >>above - right?)
> > > > >>
> > > > >>Can anyone tell me how to fix this? The Bayes files on
> > > > >>/root/.spamassassin are all being updated multiple times per day so
> > > > >>I know it is working, unless of course the sa-learn command is
> > > > >>reading Bayes info from another directory that really DOES only have
> > > > >>87 spam(s).
> > > > >>
> > > > >>Is there a way to run sa-learn and have it tell you the path that it
> > > > >>is reading the Bayes info from?
> > > > >
> > > > >Have you tried
> > > > > sa-learn -D
> > > > >? I just ran "sa-learn" on its own and it prints the usage for you.
> > > > >
> > > > >
> > > > >>Thanks for any assistance.
> > > > >>
> > > > >>Dene
> > > > >>
> > > > >>At 02:38 PM 5/6/2003 -0400, you wrote:
> > > > >>>something else to add...
> > > > >>>
> > > > >>>According to the script that Julian provided to run sa-learn through
> > > > >>>cron, my log is called "learn.spam.log"
> > > > >>>
> > > > >>>When I checked that file - I added up all of the "learned from XX
> > > > >>>messages" and the total number was 447.
> > > > >>>
> > > > >>>Is the "learned from" referring to spam and ham? Is it possible that
> > > > >>>I have 87 spam and the rest of them are ham? I thought I was pretty
> > > > >>>sure that more spam was getting processed than ham - but I could be
> > > > >>>wrong.
> > > > >>>
> > > > >>>Can anyone shed a little light?
> > > > >>>
> > > > >>>Dene
> > > > >
> > > > >--
> > > > >Julian Field
> > > > >www.MailScanner.info
> > > > >MailScanner thanks transtec Computers for their support
> > > >
> > > >
> > >
> > >
> > >--
> > >This message has been scanned for viruses and dangerous
> > >content by Data Techie, and is believed to be clean.
> > >Data Techie... always there to protect you!
> > >http://www.datatechie.com
> >
> > Thank You
> >
> > Dene Ulmschneider
> > Data Techie Inc.
> > -------------------------------------------------------------------------
> > office: 718.738.8859
> > cell: 646.996.2976
> > email: dene at datatechie.com
> > pager mail: denenow at datatechie.com
> > website: www.datatechie.com
> > -------------------------------------------------------------------------
> > "Life is too short...-...you should have dessert first"
> >
>