<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<META content="MSHTML 6.00.2800.1170" name=GENERATOR></HEAD>
<BODY>
<DIV><SPAN class=179533415-06052003><FONT face=Arial color=#0000ff size=2>I
think you need 200 spam and 200 ham. Try running spamassassin with the -D
switch for debug and see what it says about bayes. Also, you can run the
check_bayes_db command and see how many spam and ham have been learned.
And you can run "sa-learn -D --rebuild" and see if it says anything about there
not being enough spam or ham. These may give you some clues to your
questions.</FONT></SPAN></DIV>
<DIV><SPAN class=179533415-06052003><FONT face=Arial color=#0000ff
size=2></FONT></SPAN> </DIV>
<DIV><SPAN class=179533415-06052003><FONT face=Arial color=#0000ff
size=2>Jason</FONT></SPAN></DIV>
<BLOCKQUOTE dir=ltr
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #0000ff 2px solid; MARGIN-RIGHT: 0px">
<DIV class=OutlookMessageHeader dir=ltr align=left><FONT face=Tahoma
size=2>-----Original Message-----<BR><B>From:</B> Dene Ulmschneider
[mailto:dene@DATATECHIE.COM]<BR><B>Sent:</B> Tuesday, May 06, 2003 10:53
AM<BR><B>To:</B> MAILSCANNER@JISCMAIL.AC.UK<BR><B>Subject:</B> Re:
[MAILSCANNER] when is Bayes scoring used?<BR><BR></FONT></DIV>Hey Julian et
all-<BR><BR>In regards to all of the messages I have read that Bayes will not
start working until the magic number of 200 messages is reached, I am certain
that I have processed more than 200 messages and yet I still see no "Bayes"
entries in the headers.<BR><BR>I have checked the files in /root/.spamassassin
and found the
following:<BR><BR><I>filename<X-TAB> </X-TAB><X-TAB> </X-TAB>size<X-TAB> </X-TAB><X-TAB> </X-TAB>date
modified<BR></I>auto-whitelist<X-TAB> </X-TAB><X-TAB> </X-TAB>644.0
kb<X-TAB> </X-TAB>today<BR>auto-whitelist.db<X-TAB> </X-TAB>12.0
kb<X-TAB> </X-TAB><X-TAB> </X-TAB>3.28.03<BR>bayes_msgcount<X-TAB> </X-TAB>3.2
kb<X-TAB> </X-TAB><X-TAB> </X-TAB>today<BR>bayes_seen<X-TAB> </X-TAB><X-TAB> </X-TAB>1.3
mb<X-TAB> </X-TAB><X-TAB> </X-TAB>today<BR>bayes_seen.db<X-TAB> </X-TAB><X-TAB> </X-TAB>4.0
kb<X-TAB> </X-TAB><X-TAB> </X-TAB>3.28.03<BR>bayes_toks<X-TAB> </X-TAB><X-TAB> </X-TAB>2.6
mb<X-TAB> </X-TAB><X-TAB> </X-TAB>today<BR>bayes_toks.db<X-TAB> </X-TAB><X-TAB> </X-TAB>12.0
kb<X-TAB> </X-TAB><X-TAB> </X-TAB>3.28.03<BR><BR>while
I was checking these files - I saw that a new file was created and then
deleted called auto-whitelist.lock, due to the fact that the system starting
processing mails at this time.<BR><BR>The questions that I have
are:<BR>1-according to previous statements about the size of bayes_msgcount,
have I only correctly processed 3 or 4 emails?<BR>2-why are all of the .db
files form a month and a half ago?<BR>3-why are there still no headers
containing anything regarding Bayes?<BR><BR>Am I missing something. I have had
MailScanner running for about 2 months now and am certain that I have
processed enough emails.<BR><BR>Any help is appreciated.<BR><BR>Thank
You<BR><BR>Dene Ulmschneider<BR>Data Techie
Inc.<BR>-------------------------------------------------------------------------<BR>office:<X-TAB> </X-TAB><X-TAB> </X-TAB>718.738.8859<BR>email:<X-TAB> </X-TAB><X-TAB> </X-TAB>dene@datatechie.com<BR>pager
mail:<X-TAB> </X-TAB>denenow@datatechie.com<BR>website:<X-TAB> </X-TAB><A
href="http://www.datatechie.com/"
eudora="autourl">www.datatechie.com</A><BR>-------------------------------------------------------------------------<BR>"Life
is too short...-...you should have dessert first"<BR><BR>At 02:29 PM
5/6/2003 +0100, you wrote:<BR>
<BLOCKQUOTE class=cite cite="" type="cite">At 14:18 06/05/2003, you
wrote:<BR>
<BLOCKQUOTE class=cite cite="" type="cite">Well i have just setup
mailscanner 4.20-3 and i have some problemes<BR>with bayes
"scoring".<BR><BR>I have the bayes database working as it s modified each
time i receive<BR>a mail but when i gor spam i never seen BAYES_DB tag in
the scoring of<BR>spam.<BR>Is there a minim size of the bayes database in
order to be uzed for<BR>scoring?</BLOCKQUOTE><BR>It won't start using the
results of the Bayes data until 200 messages have<BR>been scanned. The
bayes_msgcount file will tell you how many it has scanned<BR>(file size ==
number of messages).<BR><BR><BR>
<BLOCKQUOTE class=cite cite="" type="cite">Thanks in advance for any
help<BR><BR>P.S<BR>the command<BR>check_bayes_db -db
/var/spool/spamassassin/bayes | head
-8<BR>0.000
0
0 0 non-token data: db
format = on-the-fly<BR>probs,<BR>expiry,
scan-counting<BR>0.000
0
16 0 non-token data:
nspam<BR>0.000
0 1233
0 non-token data:
nham<BR>0.000
0 51394
0 non-token data:
ntokens<BR>0.000
0
0 0 non-token data: oldest
age<BR>0.000
0 1382
0 non-token data: current
scan-count<BR>0.000
0
0 0 non-token data: last
expiry scan-count<BR>0.027
0
8 801 english<BR><BR><BR>--<BR>Eric
Doutreleau<BR>I.N.T
| Tel : +33 (0) 160764687<BR>9 rue Charles Fourier
| Fax : +33 (0) 160764321<BR>91011 Evry
France | email :
Eric.Doutreleau@int-evry.fr</BLOCKQUOTE><BR>--<BR>Julian Field<BR><A
href="http://www.mailscanner.info/"
eudora="autourl">www.MailScanner.info</A><BR>MailScanner thanks transtec
Computers for their support<BR><BR>--<BR>This message has been scanned for
viruses and dangerous<BR>content by Data Techie, and is believed to be
clean.<BR>Data Techie... always there to protect you!<BR><A
href="http://www.datatechie.com/"
eudora="autourl">http://www.datatechie.com</A></BLOCKQUOTE></BLOCKQUOTE></BODY></HTML>