massive spamassassin database files (Was: RE: Users of RBL's)

Daniel Maher daniel.maher at
Thu Jun 29 15:26:06 IST 2006

Hi there,

Speaking of massive spamassassin-related files, my auto-whitelist files are /huge/ - in every case larger than the seen and token files by a factor of 2 or 3.  Any idea what I could be doing to keep the whitelist nice and trim as well?

 °v°  Daniel Maher
/(_)\ Administrateur Système Unix
 ^ ^  Unix System Administrator
Sentio aliquos togatos contra me conspirare.
-----Original Message-----
From: mailscanner-bounces at [mailto:mailscanner-bounces at] On Behalf Of Mark Presling
Sent: June 29, 2006 6:59 AM
To: MailScanner discussion
Subject: Re: Users of RBL's

Hi Chris,

Have you checked the size of your bayes database files? I used to have a 
1GB machine that SpamAssassin would regularly time out on because the 
bayes DB would get too big from the auto learning. I had to tune it so 
that the DB file would stay below 5MB or it just timed out scanning 
larger messages. It also used up 100% of the CPU most of the time. I 
used to manually expire old tokens from it as well, but that was before 
MS started doing that automatically for me. Even on my newer server (2G 
Pentium 4) I still restrict the size of the the bayes DB with 
"bayes_expiry_max_db_size 400000". This seems to keep the DB at around 10MB.


Chris Hammond wrote:
>>>> Sounds like you may just be asking too much of the hardware.
>>> This could very well be.  Before I go asking for a new server though, I want to make sure I have my ducks in a row.
>>> When this was nothing more than a Postfix box with static rules, it handled the job just fine.  But I think it may
>>> be really working for it's living.
>> MailScanner and SpamAssassin do use a lot of resources. It looks to be 
>> cpu bound, but that's a good thing usually! Any way to upgrade that 
>> processor? To reduce CPU usage, tune/configure some software. Did you 
>> read the performance tweaks section in the mailscanner wiki? To reduce 
>> disk writes, setup syslog to log to another box, or put mysql on another 
>> box, or throw another cheap ide drive into the box and log to it, 
>> instead of the mirrored drives.
> I was beginning to feel the same way.  The DL-145 is a dual processor capable box
> so I will see about adding a second processor to it.
> I did go through the tweaks section on the wiki.  My next thought was moving MySQL to
> another machine.  There is no more room for another drive so that is not an option
> unfortunately.  I am going to move the MySQL server to another box tonight and see what
> that gains me.
> Thanks
> Chris
> --
> MailScanner mailing list
> mailscanner at
> Before posting, read
> Support MailScanner development - buy the book off the website!

This message has been scanned for viruses and dangerous
content by MailScanner, and is believed to be clean.

More information about the MailScanner mailing list