my mail server is drowning

Julian Field mailscanner at ecs.soton.ac.uk
Wed May 7 07:59:53 IST 2003


Try
1) switch off bayes support in SpamAssassin
2) reduce number of children to 2
3) temporarily try f-prot instead of sophossavi
4) get rid of your auto-whitelist completely

If you are doing lots of paging, then memory is surely your problem, not
CPU. Context switching on SPARC architectures is quite an expensive
operation, so if it is spending 30% of its CPU doing nothing other than
context-switching, you have already lost most of 1 CPU. You need to get the
paging figure down.

One other alternative which might be worth it as a temporary test, is to
use an Intel box instead of a SPARC one. You can still run Solaris on it,
no problem, but context switching is a lot cheaper on Intel than SPARC.

At 20:28 06/05/2003, you wrote:
>Gang,
>
>Setup: Sun E220R (2 450 MHZ CPUS, 2 GB memory), Solaris 8,
>MailScanner-4.20-3, sophos 3.69, sophosavi, sendmail 8.12.9,
>spamassassin 2.53 (bayes learning on).  Other stuff running
>on the box: qpopper 4.0.5, mailman 2.1.2.
>
>Problem: It can't keep up with the mail traffic.  We get roughly
>25K messages a day.  I didn't really have this problem until after
>sophos 3.67 came out, it has been getting worse even with sophossavi.
>A backlog forms in mqueue.in in the morning and the box can't keep up
>during the day.  Right now (1:30 PM), I've got slightly more than 1000
>messages waiting on MailScanner.  My system load (uptime load)
>is a steady a steady 10 to 12, sar shows roughly 70% usr, 30% sys
>with no idle.  vmstat shows that the system is doing memory paging
>like crazy (pi=5K).  Iostat is not outrageous on anything, so the issue
>seems to be lack of CPU.
>
>"top" shows that MailScanner is a real heavyweight on memory usage.
>Each process is roughly 40M is size, with most of that resident in
>memory.  The only other heavy memory user is named (170 to 190 MB).
>
>I've tried tuning MS by changing the number of emails per scan (25 to 100),
>the number of children (4 to 10, currently running 6).  Nothing helps.
>It takes MS several minutes to chomp on 100 messages -- time enough for
>100 more messages to roll in.
>
>Any suggestions for tuning?  Any ideas why MS takes so much memory?
>Is this related to the bayes learning in SA?  I've noticed that the
>files in /var/spool/spamassassin are big:
>
>-rw-r--r--   1 root     daemon   1335296 May  6 13:49 auto-whitelist
>-rw-------   1 root     daemon       112 May  6 13:49 bayes.lock
>-rw-------   1 root     daemon        98 May  6 13:49 bayes.lock.emerald.14132
>-rw-------   1 root     daemon     45468 May  6 13:49 bayes_journal
>-rw-------   1 root     daemon        54 May  6 13:49 bayes_msgcount
>-rw-r--r--   1 root     daemon   10559488 May  6 13:49 bayes_seen
>-rw-------   1 root     daemon   7700480 May  6 13:49 bayes_toks
>
>and that lsof shows me that bayes_seen and bayes_toks files are used
>by MS processes.  Maybe the size of these files related to the memory
>usage of MS?
>
>--- Jeff Earickson
>     Colby College

--
Julian Field
www.MailScanner.info
MailScanner thanks transtec Computers for their support



More information about the MailScanner mailing list