Upgraded to 4.67.6, MailScanner scans a batch then hangs at 100
percent CPU
Julian Field
MailScanner at ecs.soton.ac.uk
Wed Mar 12 19:42:57 GMT 2008
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Glenn Steen wrote:
> On 12/03/2008, Steve Crumley <scrumley at secure-enterprise.com> wrote:
>
>> > -----Original Message-----
>> > From: mailscanner-bounces at lists.mailscanner.info
>> > [mailto:mailscanner-bounces at lists.mailscanner.info] On Behalf
>>
>>
>>> Of Julian Field
>>>
>> > Sent: Tuesday, March 11, 2008 6:50 PM
>> > To: MailScanner discussion
>> > Subject: Re: Upgraded to 4.67.6, MailScanner scans a batch
>> > then hangs at 100 percent CPU
>> >
>>
>>
>>> * PGP Signed by an unverified key: 03/11/08 at 18:50:26
>>>
>> >
>> >
>> >
>> > Steve Crumley wrote:
>> > >
>> > >
>> > >
>> > >> -----Original Message-----
>> > >> From: mailscanner-bounces at lists.mailscanner.info
>> > >> [mailto:mailscanner-bounces at lists.mailscanner.info] On Behalf
>> > >> Of Glenn Steen
>> > >> Sent: Tuesday, March 11, 2008 4:32 PM
>> > >> To: MailScanner discussion
>> > >> Subject: Re: Upgraded to 4.67.6,MailScanner scans a batch
>> > >> then hangs at 100 percent CPU
>> > >>
>> > >> On 11/03/2008, Steve Crumley
>> > <scrumley at secure-enterprise.com> wrote:
>> > >>
>> > >>> > -----Original Message-----
>> > >>> > From: mailscanner-bounces at lists.mailscanner.info
>> > >>> > [mailto:mailscanner-bounces at lists.mailscanner.info] On Behalf
>> > >>>
>> > >>>
>> > >>>> Of Glenn Steen
>> > >>>>
>> > >>> > Sent: Tuesday, March 11, 2008 1:21 PM
>> > >>> > To: MailScanner discussion
>> > >>> > Subject: Re: Upgraded to 4.67.6,MailScanner scans a batch
>> > >>> > then hangs at 100 percent CPU
>> > >>> >
>> > >>> > On 11/03/2008, Steve Crumley
>> > >>>
>> > >> <scrumley at secure-enterprise.com> wrote:
>> > >>
>> > >>> > >
>> > >>> > >
>> > >>> > > > -----Original Message-----
>> > >>> > > > From: mailscanner-bounces at lists.mailscanner.info
>> > >>> > > > [mailto:mailscanner-bounces at lists.mailscanner.info]
>> > >>>
>> > >> On Behalf
>> > >>
>> > >>> > > > Of --[ UxBoD ]--
>> > >>> > >
>> > >>> > > > Sent: Tuesday, March 11, 2008 11:29 AM
>> > >>> > > > To: MailScanner discussion
>> > >>> > > > Subject: Re: Upgraded to 4.67.6, MailScanner scans a batch
>> > >>> > > > then hangs at 100 percent CPU
>> > >>> > > >
>> > >>> > >
>> > >>> > > > do you have strace installed on the server ? if so when the
>> > >>> > > > process is running at 100% CPU connect to it and
>> > see what it
>> > >>> > > > is doing. I had this before, but for the life of
>> > >>>
>> > >> me I cannot
>> > >>
>> > >>> > > > remember what I changed to fix it :(
>> > >>> > > >
>> > >>> > > > Things to check :-
>> > >>> > > >
>> > >>> > > > 1) Permissions, are they all correct
>> > >>> > > > 2) Check MailScanner.conf again just to make sure no typos
>> > >>> > > >
>> > >>> > > > Regards,
>> > >>> > > >
>> > >>> > > > --
>> > >>> > >
>> > >>> > >
>> > >>> > > Here is the output from strace:
>> > >>> > >
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > > waitpid(-1, 0xbff09448, WNOHANG) = 0
>> > >>> > >
>> > >>> > >
>> > >>> > >
>> > >>> > >
>> > >>> > > The system had been running fine for over a year, I
>> > >>>
>> > >> can't find any
>> > >>
>> > >>> > > permission or setting change thats doing this, but
>> > I could be
>> > >>> > > overlooking something.
>> > >>> > > Thanks,
>> > >>> > > -Steve
>> > >>> > >
>> > >>> > Could perhaps be a busted SQLite SA cache? What does
>> > >>>
>> > >> analyse_s<TAB> (I
>> > >>
>> > >>> > don't remember if it is sacache or spamassassin_cache
>> > >>>
>> > >> ... the command
>> > >>
>> > >>> > completion should take care of it:-) say? If it looks
>> > >>>
>> > >> fishy, simply
>> > >>
>> > >>> > delete the SA cache file and restart MS.
>> > >>> >
>> > >>> > You've run MailScanner --lint, right? Nothing obvious
>> > from that?
>> > >>> >
>> > >>> > Oh, and what av scanners do you use? Obviously not
>> > >>>
>> > >> clamavmodule, but
>> > >>
>> > >>> > perhaps clamav or clamd? are those OK?
>> > >>> >
>> > >>> > Cheers
>> > >>> > --
>> > >>> > -- Glenn
>> > >>> > email: glenn < dot > steen < at > gmail < dot > com
>> > >>> > work: glenn < dot > steen < at > ap1 < dot > se
>> > >>>
>> > >>>
>> > >>>> --
>> > >>>>
>> > >>> > MailScanner mailing list
>> > >>> > mailscanner at lists.mailscanner.info
>> > >>> > http://lists.mailscanner.info/mailman/listinfo/mailscanner
>> > >>> >
>> > >>> > Before posting, read http://wiki.mailscanner.info/posting
>> > >>> >
>> > >>> > Support MailScanner development - buy the book off the website!
>> > >>> >
>> > >>>
>> > >>>
>> > >>>
>> > >>> analyse_SpamAssassin_cache looks clean, MailScanner --lint
>> > >>>
>> > >> is clean too.
>> > >>
>> > >>> I'm running clamd for AV but I've set virus scanning to no
>> > >>>
>> > >> while working
>> > >>
>> > >>> on this.
>> > >>>
>> > >>> Thanks,
>> > >>> -Steve
>> > >>>
>> > >> Couldn't be something easily mended, huh:-)....
>> > >>
>> > >> What you seem to have attached to above (with strace) would be the
>> > >> main MailScanner process, since it basically just wait for it's
>> > >> children to end... Or is it? What does a ps listing show (one that
>> > >> show the command argument list, since Jules rewrite it to
>> > show what it
>> > >> thinks it is basically doing)?
>> > >> Do the children restart endlessly when hung? How many children are
>> > >> there, and in what state?
>> > >> Cheers
>> > >> -- Glenn
>> > >>
>> > >
>> > >
>> > >
>> > > When I first started it with 8 children, they all end up
>> > quickly hanging
>> > > and consuming CPU. For now, I've set it to 1 child and I've been
>> > > running in debug mode. The ps gives us a good clue! Its the only
>> > > mailscanner process and it reports "MailScanner: extracting
>> > attachments"
>> > >
>> > > Thanks,
>> > > -Steve
>> > >
>> > In which case go into "sub Explode" in
>> > /usr/lib/MailScanner/MailScanner/Message.pm, and add some
>> > "print STDERR"
>> > lines to generate tracing output so you can see how far it gets. When
>> > you do a "MailScanner --debug" it will show you the STDERR
>> > debug output
>> > in the terminal session.
>> >
>> > Jules
>> >
>>
>>
>> There's something very screwed up with my perl. I've put "print"s in
>> MailScanner around the call to Explode and I put a print first thing in
>> Explode. I get the output right before the call but nothing from
>> explode itself and we never return to MailScanner.
>>
>> I really appreciate everyone's help with this.
>> Thanks,
>> -Steve
>>
>>
> I wonder if STDERR is unbuffered (too lazy/tired to go look it up...:)
> ... Jules? Else you might need do that to get reliable error
> printing...
>
STDERR is unbuffered.
Jules
- --
Julian Field MEng CITP CEng
www.MailScanner.info
Buy the MailScanner book at www.MailScanner.info/store
MailScanner customisation, or any advanced system administration help?
Contact me at Jules at Jules.FM
PGP footprint: EE81 D763 3DB0 0BFD E1DC 7222 11F6 5947 1415 B654
PGP public key: http://www.jules.fm/julesfm.asc
-----BEGIN PGP SIGNATURE-----
Version: PGP Desktop 9.8.1 (Build 2523)
Comment: Use Thunderbird Enigmail to verify this message
Charset: ISO-8859-1
wj8DBQFH2DJGEfZZRxQVtlQRAujLAJ4ytKdJ3TqZrSPJDuyHKSomDJG13QCbBmN6
E1kEwpWmNuq7SzIB3mBdBTQ=
=22gB
-----END PGP SIGNATURE-----
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
More information about the MailScanner
mailing list