HTML body changed??? - Different SA scores / SA and MA

Jan-Peter Koopmann Jan-Peter.Koopmann at SECEIDOS.DE
Tue Mar 11 13:04:58 GMT 2003


Hi Julian,

I am still playing around with SpamAssassin and MailScanner a bit. Here
is something strange:

I took a junk mail and ran it through spamassassin -t. This is the
report:


Content analysis details:   (25.80 points, 6 required)
X_PRIORITY_HIGH    (1.9 points)  Sent with 'X-Priority' set to high
BAYES_90           (2.9 points)  BODY: Bayesian classifier says spam
probability is 90 to 99%
                   [score: 0.9815]
HTML_40_50         (0.4 points)  BODY: Message is 40% to 50% HTML
HTML_IMAGE_ONLY_02 (1.5 points)  BODY: HTML has images with 0-200 bytes
of words
PYZOR_CHECK        (1.2 points)  Listed in Pyzor, see
http://pyzor.sf.net/
DATE_IN_PAST_12_24 (0.1 points)  Date: is 12 to 24 hours before
Received: date
MSGID_OUTLOOK_TIME (4.4 points)  Message-Id is fake (in Outlook Express
format)
RCVD_FAKE_HELO_DOTCOM_2 (2.8 points)  Received contains a faked HELO
hostname (2)
RCVD_IN_NJABL      (1.2 points)  RBL: Received via a relay in
dnsbl.njabl.org
                   [RBL check: found 3.160.178.202.dnsbl.njabl.org.,]
                   [type: 127.0.0.9]
RCVD_IN_OSIRUSOFT_COM (0.5 points)  RBL: Received via a relay in
relays.osirusoft.com
                   [RBL check: found
3.160.178.202.relays.osirusoft.com., type: 127.0.0.3]
RCVD_IN_BL_SPAMCOP_NET (4.0 points)  RBL: Received via a relay in
bl.spamcop.net
                   [RBL check: found 3.160.178.202.bl.spamcop.net.]
RCVD_IN_DSBL       (4.3 points)  RBL: Received via a relay in
list.dsbl.org
                   [RBL check: found 3.160.178.202.list.dsbl.org.]
PRIORITY_NO_NAME   (0.6 points)  Message has priority setting, but no
X-Mailer


Then I fed exatly the same file into my system using exim -t < msg.txt.
This is what SA/MS found:

X-MailScanner-SpamCheck: spam, SpamAssassin (score=23.1, required 6,
AWL,
        BAYES_90, DATE_IN_PAST_12_24, HTML_20_30, HTML_IMAGE_ONLY_06,
        MSGID_OUTLOOK_TIME, PRIORITY_NO_NAME, RCVD_FAKE_HELO_DOTCOM_2,
        RCVD_IN_BL_SPAMCOP_NET, RCVD_IN_DSBL, RCVD_IN_NJABL,
        RCVD_IN_OSIRUSOFT_COM, X_PRIORITY_HIGH)


MailScanner/SpamAssassin changed HTML_40_50 to HTML_20_30. Why/How?
Moreover it shows HTML_IMAGE_ONLY_06 and not _02. Obviously something
changed the HTML source. I cannot see an Iframe tag anywhere. Moreover
the PYZOR_CHECK is missing which also indicates that the body has been
altered by MailScanner.

This is the body of the msg.file:

This is a multi-part message in MIME format.

------_=_NextPart_001_01C2E70F.C1B22B00
Content-Type: text/plain;
        charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

 <http://datematch.org/dateme/> I can't wait to meet  =
8421PFsC8-249DRPN4997MsTV2-l25=20

------_=_NextPart_001_01C2E70F.C1B22B00
Content-Type: text/html;
        charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable


<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Diso-8859-1">
<center>
<a href=3D"http://datematch.org/dateme/">
I can't wait to meet
<img src=3D"http://dateme.coolfreepages.com/date.jpg"
</a>

8421PFsC8-249DRPN4997MsTV2-l25

------_=_NextPart_001_01C2E70F.C1B22B00--


Thanks,
  JP




More information about the MailScanner mailing list