What's in a name? - "spam" / "not spam" {Scanned}

Scott Silva ssilva at SGVWATER.COM
Wed Jun 30 16:43:23 IST 2004


| Greetings -
|
| Someone has just pointed out to me a slight problem with the text used to
| identify spam and non-spam in the "X-Blah-MailScanner-SpamCheck:" header
| when used with an IMAP server...
|
| For spam this heading looks something like this:
|
|     X-Blah-MailScanner-SpamCheck: not spam, SpamAssassin (score=-4.8,
|             required 8, BAYES_00 -4.90, BIZ_TLD 0.10)
|
| and for non-spam something like this:
|
|     X-Blah-MailScanner-SpamCheck: spam, SpamAssassin (score=22.781,
|             required 8, autolearn=spam, BAYES_99 5.40,
|             DATE_IN_PAST_12_24 0.75, ... )
|
How about       ": spam" (colon space spam)
and           ": not spam"




| The problem arises when trying to use client-side filtering with an IMAP
| mail program.  Such can be set to query the IMAP server to check the text
| of a particular message header.  However the IMAP specification stipulates
| that this match is as a case-insensitive substring.
|
| Thus setting up a search to check the "X-Blah-MailScanner-SpamCheck"
header
| for the word "spam" matches both spam and not-spam messages.  (The
converse
| -- checking for non-spam messages by looking for "not spam" is fine.)
|
| I am pondering changing the wording of one or both of these two strings in
| the languages.conf file.  The aim is to use wording such that neither is a
| case-insensitive substring of the other.
|
| But choosing words that satisfy this whilst still being clear to users is
| proving trickier than I'd at first thought.  Possibilities I've toyed with
| so far are:
|
|     spam                not spam
|     ----                --------
|     *spam*              not spam        (wildcards + problems if *s
omitted)
|     spammy              not spam        (too colloquial?)
|     probable spam       not spam        (doesn't fit high-scoring spam
well)
|     spam                genuine         (implies approval of leak-thru
spam)
|     spam                legitimate      (ditto)
|     spam                pukka           (do most staff/students know
pukka?)
|
| Bearing in mind this is a "difficult to change subsequently" setting once
| people have started using it in their filters I was wondering if any other
| sites had taken this step to address the problem?
|
| Cheers,
|
| Mike B-)
|
| --
| The Computing Service, University of York, Heslington, York Yo10 5DD, UK
| Tel:+44-1904-433811  FAX:+44-1904-433740
|
| * Unsolicited commercial e-mail is NOT welcome at this e-mail address. *
|
| -------------------------- MailScanner list ----------------------
| To leave, send    leave mailscanner    to jiscmail at jiscmail.ac.uk
| Before posting, please see the Most Asked Questions at
| http://www.mailscanner.biz/maq/     and the archives at
| http://www.jiscmail.ac.uk/lists/mailscanner.html
|
| --
| This message has been scanned for viruses and
| dangerous content by MailScanner, and is
| believed to be clean.
|


--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

-------------------------- MailScanner list ----------------------
To leave, send    leave mailscanner    to jiscmail at jiscmail.ac.uk
Before posting, please see the Most Asked Questions at
http://www.mailscanner.biz/maq/     and the archives at
http://www.jiscmail.ac.uk/lists/mailscanner.html



More information about the MailScanner mailing list