Recognising and flagging 'foreign' language e-mails in MCP

Quentin Campbell Q.G.Campbell at newcastle.ac.uk
Thu May 24 13:13:01 IST 2007


I use a small group of SpamAssassin rules in MCP to add a header to any
message that looks like it is in Russian. The added header will look
something like:

X-Newcastle-MailScanner-MCPCheck: MCP-Clean, MCP-Checker (score=0.01,
	required 1, MCP_RUSSIAN 0.01)

This allows anyone who expects to receive messages in Russian to set up
a personal mail filter rule to look for the string "MCP_RUSSIAN" in the
message headers and move such messages into a "Russian" folder.
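
For illustration only, here is a minimal Python sketch of what such a
personal filter rule does. It is not the filter language of any
particular mail client. The function name folder_for and the folder
names "Russian" and "INBOX" are hypothetical, and the sketch assumes
the MCP header name ends in "-MailScanner-MCPCheck" as in the example
above (the "Newcastle" part is site-specific).

```python
from email import message_from_string


def folder_for(raw_message, tag="MCP_RUSSIAN", tagged_folder="Russian"):
    """Return the folder a message should be filed in.

    Hypothetical helper: looks for `tag` in any MailScanner MCP check
    header and returns `tagged_folder` on a match, else "INBOX".
    """
    msg = message_from_string(raw_message)
    for name, value in msg.items():
        # The site name in the header varies, so match on the suffix only.
        if name.lower().endswith("-mailscanner-mcpcheck") and tag in value:
            return tagged_folder
    return "INBOX"
```

A real filter would run this check before any rule that files tagged
spam, so that a message carrying the tag reaches the "Russian" folder
first.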

They need to do this because most messages in Russian that are received
here are tagged as spam, and most of them really are spam!

If the personal filter rule that matches "MCP_RUSSIAN" precedes the
personal filter rules that recipients use for dealing with tagged spam,
then they won't miss (possibly) important messages in Russian.

I want to do similar tagging in MCP for messages in German, Chinese and
Japanese and perhaps other languages if the need arises.

I am probably re-inventing the wheel here. Does anyone have, or know of,
sets of SpamAssassin rules that reliably recognise e-mail in various
foreign languages, the three languages above in particular? The SA
ok_languages and ok_locales options don't quite work in the way that is
needed to achieve the above.
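
To make the kind of recognition being asked about concrete, here is a
rough Python sketch of one common heuristic: mapping the declared MIME
charset of each message part to a language tag. This is not a set of
SpamAssassin rules and not the rules described above. The table
CHARSET_HINTS, the function language_hints and the tag names are
hypothetical. It only catches messages that declare a language-specific
legacy charset, so UTF-8 mail is missed, and German is missed as well
because its usual charsets are shared with many other Western European
languages.

```python
from email import message_from_string

# Hypothetical mapping from declared charset to a language tag.
CHARSET_HINTS = {
    "koi8-r": "RUSSIAN",
    "windows-1251": "RUSSIAN",
    "iso-8859-5": "RUSSIAN",
    "gb2312": "CHINESE",
    "gbk": "CHINESE",
    "big5": "CHINESE",
    "iso-2022-jp": "JAPANESE",
    "shift_jis": "JAPANESE",
    "euc-jp": "JAPANESE",
}


def language_hints(raw_message):
    """Return a sorted list of language tags suggested by declared charsets."""
    msg = message_from_string(raw_message)
    hints = set()
    # get_charsets() yields one entry per MIME part, None where undeclared.
    for charset in msg.get_charsets():
        if charset and charset.lower() in CHARSET_HINTS:
            hints.add(CHARSET_HINTS[charset.lower()])
    return sorted(hints)
```

A declared charset is only a hint: a message in a Cyrillic charset
could just as well be Bulgarian or Ukrainian, so a real rule set would
need to look at the message text too.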

Quentin 
---
PHONE: +44 191 222 8209    Information Systems and Services (ISS),
                           Newcastle University,
                           Newcastle upon Tyne,
FAX:   +44 191 222 8765    United Kingdom, NE1 7RU.
------------------------------------------------------------------


