Resending [Phishing net and international characters]

Julian Field MailScanner at ecs.soton.ac.uk
Mon Aug 29 17:08:44 IST 2005


    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "US-ASCII" character set.  ]
    [ Some characters may be displayed incorrectly. ]

Denis Beauchemin wrote:

> Rabellino Sergio wrote:
>
>> Denis Beauchemin wrote:
>>
>>> On Sat, 27 Aug 2005 11:05:20 +0100, Julian Field
>>> <MailScanner at ECS.SOTON.AC.UK> wrote:
>>>
>>>  
>>>
>>>> How is MailScanner going to know that %E9 is the same as &eacute; ?
>>>>   
>>>
>>>
>>>
>>> I don't know but since these are no longer illegal there has to be a 
>>> way to
>>> not treat them as such.
>>>
>>> Denis
>>>
>>>  
>>>
>> I found this table (grabbed from w3c) that can be used to map the 
>> binary code to HTML code.
>>
>> http://htmlcodetutorial.com/characterentities_famsupp_69.html
>>
>> The bad thing is that this table must be hardcoded into MS, I think.
>>
>> Bye.
>>
>> PS. I've checked the table and E9 (=233) map to acute;
>
>
> Julian,
>
> I have found this Perl module: URI::Escape which has functions to 
> encode/decode %nn characters.  At the end of the readme it also says:
> The module can also export the |%escapes| hash, which contains the 
> mapping from all 256 bytes to the corresponding escape codes. Lookup 
> in this hash is faster than evaluating |sprintf("%%%02X", ord($byte))| 
> each time.
>
> Now, I'll try to find some other module that can convert the 
> international character into &eacute;...

I have just hardcoded in the table that was in the htmlcodetutorial link 
above. Works nicely.

-- 
Julian Field
www.MailScanner.info
Buy the MailScanner book at www.MailScanner.info/store
Professional Support Services at www.MailScanner.biz
MailScanner thanks transtec Computers for their support

PGP footprint: EE81 D763 3DB0 0BFD E1DC 7222 11F6 5947 1415 B654

------------------------ MailScanner list ------------------------
To unsubscribe, email jiscmail at jiscmail.ac.uk with the words:
'leave mailscanner' in the body of the email.
Before posting, read the Wiki (http://wiki.mailscanner.info/) and
the archives (http://www.jiscmail.ac.uk/lists/mailscanner.html).

Support MailScanner development - buy the book off the website!



More information about the MailScanner mailing list