Feature request: strip specific HTML tags {Scanned by HJMS}

Julian Field mailscanner at ecs.soton.ac.uk
Fri Sep 19 17:11:44 IST 2003


I need to work out how to use rather more of HTML::Parser and TokeParser 
than I understand at the moment. Time for some test code on a little case.

At 15:43 19/09/2003, you wrote:
>Nick,
>
>this is turning into a FFR (Frequent Feature Request :-) for MailScanner...
>as far as I've seen, there was a very nice functional proposal in
>http://tinyurl.com/nxyp ... that is, now it must be coded... I don't have the
>time right now (and I don't consider myself a nice parser programmer), it is
>possible that Julian is on other tasks... maybe someone else on the list has
>the time and the guts to do it... if it is well done, probably Julian will
>add it... this is the nice thing of open source, if you want it, you can do
>it :-)
>
>Regards.
>
>El 18 Sep 2003 a las 18:45, Nicholas Esborn escribió:
>
> > Bobby,
> >
> > It is my impression from reading the documentation that this option turns
> > the ENTIRE message into text.  I'm looking for a way to simply remove the
> > specific, risky tags but leave the rest of the HTML intact.
> >
> > -nick
> >
> > On Thu, Sep 18, 2003 at 02:40:03PM -0700, Rose, Bobby wrote:
> > > 4.23.11 has a convert html to text option.  Have you tried it?
> > >
> > > -----Original Message-----
> > > From: Furnish, Trever G [mailto:TGFurnish at HERFF-JONES.COM]
> > > Sent: Thursday, September 18, 2003 4:41 PM
> > > To: MAILSCANNER at JISCMAIL.AC.UK
> > > Subject: Re: Feature request: strip specific HTML tags {Scanned by
> > HJMS}
> > >
> > >
> > > Ditto.
> > >
> > > > -----Original Message-----
> > > > From: Nicholas Esborn [mailto:nicholas_esborn at AFFYMETRIX.COM]
> > > > Sent: Thursday, September 18, 2003 3:21 PM
> > > > To: MAILSCANNER at JISCMAIL.AC.UK
> > > > Subject: Feature request: strip specific HTML tags {Scanned by HJMS}
> > > >
> > > >
> > > > Hello,
> > > >
> > > > One of my users did not receive a daily headlines message from the
> > New
> > >
> > > > York Times, because the message contained IFrame tags.  I have since
> >
> > > > added the Times' sender address to a ruleset controlling the Allow
> > > > IFrame Tags parameter.
> > > >
> > > > However, I would prefer to be able to simply strip IFrame tags and
> > > > their contents from the HTML, rather than the current options of
> > > > quarantining the whole message, converting to text, or passing based
> >
> > > > on rules.
> > > >
> > > > Is this likely to happen?
> > > >
> > > > Thanks
> > > >
> > > > -nick
>
>--
>Mariano Absatz
>El Baby
>----------------------------------------------------------
>"An idiot with a computer is a faster, better idiot"
>                                   -- Rich Julius

-- 
Julian Field
www.MailScanner.info
MailScanner thanks transtec Computers for their support




More information about the MailScanner mailing list