Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re: Regex help

by sauoq (Abbot)
on Oct 01, 2002 at 21:32 UTC ( [id://202121]=note: print w/replies, xml ) Need Help??


in reply to Regex help

If you are using this to strip potentially malicious code, you should be more liberal in what you match.

1. Use /i because tags can be upper, lower, or mixed cases.

<script>CODE</script> <SCRIPT>CODE</SCRIPT> <ScRiPt>CODE</ScRiPt>

2. Be careful of whitespace in tags.

<script >CODE</script> <script>CODE</script >

3. Be careful of what gets left behind after you strip it. (The following example is a good reason not to use a non-greedy match.)

<<script></script>script>CODE</script>

I'd use something like jeffa's and eliminate as much as possible. I don't see any immediate problems with this: s#<script.*script\s*>##gis; but I didn't test it very thoroughly and there may be some. You might consider substituting repeatedly until nothing matches in order be sure you've avoided the 3rd issue above but that may well be overkill.

-sauoq
"My two cents aren't worth a dime.";

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://202121]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others scrutinizing the Monastery: (3)
As of 2024-04-19 22:28 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found