Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re: Removing Javascript

by domm (Chaplain)
on Jan 04, 2003 at 13:00 UTC ( [id://224247]=note: print w/replies, xml ) Need Help??


in reply to Removing Javascript

Jet another way, using HTML::Tree:

Please note that this doesn't handle the unescaped closing script tag in document.write. I'd suggest running tidy on the input before passing it to the parser.

#!/usr/bin/perl -w use strict; use HTML::Tree; my $doc = <<EOF; <html> <head> <script language="Javascript"> document.write("Don't forget your &lt;/script&gt; tag! It's important! +"); </script> </head> <body> This is just some text. </body> </html> EOF my $root=HTML::TreeBuilder->new(); $root->parse($doc); $root->eof; foreach my $n ($root->descendants) { next unless $n->tag; # skip text nodes $n->delete if $n->tag eq 'script'; } print $root->dump; # prints structure print $root->as_HTML # prints as HTML
-- #!/usr/bin/perl for(ref bless{},just'another'perl'hacker){s-:+-$"-g&&print$_.$/}

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://224247]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others wandering the Monastery: (8)
As of 2024-03-28 12:35 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found