Beefy Boxes and Bandwidth Generously Provided by pair Networks
"be consistent"
 
PerlMonks  

Re: Remove HTML tags from document

by daeve (Deacon)
on Aug 04, 2003 at 03:52 UTC ( [id://280553]=note: print w/replies, xml ) Need Help??


in reply to Remove HTML tags from document

And in the spirit of TIMTOWTDI...

If you just need to strip all the html tags from a page, and are on a platform with lynx, you can use:

#! /usr/bin/perl use strict; use warnings; my $text=`lynx -dump htmlDocument.html`; print "$text";

HTH
Daeve

Replies are listed 'Best First'.
Re: Re: Remove HTML tags from document
by matth (Monk) on Aug 04, 2003 at 08:36 UTC
    How can I get this to print out to a file instead of the STDOUT? I have very large HTML files.
      perldoc -f open perldoc -f print perldoc perlopentut

      Abigail

      A reply falls below the community's threshold of quality. You may see it by logging in.
A reply falls below the community's threshold of quality. You may see it by logging in.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://280553]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others scrutinizing the Monastery: (2)
As of 2024-04-19 01:01 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found