Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation
 
PerlMonks  

Re: Fetching Web Page and covert to text

by apl (Monsignor)
on Jul 16, 2009 at 14:04 UTC ( [id://780714]=note: print w/replies, xml ) Need Help??


in reply to Fetching Web Page and covert to text

Test all error returns (get, parse, open, etc.), die when they fail, and display the error-code returned.

Replies are listed 'Best First'.
Re^2: Fetching Web Page and covert to text
by Ankit.11nov (Acolyte) on Jul 17, 2009 at 05:33 UTC
    I have modified my code based on the inputs given.
    use strict; use warnings; use LWP::Simple; use HTML::TreeBuilder; use HTML::FormatText; print "Opening the URL"; my $URL=get("http://www.yahoo.com/") || die "Couldn't fetch page"; my $Format=HTML::FormatText->new; my $TreeBuilder=HTML::TreeBuilder->new; $TreeBuilder->parse($URL); my $Parsed=$Format->format($TreeBuilder); open FILE_OUT , '>D:\Profiles\in2228c\Desktop\info.txt' || die "No suc +h file"; print FILE_OUT "$Parsed"; close FILE_OUT; exit;

    But still I am getting the same error:
    Couldn't fetch page at 780696.pl line 9.
    Is there any other way of solving this problem?
      The write-up on LWP::Simple::get says:
      You will not be able to examine the response code or response headers (like 'Content-Type') when you are accessing the web using this function. If you need that information you should use the full OO interface (see LWP::UserAgent).
      So if I was you, I'd follow that suggestion.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://780714]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others learning in the Monastery: (2)
As of 2024-04-26 05:21 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found