Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

Re: Best module to scrape tabular data fram web pages?

by mojotoad (Monsignor)
on Mar 10, 2006 at 17:25 UTC ( #535758=note: print w/replies, xml ) Need Help??


in reply to Best module to scrape tabular data fram web pages?

For your future projects, do consider HTML::TableExtract.

use HTML::TableExtract; my $te = HTML::TableExtract->new; $te->parse(join('', <>)); foreach my $row ($te->first_table_found->rows) { print join(':', @$row), "\n"; }

In reality, given the entire HTML document, you'd probably need to specify a depth/count in the constructor for H::TE.

Cheers,
Matt

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://535758]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (6)
As of 2020-07-06 13:11 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?