Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl-Sensitive Sunglasses
 
PerlMonks  

Re: Parsing Cell Contents of Extracted HTML Tables

by epoptai (Curate)
on Jun 29, 2001 at 23:43 UTC ( [id://92785]=note: print w/replies, xml ) Need Help??


in reply to Parsing Cell Contents of Extracted HTML Tables

I can't solve your problem but can tell you that the 'decode' method only toggles the use of HTML::Entities. Look into the 'br_translate' method which translates <br> to \n to eliminate the strange concatenation.

Perhaps you could use the information extracted from the table to reparse the file for links and such.

--
Check out my Perlmonks Related Scripts like framechat, reputer, and xNN.

  • Comment on Re: Parsing Cell Contents of Extracted HTML Tables

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://92785]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others rifling through the Monastery: (2)
As of 2024-04-25 22:56 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found