
Re: multi line regex

by matija (Priest)
on Jan 09, 2006 at 14:54 UTC (#521954)

in reply to multi line regex

This is wrong in so many ways. First of all, you're parsing HTML with a regex. Don't do that. Use HTML::Parser instead.

Otherwise, there are just too many ways to get tripped up: tags with extra white space, tags with newlines, quotes missing or present in unexpected places, escaping of one thing or another, JavaScript code fooling you into thinking you're inside another tag when you really aren't, and so on.
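
To make the first two of those failure modes concrete, here is a minimal sketch. The pattern and the sample strings are made up for illustration and are not taken from the original question; the pattern simply assumes every cell is written exactly as <td>text</td>.

```perl
use strict;
use warnings;

# Hypothetical naive pattern: assumes every cell looks exactly like <td>text</td>.
my $naive = qr{<td>([^<]*)</td>};

# Made-up inputs that a browser would all treat as the same table cell.
my @samples = (
    '<td>house</td>',               # the one form the pattern expects
    '<td >house</td>',              # extra white space inside the tag
    "<td\nclass=\"w\">house</td>",  # newline and an attribute inside the tag
    '<TD>house</TD>',               # upper-case tag name
);

# Record 1 for each sample the pattern matches, 0 otherwise.
my @matched = map { /$naive/ ? 1 : 0 } @samples;
print join(' ', @matched), "\n";    # prints "1 0 0 0"
```

Only the first variant matches. You can keep patching the pattern for each new case, but a real parser already handles all of them.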

Second, you're trying to extract data from an HTML table using a regex. Don't do that. Use HTML::TableExtract instead. It will save you a LOT of hair-pulling.

Replies are listed 'Best First'.
Re^2: multi line regex
by metalfan (Novice) on Jan 18, 2006 at 17:51 UTC
    Looks good. Sorry for the question, but how can I use this to
    get the word in the first column?

    1.column | 2.column
    english word | german word

    thx for help
      Read the documentation for HTML::TableExtract. Once it has parsed the table, each row comes back as an array reference, and the first column is the first element of that array.
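
A rough sketch of what that could look like, assuming a table shaped like the one described above. The HTML and the sample words are invented for illustration, and the code assumes the first row is a header row that should be skipped; adjust both to match the real page.

```perl
use strict;
use warnings;
use HTML::TableExtract;

# Hypothetical input: a two-column table of English and German words.
my $html = <<'HTML';
<table>
  <tr><th>1.column</th><th>2.column</th></tr>
  <tr><td>house</td><td>Haus</td></tr>
  <tr><td>tree</td><td>Baum</td></tr>
</table>
HTML

# No constraints given, so every table in the document is considered.
my $te = HTML::TableExtract->new;
$te->parse($html);
$te->eof;

my @first_column;
for my $table ($te->tables) {
    my @rows = $table->rows;   # each row is an array reference of cell texts
    shift @rows;               # assumption: the first row holds the headers
    for my $row (@rows) {
        push @first_column, $row->[0];   # first element = first column
    }
}

print "$_\n" for @first_column;
```

If the page contains several tables, the constructor also accepts constraints such as headers to select the right one; see the module's documentation for the details.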
