Just another Perl shrine | |
PerlMonks |
comment on |
( [id://3333]=superdoc: print w/replies, xml ) | Need Help?? |
I've got to second the vote for HTML::Parser or similar parsing engines.
A long time ago, before RSS feeds, I wrote a program to parse various newspaper websites and did the regexes by hand. I had 24 different rules for 90+ papers. When I rewrote it, I got it down to 9 rules, mainly based on web page design, since I used a parsing engine. You're going to save yourself a ton of work since if the data changes you're going to have to rewrite your regexes each time.
To disagree, one doesn't have to be disagreeable - Barry Goldwater
In reply to Re^2: Spliting a delimited string into variables
by Popcorn Dave
|
|