http://qs321.pair.com?node_id=898183


in reply to Re: Spliting a delimited string into variables
in thread Spliting a delimited string into variables

I've got to second the vote for HTML::Parser or similar parsing engines.

A long time ago, before RSS feeds, I wrote a program to parse various newspaper websites and did the regexes by hand. I had 24 different rules for 90+ papers. When I rewrote it, I got it down to 9 rules, mainly based on web page design, since I used a parsing engine.

You're going to save yourself a ton of work since if the data changes you're going to have to rewrite your regexes each time.


To disagree, one doesn't have to be disagreeable - Barry Goldwater

  • Comment on Re^2: Spliting a delimited string into variables