Re: Regexp riddles


by broquaint (Abbot)

on Jul 17, 2003 at 12:42 UTC ( [id://275200]=note )

in reply to Regexp to extract HTML link data

Under the blind assumption that your data won't change much or turn 'faulty' (otherwise you'd be using a parser, right?), something like this ought to do:

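my $re = qr{
  # optional img tag, group 1 captures src
  (?: <img \s+ .*? src=" ([^"]+) " .*? > )?
  # required a tag, group 2 captures href
  <a \s+ .*? href=" ([^"]+) " .*? >
}x;

my $in =
  '<td><img src="foo.jpg"><a href="index3.html">New index</a></td>';

# In list context the match returns (img src, href); reverse puts href
# first, and grep drops the undef left when the optional <img> is absent.
my($href, $img) = grep defined, reverse $in =~ $re;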

print "href - $href\nimg  - $img\n";

$in = '<td><a href="index3.html">New index</a></td>';

($href, $img) = grep defined, reverse $in =~ $re;

print "href - $href\nimg  - $img\n";

__output__

href - index3.html
img  - foo.jpg
href - index3.html
img  -
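
The reverse plus grep defined step is compact but easy to misread, so here is a more explicit sketch of the same extraction. It assumes the same pattern and sample strings as above; the helper name link_parts and the '(none)' placeholder are made up for illustration, and the // operator needs perl 5.10 or later.

```perl
use strict;
use warnings;

# Same pattern as in the post: an optional <img> capture, then the <a> capture.
my $re = qr{
  (?: <img \s+ .*? src=" ([^"]+) " .*? > )?
  <a \s+ .*? href=" ([^"]+) " .*? >
}x;

# Hypothetical helper (not from the original post): returns (href, img),
# with img left undef when the link has no preceding image.
sub link_parts {
    my ($html) = @_;
    # A list-context match returns one value per capture group; the first
    # is undef when the optional <img> group did not take part in the match.
    my ($img, $href) = $html =~ $re
        or return;    # no match at all: return an empty list
    return ($href, $img);
}

for my $in (
    '<td><img src="foo.jpg"><a href="index3.html">New index</a></td>',
    '<td><a href="index3.html">New index</a></td>',
) {
    my ($href, $img) = link_parts($in);
    printf "href - %s\nimg  - %s\n", $href, $img // '(none)';
}
```

With the two sample strings this should report the same href and img values as the output above, with '(none)' standing in for the missing image instead of an undef warning.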

See perlre for more info.
HTH

_________ broquaint