http://qs321.pair.com?node_id=1158664


in reply to question about lookaheads and threatexpert/html parsing

This is how you might do it with HTML::TreeBuilder::XPath:
use Data::Dumper; use HTML::TreeBuilder::XPath; my $html = q| <ul><li>The following Host Names were requested from a host database:< +/li> <ul> <li>192.5.5.241</li> <li>192.5.5.242</li> </ul></ul> |; my $tree = HTML::TreeBuilder::XPath->new; $tree->parse($html); $tree->eof; my @wanted; my @nodes = $tree->findnodes('//ul'); for my $node ( @nodes ) { my $text = $node->findvalue('li') or next; $text =~ m/^The following Host Name/ or next; @wanted = $node->findvalues('ul/li'); last; } print Dumper \@wanted;
Output:
$VAR1 = [ '192.5.5.241', '192.5.5.242' ];