When you have a reasonable idea of what to expect a linear regexp can often be much simpler than a complex nested recursive processing module.
Unfortunately I've been working with a bunch of buffoons lately who couldn't even perform a simple slurp without importing some module from CPAN. And now we have a bazillion CPAN modules that have to be custom compiled into packages for distribution onto our end systems when some simple code could have alleviated much extra effort for sys admins.
Blindly relying on CPAN to solve all problems is sheer stupidity. Better to understand the problem first. Why not ask the author of the post whether their divs could ever be nested (e.g. is their source random pages from the internet) etc?