I've tried to look at the raw nodes with dumper, and I cant make any sense of it. The document is very complex (see http://www.threatexpert.com/report.aspx?md5=2aafcad88572d98c154ab7d80cbafc02)
and as I mentioned, I have zero understanding of xpath. I looked at as_text, but the problem is, I just don't understand xpath format at all, to even attempt to scope my node elements to just that one section I mentioned. If I understood how the nodes were built, I think I could be ok, but to be honest, I just totally don't get this at all. When I do '//tr/td', I get _all_ of the td elements in one giant array, instead of just narrowing the damn thing to the one section I tried to match against in my post. :( | [reply] |
I recommend that you learn XPath.
There are also browser plugins that show you the XPath to a node if you click on its HTML element.
If XPath feels too complex for you to tackle but HTML / CSS selectors feel more accessible to you, you can easily convert most CSS selectors to XPath by using HTML::Selector::XPath.
| [reply] |