PerlMonks  

Re: question about lookaheads and threatexpert/html parsing

by Anonymous Monk
on Mar 23, 2016 at 21:50 UTC (#1158658)


in reply to question about lookaheads and threatexpert/html parsing

$ cat junk.html
<ul><li>The following Host Names were requested from a host database:</li>
<ul>
<li>192.5.5.241</li>
. . .
</ul></ul>

$ cat jonk.xsh
open --format html "junk.html";
# ls --indent /;
for //ul {
    pwd;
    for ./li {
        pwd;
        print text();
    };
    echo;
};
echo;

$ xsh -q jonk.xsh
/html/body/ul
/html/body/ul/li
The following Host Names were requested from a host database:
/html/body/ul/ul
/html/body/ul/ul/li
192.5.5.241
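
For anyone without xsh at hand, the same walk (visit each ul, then each li under it, printing the path and the text) can be roughly sketched in Python with only the standard library. This is not XSH2 and not what the script above does internally; it is an illustration under assumptions. The names ListWalker, walk and SAMPLE are made up for this sketch, the /html/body prefix is hard-coded to mirror the paths xsh printed, and the input is assumed to be a fragment without its own html or body tags.

```python
from html.parser import HTMLParser

# Sample input modelled on junk.html above (the ". . ." line is left out).
SAMPLE = """<ul><li>The following Host Names were requested from a host database:</li>
<ul>
<li>192.5.5.241</li>
</ul></ul>"""


class ListWalker(HTMLParser):
    """Hypothetical helper: record a path and the text for every <li>."""

    def __init__(self):
        super().__init__()
        # Assumption: pretend the fragment sits under /html/body,
        # as in the paths printed by xsh.
        self.stack = ["html", "body"]
        self.items = []  # each entry is [path, accumulated text]

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)
        if tag == "li":
            self.items.append(["/" + "/".join(self.stack), ""])

    def handle_endtag(self, tag):
        # Only unwind if the tag was opened inside the fragment.
        if tag in self.stack[2:]:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        # Keep text only while directly inside an <li>.
        if self.items and self.stack[-1] == "li":
            self.items[-1][1] += data


def walk(html):
    """Return a list of (path, text) tuples, one per <li>."""
    parser = ListWalker()
    parser.feed(html)
    parser.close()
    return [(path, text.strip()) for path, text in parser.items]


for path, text in walk(SAMPLE):
    print(path)
    print(text)
```

Unlike a real XPath engine, this does not number sibling elements or repair broken markup, so for anything beyond a quick look the XPath tools mentioned in this thread are the better choice.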

See also xpather.pl / htmltreexpather.pl, which can give you paths to start with, and all the links in Re: Retrieve select information from HTML; they are examples (for tree-xpath and others), walkthroughs, and tutorials. For the open command used above, see XML::XSH2: https://metacpan.org/pod/distribution/XML-XSH2/XSH2.pod#open

Replies are listed 'Best First'.
Re^2: question about lookaheads and threatexpert/html parsing
by Anonymous Monk on Mar 23, 2016 at 22:07 UTC
    Thanks! I will look at this tomorrow when I get back to work!
