Beefy Boxes and Bandwidth Generously Provided by pair Networks
"be consistent"
 
PerlMonks  

Re^3: Avoiding escaped child elements with HTML::TreeBuilder::XPath or HTML::Element

by haukex (Archbishop)
on Nov 15, 2021 at 12:50 UTC ( [id://11138833]=note: print w/replies, xml ) Need Help??


in reply to Re^2: Avoiding escaped child elements with HTML::TreeBuilder::XPath or HTML::Element
in thread Avoiding escaped child elements with HTML::TreeBuilder::XPath or HTML::Element

However, I am not sure how to apply a substitution, s///, to an element containing more that just text.

The documentation of HTML::Element's content_refs_list gives an example of how to modify text nodes contained in an element and the documentation of HTML::Element::traverse shows how to use a recursive function to walk the tree. Putting those together:

sub html_trim { my $elem = shift; for my $itemref ($elem->content_refs_list) { if ( ref $$itemref ) { html_trim($$itemref) } # remove this for non-recursive else { $$itemref =~ s/^\s+|\s+$//g } } } for my $elem ($xhtml->findnodes('//div/ul/li')) { html_trim($elem) }

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11138833]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others surveying the Monastery: (4)
As of 2024-04-16 15:49 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found