Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re^12: Looking for a module that strips an HTML tag and its associated 'TEXT'

by nysus (Parson)
on Jul 29, 2020 at 15:42 UTC ( #11119999=note: print w/replies, xml ) Need Help??


in reply to Re^11: Looking for a module that strips an HTML tag and its associated 'TEXT'
in thread Looking for a module that strips an HTML tag and its associated 'TEXT'

Monstrous meaning "large and unwieldy". Yes, Mojo::DOM is well documented and no doubt a high-quality module, but there are so many method calls, it's difficult to sort through them. This slows you down.

I don't know if I said this was "something simple" but it is a problem that seems to common enough to have been solved my another module, especially in Perl that grew its legs during the dawn of the www. Pulling up a web page in a browser is an extraordinarily difficult task when you look under the hood but it should be simple to perform. I would have expected a basic task like this to be readily available in an off-the-shelf module. Indeed, there are existing modules but they were either a) buggy or b) didn't meet my specific needs.

$PM = "Perl Monk's";
$MCF = "Most Clueless Friar Abbot Bishop Pontiff Deacon Curate Priest Vicar";
$nysus = $PM . ' ' . $MCF;
Click here if you love Perl Monks

  • Comment on Re^12: Looking for a module that strips an HTML tag and its associated 'TEXT'

Replies are listed 'Best First'.
Re^13: Looking for a module that strips an HTML tag and its associated 'TEXT'
by marto (Cardinal) on Jul 29, 2020 at 16:02 UTC

    "Something simple" was a direct quote, but I think we are perhaps talking at cross purposes. I now believe (and correct me if I'm wrong) you mean that this is something seemingly so fundamental that it should exist as something anyone could call, without having to write the code you quickly put together to achieve this. Opposed to my reading that the task itself was simple. When I first encountered Mojo::DOM I recall just reading it from start to finish, and it really didn't take that long, and I don't consider myself a fast reader by any means. From time to time I find myself looking it up either to refresh my memory or to link to something for someone else.

    Yes there are many methods, that's part of the power of this and I'd say I've taken advantage of most, if not all of them. Sure at first I accept, "this slows you down", but maintain that this shouldn't be seen as a negative thing. After my short pointer, and your 20 minutes you now have the code you need, a basic understanding of how Mojo::DOM works and what it can do, and are considering publishing something to make this easier on yourself and others in future. Seems like a win-win to me. Perhaps sounding out the Mojo team as to weather you could simply provide a PR with a new method for Mojo::DOM to provide your desired functionality would be welcomed.

      Correct, that is what I meant by "something simple." I thought I would be able to find an off-the-shelf and that I would have it implemented in about 2 min. I am surprised that it turned into a much longer, windier road.

      A couple years back I got very familiar with something similar to Mojo::DOM, I think it was https://metacpan.org/pod/HTML::Element. Very useful. But I've forgotten almost all the fine details. This is probably another part of my reasoning comprehensive modules scare me off. But that said, I easily understand how to use Mojo::DOM as a result of learning HTML::Element. They are conceptually very similar.

      At any rate, thanks for your help. Appreciate it.

      $PM = "Perl Monk's";
      $MCF = "Most Clueless Friar Abbot Bishop Pontiff Deacon Curate Priest Vicar";
      $nysus = $PM . ' ' . $MCF;
      Click here if you love Perl Monks

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11119999]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others chanting in the Monastery: (4)
As of 2021-10-21 20:01 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    My first memorable Perl project was:







    Results (83 votes). Check out past polls.

    Notices?