http://qs321.pair.com?node_id=1202118


in reply to Re: Batch remove "404 Not Found" URLs
in thread Batch remove URLs

Thanks Ken!
Here's what I've been using:
find . -type f -name "*.htm" -print|xargs perl -i -pe 's/http:\/\/example\.com\/[404 Not Found]/g'

I'm afraid I haven't described what I'm trying to accomplish very well, sorry.
1.) I have a list of 300 URLs
2.) I have a folder on my desktop with 100 .htm pages
3.) I want to run that list against those 100 pages and remove URLs
4.) This will leave the <a href tags in place with the text [404 Not Found] (instead of the URL - for example, <a href="[404 Not Found]">[404 Not Found]</a>).

My plan then (since some of her links have descriptive text and others link text), was/is to render those dummy tags in the HTML inactive by doing another find/replace and leaving just <a>[404 Not Found]</a> to display 404 Not Found or the link's descriptive text in the browser.

Thanks again Ken - I'll check out the perlrun link