http://qs321.pair.com?node_id=1230795

Boberts has asked for the wisdom of the Perl Monks concerning the following question:

Hey, I'm looking for a regex (at least, I think a regex is the most likely tool) to help clean up some text.

Strings that (i) consist only of ASCII letters, ASCII digits, apostrophes, hyphens and underscores; (ii) start with a letter, or with an apostrophe followed by a letter; (iii) do not contain a sequence of two or more apostrophes or hyphens; and (iv) end with (a) a letter, (b) a digit, or (c) an apostrophe preceded by the letter s, should be considered to be words.
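
For illustration, the four rules can be sketched as a single pattern. This is only one possible reading, written in Python so it can be run as-is; the names WORD_RE and is_word are made up for the example. Two assumptions are baked in: rule (iii) is read as "no run of two or more characters drawn from apostrophe and hyphen, mixed or not", and "the letter s" in rule (iv) is read as lowercase s only. The pattern uses only lookahead, fixed-width lookbehind and non-capturing groups, all of which Perl also supports, so it should carry over to a Perl match anchored with \A and \z.

```python
import re

# Hypothetical names; a sketch of the four rules, not a definitive answer.
WORD_RE = re.compile(
    r"""
    (?!.*['-]{2})              # (iii) assumed: no run of 2+ apostrophes/hyphens, mixed or not
    (?='?[A-Za-z])             # (ii) starts with a letter, or an apostrophe then a letter
    [A-Za-z0-9'_-]*            # (i) only ASCII letters, digits, apostrophe, underscore, hyphen
    (?:[A-Za-z0-9]|(?<=s)')    # (iv) ends with a letter, a digit, or an apostrophe after s
    """,
    re.VERBOSE,
)

def is_word(text):
    """Return True if the whole string satisfies rules (i) to (iv)."""
    return WORD_RE.fullmatch(text) is not None

if __name__ == "__main__":
    for candidate in ["don't", "'tis", "dogs'", "dog'", "a--b", "9lives", "end_"]:
        print(candidate, is_word(candidate))
```

If rule (iii) is meant to forbid only doubled apostrophes or doubled hyphens, the first lookahead could be changed to (?!.*(?:''|--)) instead.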

Any help is appreciated.