This is PerlMonks "Mobile"

Beefy Boxes and Bandwidth Generously Provided by pair Networks
"be consistent"
 
PerlMonks  


in reply to Re: How to parse not closed HTML tags that don't have any attributes?
in thread How to parse not closed HTML tags that don't have any attributes?

There are about 10,000 *.html files

As I hinted, make sure to get a representative sampling of all of this input for your test cases.

into a CSV file

Definitely use Text::CSV (also install Text::CSV_XS for speed).

  • Comment on Re^2: How to parse not closed HTML tags that don't have any attributes?