Re: XML parsing vs Regular expressions

by Cody Pendant (Prior)
on Feb 17, 2008 at 05:23 UTC

in reply to XML parsing vs Regular expressions

The reasons are the same as for HTML parsing, really. Your regular expression will probably do what you want at first, so you'll come to trust it, and then it will come back to bite you when it meets a case you didn't expect.

Does it:

  • ignore markup which is commented out?
  • allow for attribute order changing?
  • cope with the characters < and > appearing inside attributes, or CDATA sections?

There are probably a hundred more things you'd have to think of to make your regular expression solution bullet-proof, by which time you might as well have written your own XML parser.
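
To make the three questions above concrete, here is a minimal sketch. It is in Python rather than Perl only because everything it needs is in the standard library; the failure modes are the same for a Perl regex. The sample document, the pattern NAIVE, and the helper names regex_items and parsed_items are all made up for illustration and are not from the original question.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample data: one commented-out item, one item with its
# attributes in the other order, and one with '>' in an attribute value
# plus a CDATA section containing '<'.
XML = """<items>
  <!-- <item id="0" name="old">retired</item> -->
  <item id="1" name="first">one</item>
  <item name="second" id="2">two</item>
  <item id="3" name="a > b"><![CDATA[x < y]]></item>
</items>"""

# The kind of pattern that works on the first file you try it on:
# it assumes id comes before name and that '>' never appears in a value.
NAIVE = re.compile(r'<item id="(\d+)" name="([^">]*)">([^<]*)</item>')


def regex_items(text):
    """Return (id, name, text) tuples found by the naive pattern."""
    return [m.groups() for m in NAIVE.finditer(text)]


def parsed_items(text):
    """Return (id, name, text) tuples using a real XML parser."""
    root = ET.fromstring(text)
    return [(el.get("id"), el.get("name"), el.text) for el in root.iter("item")]


if __name__ == "__main__":
    # Picks up the commented-out item, misses items 2 and 3.
    print(regex_items(XML))
    # Skips the comment, finds all three live items.
    print(parsed_items(XML))
```

The regex quietly returns the retired item and drops two live ones, with no error to tell you anything went wrong. You could patch the pattern for each of these cases, but that is exactly the road to writing your own parser.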

Nobody says perl looks like line-noise any more
kids today don't know what line-noise IS ...

