in reply to Re: Parsing a Bibliography 2007
in thread Parsing a Bibliography 2007

I agree with this approach. Assuming there is no way to definitively pin down every distinct format up front, you'll end up iteratively building regexen and re-parsing your citations until 'enough' of them are recognized and the remaining unparsed subset is acceptably small. Do you anticipate this being a one-time effort, or will you regularly be handling new citation forms?
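For what it's worth, the iterate-until-enough loop might look roughly like the sketch below. It is in Python only for illustration, and the same shape carries over to Perl. The two patterns, the sample citation layouts, and the names `PATTERNS`, `parse_all` and `coverage` are all made up for the example; your real formats will certainly differ.

```python
import re

# Hypothetical patterns, one per citation format seen so far.
# Add a new entry each time the leftover pile reveals another format.
PATTERNS = [
    # Assumed layout: "Author (Year). Title."
    re.compile(r"^(?P<author>[^()]+?)\s+\((?P<year>\d{4})\)\.\s+(?P<title>.+?)\.$"),
    # Assumed layout: "Author, Title, Year."
    re.compile(r"^(?P<author>[^,]+),\s+(?P<title>.+),\s+(?P<year>\d{4})\.$"),
]


def parse_all(citations, patterns=PATTERNS):
    """Try each pattern in order; return (parsed records, unparsed lines)."""
    parsed, unparsed = [], []
    for line in citations:
        text = line.strip()
        for pattern in patterns:
            match = pattern.match(text)
            if match:
                parsed.append(match.groupdict())
                break
        else:
            # No pattern matched: keep the line for manual review.
            unparsed.append(text)
    return parsed, unparsed


def coverage(parsed, unparsed):
    """Fraction of citations recognized by the current pattern set."""
    total = len(parsed) + len(unparsed)
    return len(parsed) / total if total else 1.0
```

Each pass, you would eyeball the `unparsed` list, write a pattern for the most common leftover format, and rerun until `coverage` is high enough for your purposes. If new citation forms keep arriving, keeping the patterns in one list like this at least makes each addition a one-line change.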