note
allolex
<p>
Hi Eric. You might consider looking into Andrei Mikheev's article on text segmentation in <em>Handbook of Computational Linguistics</em> and the chapter on parsing in the same book.</p>
<p>If you can give me some concrete examples of what you are looking to do, I might be able to scare up some info for you. I have to say that regular expressions are often not the best way to deal with linguistic data. Perl is also a bit slow for heavy parsing and segmenting -- especially if you use [cpan://Parse::RecDescent] ;) -- but it's definitely a good place to start.
</p>
<code>
@INBOOK{mikheev2002text,
chapter = {10},
pages = {201-218},
title = {Text Segmentation},
publisher = {Oxford University Press},
year = {2002},
editor = {Ruslan Mitkov},
author = {Andrei Mikheev},
address = {Oxford},
}
@BOOK{mitkov2002handbook,
title = {Handbook of Computational Linguistics},
publisher = {Oxford University Press},
year = {2002},
editor = {Ruslan Mitkov},
}
</code>
<div class="pmsig"><div class="pmsig-211693">
<p>
--<br />
Damon Allen Davison<br />
<a href="http://www.allolex.net">http://www.allolex.net</a>
</p>
</div></div>
399831
399831