use Text::CSV_XS 'csv';
my @words = map { $_->{Lemma} } @{
csv in => "freqrnc2011.csv", headers => "auto", sep_char => "\t"
};
Make sure to set an :encoding(...) PerlIO layer on your STDOUT when you work with (and print) wide characters.
I'll have to admit that wasn't able so far to read the .var files (which seem to contain the actual words mixed with binary data when read as CP-866) from the latter source without the use of Starling for DOS from the same website. We may have to contact the original author's son about the file format if you are interested in dictionaries from there. |