note
aitap
<p>To read [http://dict.ruslang.ru/Freq2011.zip|freqrnc2011.csv] into a Perl data structure, one could use [mod://Text::CSV_XS], which is smart enough to auto-decode UTF-8 bytes into Perl wide characters by default:
<c>
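use Text::CSV_XS 'csv';
# csv() returns a reference to an array of hashes keyed by the header row.
# The file is tab-separated, so the default comma separator is overridden.
my @words = map { $_->{Lemma} } @{
    csv(in => "freqrnc2011.csv", headers => "auto", sep_char => "\t")
};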
</c>
Make sure to set an [doc://Encode#Encoding-via-PerlIO|:encoding(...) PerlIO layer] on your <c>STDOUT</c> when you work with (and <c>print</c>) wide characters.</p>
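<p>A minimal sketch of setting that layer, assuming your terminal expects UTF-8 (use a different encoding name if it doesn't). The three words are just stand-ins I made up for the <c>@words</c> list; <c>use utf8</c> is only needed here because the script itself contains non-ASCII literals:
<c>
use strict;
use warnings;
use utf8;    # this source file contains UTF-8 string literals

# Encode wide characters as UTF-8 on output; without this layer
# perl warns about "Wide character in print".
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical sample data standing in for the lemmas read from the CSV.
my @words = ('и', 'в', 'не');
print "$_\n" for @words;
</c>
The same layer can be applied to every standard handle at once with <c>use open qw(:std :encoding(UTF-8));</c> if you prefer.</p>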
<p>I have to admit that so far I haven't been able to read the <c>.var</c> files (which seem to contain the actual words mixed with binary data when read as <c>CP-866</c>) from the [https://starling.rinet.ru/downl.php?lan=en|latter source] without using Starling for DOS from the same website. If you are interested in the dictionaries from there, we may have to contact the original author's son about the file format.</p>
11125044
11125208