In my example, output is in the form of a perl hash(ref) structure.
This lets you load it easily in another program using
do.
my %seen=();
while(<>)
{
chomp;
foreach my $word ( grep /\w/, split )
{
$word =~ s/[. ,]*$//; # strip off punctuation, etc.
$seen{$word}++;
}
}
use Data::Dumper;
$Data::Dumper::Terse = 1;
print Dumper \%seen;
For example, given an input file containing:
Click on a letter above to see phrasal verbs beginning with that lette
+r.
You will get a list of phrasal verbs and their definitions. If you the
+n
click on an individual verb, you will get more information on it,
including example sentences, whether it is British or American English
+,
and whether it is separable or not.
Output looks like:
{
'you' => 2,
'a' => 2,
'not' => 1,
'that' => 1,
'sentences' => 1,
'individual' => 1,
'see' => 1,
'on' => 3,
'American' => 1,
'or' => 2,
'verb' => 1,
'Click' => 1,
'list' => 1,
'English' => 1,
'letter' => 2,
'their' => 1,
'whether' => 2,
'with' => 1,
'and' => 2,
'verbs' => 2,
'of' => 1,
'is' => 2,
'definitions' => 1,
'to' => 1,
'above' => 1,
'will' => 2,
'If' => 1,
'get' => 2,
'including' => 1,
'beginning' => 1,
'it' => 3,
'example' => 1,
'information' => 1,
'separable' => 1,
'British' => 1,
'click' => 1,
'phrasal' => 2,
'then' => 1,
'You' => 1,
'more' => 1,
'an' => 1
}