Re: To Hash or to Array--Uniqueness is the question.


XP is just a number
	PerlMonks

Re: To Hash or to Array--Uniqueness is the question.

by Ryszard (Priest)

on Dec 02, 2005 at 08:28 UTC ( [id://513532]=note: print w/replies, xml )

Need Help??

in reply to To Hash or to Array--Uniqueness is the question.

Warning, untested code:

my %stathash;
while (<FH>) {
    $stathash{$_}++;
}
[download]

Has the extra advantage of counting the number of hits for each unique value.

You can then do some grooy stuff, like pulling out records which occur n times, records which appear in one set and not another (if you use two hashes, two datasets), or records which appear in both sets, (again,if you use two hashes, two datasets)

I regularly do this with sets of about 500k records to determine where my data integrity issues lie, its pretty damn fast.

Comment on Re: To Hash or to Array--Uniqueness is the question. Download Code

In Section Seekers of Perl Wisdom

Domain Nodelet^?

www.com | www.net | www.org

Node Status^?

node history
Node Type: note [id://513532]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others avoiding work at the Monastery: (3)

As of 2024-04-19 21:59 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Today I Learned

Voting Booth^?

No recent polls found