unsorted list

thekestrel has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: unsorted list by sgifford (Prior) on Apr 24, 2005 at 07:18 UTC
The technique you're looking for is called a shuffle. It can be done very quickly (linear time IIRC) using standard algorithms. You can try Algorithm::Numerical::Shuffle or List::Util's `shuffle` function. If your data is very large, consider sorting a list of numbers corresponding to array indices, then using those numbers as array subscripts. It will avoid making a copy of your data.	[reply] [d/l]
Re: unsorted list by eibwen (Friar) on Apr 24, 2005 at 07:29 UTC
There's always the Fisher-Yates shuffle in perlfaq4 or the pleac version: `sub fisher_yates_shuffle { my $array = shift; my $i; for ($i = @$array; --$i; ) { my $j = int rand ($i+1); next if $i == $j; @$array[$i,$j] = @$array[$j,$i]; } } fisher_yates_shuffle( \@array );` [download]	[reply] [d/l]
Re^2: unsorted list by thekestrel (Friar) on Apr 25, 2005 at 01:45 UTC
Thanks eibwen, This is the method that I ended up going with. Nice and clean and didn't require another module. Its working a treat =). Regards Paul.	[reply]
Re: unsorted list by rg0now (Chaplain) on Apr 24, 2005 at 10:23 UTC
And the obligatory note that Super Search is your best friend here, especially that recently this problem has come up quite frequently: The discussion ensued from a question: A bad shuffle, then it rolled on: Is this a fair shuffle?, ... and on: Functional shuffle, ... and on: Yet Another Fair Shuffle Node. rg0now	[reply]
Re^2: unsorted list by thekestrel (Friar) on Apr 24, 2005 at 17:03 UTC
rg0now, Thanks, its usually the first port of call... the keyword though is 'shuffle', I kept thinking 'randomly sorted list', I didn't realize shuffle was the standard term for this =P Regards Paul	[reply]
Re: unsorted list by dbwiz (Curate) on Apr 24, 2005 at 07:04 UTC
You can use rand within the sort code. `@stuff = qw[ cat dog pig cow ]; print "$_\n" for sort { rand() <=> rand() } @stuff` [download] Update. I have the (theoretical) feeling that this trick may cause an infinite loop when used with a large array, but a few tests I did returned nothing suspicious. HTH	[reply] [d/l]
Re^2: unsorted list by merlyn (Sage) on Apr 24, 2005 at 13:32 UTC
In older Perl versions, inconsistent results from the sort block could produce core dumps or duplicated values. I don't believe modern Perl ever does that, but it's still not a way to get a fair sort. -- Randal L. Schwartz, Perl hacker Be sure to read my standard disclaimer if this is a reply.	[reply]
Re^2: unsorted list by Roy Johnson (Monsignor) on Apr 24, 2005 at 12:55 UTC
No infinite loop, but Is this a fair shuffle? Caution: Contents may have been coded under pressure.	[reply]
Re: unsorted list by holli (Abbot) on Apr 24, 2005 at 06:26 UTC
I was thinking of having either a hash or array and picking a value from the list then removing that selection and repeating until the list was empty each iteration I think that is the exactly the way to go. To add some syntactic sugar (and to not repeat the logic for every loop) you could use a tied array class. holli, /regexed monk/	[reply] [d/l]
Re: unsorted list by TedPride (Priest) on Apr 24, 2005 at 07:39 UTC
EDIT: My bad. I mixed up `@$array[$i,$j]` with `@$array[$i..$j]` and misread your code. Yes, your code is functionally pretty much the same as mine, and yes, I did code a Fischer (sp?)-Yates shuffle. I'd -- myself, but it doesn't let me :) ------------------------------- The problem with that shuffle is it requires moving chunks of the array, quite inefficient with large arrays. A better linear shuffle is as follows: `use strict; use warnings; my ($n, $t); my @arr = (0,1,2,3,4,5,6,7,8,9); for (0..($#arr-1)) { $n = int (rand() * ($#arr - $_ + 1)) + $_; $t = $arr[$n]; $arr[$n] = $arr[$_]; $arr[$_] = $t; }` [download] Which is equivalent to: `For all items except last Pick random item between current item and last Swap that item with current item` [download] I have an expanded version (not included here) which tests this with a large number of iterations and then counts how many of each number ends up in each slot, for purposes of analyzing the randomness of the sort. The results were within a percent or two of perfectly random.	[reply] [d/l] [select]
Re^2: unsorted list by eibwen (Friar) on Apr 24, 2005 at 08:35 UTC
As far as I can tell, your code is a "recoding" of the Fisher-Yates shuffle: `sub fisher_yates_shuffle { my $array = shift; my $i; for ($i = @$array; --$i; ) { my $j = int rand ($i+1); next if $i == $j; @$array[$i,$j] = @$array[$j,$i]; } }` [download] And permuting your version: `for (0..($#arr-1)) { $n = int (rand() * ($#arr - $_ + 1)) + $_; $t = $arr[$n]; $arr[$n] = $arr[$_]; $arr[$_] = $t; }` [download] `for (0..($#arr-1)) { $n = int (rand() * ($#arr - $_ + 1)) + $_;` [download] `$t = $arr[$n]; $arr[$n] = $arr[$_]; $arr[$_] = $t;` [download] `}` [download] `for (0..($#arr-1)) { $n = int (rand() * ($#arr - $_ + 1)) + $_;` [download] `@$arr[$_,$n] = @$arr[$n,$_];` [download] `}` [download] `for (0..($#arr-1)) { $n = int (rand() * ($#arr - $_ + 1)) + $_;` [download] `# next if $_ == $n;` [download] `@$arr[$_,$n] = @$arr[$n,$_]; }` [download] The remaining two lines are functionally equivalent: `for (0..($#arr-1)) { # Your Version $n = int (rand() * ($#arr - $_ + 1)) + $_; for ($i = @$array; --$i; ) { # Fisher-Yates my $j = int rand ($i+1);` [download] Frankly, I prefer your formulation (with the addition of the `next if` line) as it doesn't use an incomplete `for` loop, but the codes are equivalent. UPDATE: For the benefit of the astute who realized that the remaining two lines weren't entirely equivalent: `for (0..($#arr-1)) { $n = int (` [download] `rand() * ($#arr - $_ + 1)` [download] `) + $_; # next if $_ == $n; @$arr[$_,$n] = @$arr[$n,$_]; }` [download] `for (0..($#arr-1)) { $n = int (` [download] `rand($#arr - $_ + 1)` [download] `) + $_; # next if $_ == $n; @$arr[$_,$n] = @$arr[$n,$_]; }` [download] `for (0..($#arr-1)) { $n = int (rand($#arr - $_ + 1))` [download] `+ $_` [download] `; # next if $_ == $n; @$arr[$_,$n] = @$arr[$n,$_]; }` [download] Fisher-Yates traverses down the array, whereas this version traverses up the array (due to the appended `+ $_`). Remove the addition and the codes are equivalent; leave it in and the codes are functionally equivalent.	[reply] [d/l] [select]
Re: unsorted list by cog (Parson) on Apr 24, 2005 at 12:48 UTC
While everybody seems to be pointing to shuffling the list, how about randomizing the next element you're getting? Something in the lines of (untested): `$stuff = [ 'cat', 'dog', 'pig', 'cow' ]; my %stuff map { $_ => 1 } @$stuff; my @elements = keys $stuff; for ($elements[int rand scalar @elements]) { $stuff{$_} = 0; @elements = grep $_, keys %stuff; # your code here, $_ being either a cat, a dog, etc. } for (keys %stuff) { $stuff{$_} = 1 }` [download] Do notice: I just got up! :-) If it doesn't work, that why :-) And I'm pretty sure there are better ways of doing it :-)	[reply] [d/l]
Re: unsorted list by kwaping (Priest) on Apr 25, 2005 at 21:59 UTC
This may not be the fanciest solution, but it seems to work for me: `my $number_of_elements = 5; # arbitrary, can be anything my @order = (); my @base = (0 .. $number_of_elements - 1); srand; push @order, splice(@base, rand @base, 1) while (@base);` [download]	[reply] [d/l]


go ahead... be a heretic
	PerlMonks