For the bias, something you might consider is this:

  • Sort the files from newest to oldest
  • Generate a random number between 0 and 1
  • Feed that number to the inverse of a cumulative distribution function (CDF) that is bounded on the interval 0 to 1
  • Scale the result to the length of your list
  • Pick a file using the scaled result, truncated to an integer, as the index

If you pick a distribution that is weighted towards 0, you'll wind up picking newer files. Note: this isn't technically weighting by time -- it's biasing towards certain array slots, irrespective of whether those slots are close in access time or far apart. However, that may be sufficient for your particular application.
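As a rough sketch of those steps applied to real files, something like the following might do. The helper names (files_newest_first, inv_cdf, pick_index) and the shape parameter 4 are made up for illustration, and the inverse CDF here is a simple stand-in, F(x) = 1 - (1 - x)**k, chosen only because it is trivial to invert; any bounded distribution weighted towards 0 would serve.

```perl
use strict;
use warnings;

# Hypothetical helper: return the plain files in $dir, newest first.
# (stat)[9] is the modification time; use index 8 for access time.
sub files_newest_first {
    my ($dir) = @_;
    opendir( my $dh, $dir ) or die "Cannot open $dir: $!";
    my @files = grep { -f } map { "$dir/$_" } readdir $dh;
    closedir $dh;
    my %mtime = map { $_ => ( ( stat $_ )[9] // 0 ) } @files;
    return sort { $mtime{$b} <=> $mtime{$a} } @files;
}

# Illustrative inverse CDF (an assumption, not a recommendation):
# F(x) = 1 - (1 - x)**k on [0, 1]. With k > 1 most of the
# probability sits near 0, i.e. near the newest files.
sub inv_cdf {
    my ( $u, $k ) = @_;
    return 1 - ( 1 - $u )**( 1 / $k );
}

# Scale the inverse to the list length and truncate to an index.
sub pick_index {
    my ( $count, $k ) = @_;
    return int( inv_cdf( rand(), $k ) * $count );
}

my $dir   = @ARGV ? $ARGV[0] : '.';
my @files = files_newest_first($dir);
print $files[ pick_index( scalar @files, 4 ) ], "\n" if @files;
```

Since rand() never returns 1, the index always stays below the list length, so no bounds check is needed.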

A good distribution for this may be the Kumaraswamy, which is bounded between 0 and 1 and has a closed-form CDF that is easy to invert. By changing its two shape parameters, you'll get different shapes, including ones that bias towards 0. (You'll have to try graphing some PDFs and see what you like.)
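For reference, these are the standard Kumaraswamy formulas with shape parameters a > 0 and b > 0 (the symbols a, b and u are just notation chosen here). Solving the CDF for x gives the inverse that you feed the uniform random number u into:

```latex
\begin{align*}
f(x) &= a\,b\,x^{a-1}\,(1 - x^{a})^{b-1}, \qquad 0 \le x \le 1 \\
F(x) &= 1 - (1 - x^{a})^{b} \\
u = F(x) \;\Longrightarrow\; (1 - x^{a})^{b} &= 1 - u \\
x = F^{-1}(u) &= \bigl(1 - (1 - u)^{1/b}\bigr)^{1/a}
\end{align*}
```

One thing worth knowing when choosing parameters: with a <= 1 and b >= 1 the density is non-increasing, so slot 0 is the most likely pick. With a > 1 and b > 1 the density instead peaks at x = ((a - 1)/(ab - 1))^(1/a), so the most likely picks sit a little way into the list rather than at the very newest file.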

Here's an example of how it could be used to bias in the way I described:

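    use strict;
    use warnings;

    # Kumaraswamy shape parameters; these values bias towards 0
    my $param_a = 1.5;
    my $param_b = 6;
    my @array   = ( 1 .. 100 );

    # Inverse of the Kumaraswamy CDF: maps a uniform $F in [0,1)
    # to a value in [0,1)
    sub invK {
        my ($F, $Ka, $Kb) = @_;
        return ( 1 - ( 1 - $F )**( 1 / $Kb ) )**( 1 / $Ka );
    }

    for ( 1 .. 20 ) {
        # @array in numeric context is the list length
        my $pick = int( invK( rand(), $param_a, $param_b ) * @array );
        print "$pick\n";    # prints the index, not the element
    }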

A test run gave this: 7 11 12 26 18 27 10 30 6 3 28 2 35 7 29 40 26 15 3 44
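Twenty picks don't show the shape very well. If you want to see how a given pair of parameters behaves before committing to it, a crude text histogram may be enough. This is only a sketch: bucket_counts is a made-up helper, and the 10,000 draws and ten buckets are arbitrary choices.

```perl
use strict;
use warnings;

# Same inverse CDF as in the example above.
sub invK {
    my ($F, $Ka, $Kb) = @_;
    return ( 1 - ( 1 - $F )**( 1 / $Kb ) )**( 1 / $Ka );
}

# Draw $draws indices into a list of $size slots and count how many
# land in each of $buckets equal-width ranges of the index space.
sub bucket_counts {
    my ( $draws, $size, $buckets, $Ka, $Kb ) = @_;
    my @count = (0) x $buckets;
    for ( 1 .. $draws ) {
        my $pick = int( invK( rand(), $Ka, $Kb ) * $size );
        $count[ int( $pick * $buckets / $size ) ]++;
    }
    return @count;
}

# One row per block of ten slots; one '#' per 100 picks.
my @count = bucket_counts( 10_000, 100, 10, 1.5, 6 );
for my $i ( 0 .. $#count ) {
    printf "%2d-%2d %5d %s\n", $i * 10, $i * 10 + 9, $count[$i],
        '#' x int( $count[$i] / 100 );
}
```

Rerun it with different values in place of 1.5 and 6 until the bars look the way you want your file picks to be spread.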

-xdg

Code written by xdg and posted on PerlMonks is public domain. It is provided as is with no warranties, express or implied, of any kind. Posted code may not have been tested. Use of posted code is at your own risk.


In reply to Re: Opening random files (with bias) based on File::Stat information. by xdg
in thread Opening random files (with bias) based on File::Stat information. by Cap'n Steve
