http://qs321.pair.com?node_id=1229934


in reply to Useful heuristics for analyzing arrays of data to determine column header

This is a very interesting endeavour! Here are my two cents:

Re^2: Useful heuristics for analyzing arrays of data to determine column header
by Laurent_R (Canon) on Feb 15, 2019 at 09:40 UTC
    Hi hdb,

    these are very interesting ideas, but I'm not really convinced by this one:

    If the first row has a number, it is not likely to be a header.
    The header could consist of years, month numbers, quarters, test IDs, etc., all of which look numeric.
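
To make the objection concrete, here is a minimal sketch of the "first row has a number" rule applied to two small tables. The helper names (`numeric_fraction`, `naive_is_header`) and the sample data are made up for illustration; this is not code from the thread, only one plausible reading of the heuristic.

```perl
use strict;
use warnings;
use Scalar::Util qw(looks_like_number);

# Hypothetical helper: fraction of cells in a row that look numeric.
sub numeric_fraction {
    my ($row) = @_;
    return 0 unless @$row;
    my $n = grep { defined($_) && looks_like_number($_) } @$row;
    return $n / @$row;
}

# Naive reading of the rule: any number in the first row means "not a header".
sub naive_is_header {
    my ($rows) = @_;
    return numeric_fraction( $rows->[0] ) == 0 ? 1 : 0;
}

# Made-up sample data: contact info with a purely textual header ...
my @contacts = (
    [ 'Name',  'Email',             'Phone'    ],
    [ 'Alice', 'alice@example.com', '555-0100' ],
);

# ... and a table whose header row consists mostly of years.
my @sales = (
    [ 'Region', 2017,  2018,  2019  ],
    [ 'North',  120.5, 130.1, 128.9 ],
    [ 'South',  98.2,  101.7, 110.4 ],
);

printf "contacts: %d\n", naive_is_header( \@contacts );   # header accepted
printf "sales:    %d\n", naive_is_header( \@sales );      # header rejected
```

The year-based header is rejected even though it is a perfectly valid header, so a numeric first row is probably better treated as weak evidence to be weighed against other signals than as a hard rule.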

    @nysus: in general, a very strong principle is "know your data." Do you know anything about the data you're going to deal with, or is this a general-purpose tool for which you can't know anything about the type of data in advance?

      Most of the data I'm dealing with will be related to people's contact info. But I'm also interested in trying to write a general purpose tool that can be used by others just for the challenge.

      $PM = "Perl Monk's";
      $MCF = "Most Clueless Friar Abbot Bishop Pontiff Deacon Curate Priest Vicar";
      $nysus = $PM . ' ' . $MCF;