by wizbancp (Sexton)
on Feb 13, 2007 at 15:39 UTC
Category: Web Stuff
Author/Contact Info wizbancp
Description: A script for exploring a site and catching links. Simply specify the starting URL and the search depth (sorry for my English! :-)). At the end, the script produces a text file with the addresses it caught.
After the criticism (:-)) I modified the script to catch only link addresses and no longer email addresses.... =:-( Usage: pass "url depth" as arguments, or run it with no arguments to be prompted for both.
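#!/usr/bin/perl
use strict;
use warnings;

use LWP::UserAgent;

open LINK, ">", "link.txt" or die "Cannot open link.txt: $!";

my ($indirizzo, $profond);
if (!@ARGV) {
    # no arguments: ask for the starting URL and the depth
    print "Insert starting URL: ";
    chomp($indirizzo = <STDIN>);
    print "\nInsert searching depth: ";
    chomp($profond = <STDIN>);
}
else {
    $indirizzo = $ARGV[0];
    $profond   = $ARGV[1];
}

# The line defining $indirizzohttp is missing from the post; assumed here
# to be the starting URL with "http://" added when no scheme is given.
my $indirizzohttp = $indirizzo =~ m{^https?://} ? $indirizzo : "http://$indirizzo";
my @elencolink = ($indirizzohttp);

my $ua = LWP::UserAgent->new;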

sub pausa {    # pause the script before it ends
   print "\nPress Enter to exit.\n";
   my $pausa = <STDIN>;
}

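sub catturalink {    # capture the URLs found in a page
   my $codice = shift;
   my $cont = 0;
   # The pattern is cut off in the post after "([\w\-\.,@"; the rest of the
   # path character class below is an assumption, not the author's text.
   while ($codice =~ m/((http|https):\/\/[\w\-_]+(\.[\w\-_]+)+([\w\-\.,\@?^=%&:\/~\+#]*[\w\-\@?^=%&\/~\+#])?)/g) {
       my $indirizzolink = $1;    # outer group holds the whole match
       print LINK "$indirizzolink\n";
       push @elencolink, $indirizzolink;
       $cont++;
   }
   print "Found $cont links\n";
}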

sub visitapagina {    # fetch a page and capture its links
    my $pagina = shift;
    my $response = $ua->get($pagina);
    if ($response->is_success) {
        my $codicehtml = $response->content;
        print "\n -- $pagina --\n";
        catturalink($codicehtml);
    }
    else {
        # on failure, report the HTTP status instead
        print $response->status_line."\n";
    }
}

my $inizio=0;
my $fine=0;

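# One pass per depth level (outer loop reconstructed from the variables in
# the post): each pass visits the links gathered by the previous pass.
for (my $i=0; $i<$profond; $i++) {
    $fine = scalar(@elencolink)-1;
    for (my $c=$inizio; $c<=$fine; $c++) {
        print "\n$inizio  $c  $fine";
        visitapagina($elencolink[$c]);
    }
    $inizio = $fine+1;
}

print "\n Operation ended! \n";
pausa();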

close LINK;
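
The way the search depth works may be easier to see without any network access. The following is only a sketch, not the author's code: the %site hash, the crawl() function and the a.example addresses are all made up for illustration. It walks an in-memory set of pages level by level, using the same kind of start/end window over a growing list that the script keeps in $inizio and $fine. Unlike the posted script, it also skips addresses it has already queued.

```perl
use strict;
use warnings;

# Hypothetical in-memory "site": page => links found on that page.
my %site = (
    'http://a.example/'  => ['http://a.example/b', 'http://a.example/c'],
    'http://a.example/b' => ['http://a.example/d'],
    'http://a.example/c' => [],
    'http://a.example/d' => ['http://a.example/e'],
);

# Visit pages level by level and return every address collected.
sub crawl {
    my ($start, $depth) = @_;
    my @queue = ($start);
    my %seen  = ($start => 1);    # assumption: duplicates are skipped
    my ($first, $last) = (0, 0);
    for my $level (1 .. $depth) {
        $last = $#queue;          # end of the window for this level
        for my $i ($first .. $last) {
            for my $link (@{ $site{ $queue[$i] } || [] }) {
                push @queue, $link unless $seen{$link}++;
            }
        }
        $first = $last + 1;       # next level starts after this window
    }
    return @queue;
}

print "$_\n" for crawl('http://a.example/', 2);
```

With a depth of 2 this lists the start page, the two pages it links to, and http://a.example/d, which is linked from one of those. http://a.example/e is never reached because it is three levels away.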
Re: Link & Email Hunter
by merlyn (Sage) on Feb 13, 2007 at 16:33 UTC
Re: Link & Email Hunter
by blue_cowdawg (Monsignor) on Feb 13, 2007 at 16:19 UTC

    The existence of just this sort of script is why I generally counsel clients of mine not to put email addresses on their websites directly, with the exception of "catchall" email accounts and such.

    Email harvesting is quite frequently the tool of spammers.

    Peter L. Berghold -- Unix Professional
    Peter -at- Berghold -dot- Net; AOL IM redcowdawg Yahoo IM: blue_cowdawg
Re: Link Hunter
by wizbancp (Sexton) on Feb 14, 2007 at 08:23 UTC
    I modified the code ...:-)
    Feel the Dark Power of Regular Expressions...

