PerlMonks  

Wow. To everyone who responded and pointed out that I never actually described the kind of data I want to extract: I wholeheartedly apologize. Here it is.

I have found an online database that lists every kind of construction contractor in the US. I went through that site and saved a copy of the pages for all of the contractors within my desired scope. I want to build an email list and a snail mail list so that I can contact all of them, for the sole purpose of marketing a package that I've put together.

At first I tried Access (yes, the one from Mr. Gates). Then one day I got to chatting with a friend, who told me that Perl is so much better at this kind of thing than anything MS ever dreamed up.

So, I wish to extract all of the contact data from this list of web pages and then sort the contacts into email contacts and snail mail contacts. Everything! If a contractor's only listed contact is a website, then, if at all possible, I want to go to that site and grab an email address from it for the mailing list.

For those of you who are anti-spam: I am not into spamming. I only wish to create a list that I can use to market my package to one of the construction trades. Yes, I considered buying a mailing list from a big-time marketer, but if I can learn to do it myself, then forget them. I don't mind paying for a data extraction program, but buying a premade list that may or may not meet my needs is too big a chance to take.

I hope this is specific enough. Again, thank you for your time and responses.

In reply to Re: how do I extract contact data from websites? by Meisamhe
in thread how do I extract contact data from websites? by Meisamhe
