Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re^3: In-browser mech-like thing?

by aquarium (Curate)
on Oct 31, 2010 at 22:11 UTC ( [id://868643]=note: print w/replies, xml ) Need Help??


in reply to Re^2: In-browser mech-like thing?
in thread In-browser mech-like thing?

and furthermore, since the entire session (not just the login) is likely to be https, you won't be able to scrape the gibberish. you can automate pressing buttons etc, but the https info sent from the server will not be intelligable, afaik.
the hardest line to type correctly is: stty erase ^H

Replies are listed 'Best First'.
Re^4: In-browser mech-like thing?
by dgaramond2 (Monk) on Nov 03, 2010 at 03:14 UTC
    If we implement the scraper as a browser plugin/addon, the browser will provide the HTTPS content (and even the DOM) for us. IIRC, Chrome permits an addon/extension to insert some script to any page and do cross-domain AJAX request (after the user allows it).

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://868643]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others learning in the Monastery: (5)
As of 2024-04-19 12:35 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found