Clear questions and runnable code get the best and fastest answer |
|
PerlMonks |
Re: HTTP Scriptingby ajt (Prior) |
on Nov 29, 2002 at 12:30 UTC ( [id://216514]=note: print w/replies, xml ) | Need Help?? |
Perl is good at this kind of thing. Perl has modules to connect to web servers (LWP), work with the cookies and passwords, and parse HTML (HTML). Perl has several HTML/XML parsers, some are general purpose parsers, and some are dedicated, e.g. link extractors, header parsers. You could argue that your choice is so wide that it becomes daunting! I would suggest the following books: Perl and LWP which is all about connecting to, collecting from, and parsing of web data. I would also suggest Data Munging with Perl, it's a little older and more generic (it's for more than just web automation), but it's a fine book and has good examples of web data mining. Web Client Programming with Perl is old and out of print, but it's freely available as an OpenBook from O'Reilly, and quite useful. I would also check out merlyn's columns as I think there are some good examples in there with good descriptions. There may also be something in Perl.com's article archive. -- ajt
In Section
Seekers of Perl Wisdom
|
|