Re: REGEX for url


P is for Practical
	PerlMonks

Re: REGEX for url

by graff (Chancellor)

on Apr 25, 2016 at 21:44 UTC ( [id://1161484]=note: print w/replies, xml )

Need Help??

in reply to REGEX for url

It looks like you're just trying to extract values of href= attributes from anchor tags (i.e. the "..." from <a href="...">) in html data.

I'm surprised that no one yet has mentioned that there are CPAN modules for doing exactly that - e.g. HTML::LinkExtor, among others. (I haven't had occasion to use them myself. but to do what you're doing, I'd start with one of those.)

Comment on Re: REGEX for url Select or Download Code

Replies are listed 'Best First'.
Re^2: REGEX for url by wrkrbeee (Scribe) on Apr 25, 2016 at 21:46 UTC
You are exactly right, extract data between anchor tags. I will try the CPAN module you mentioned. Thank you!!	[reply]
Re^3: REGEX for url by graff (Chancellor) on Apr 25, 2016 at 21:52 UTC
Having looked a little more at the CPAN search results, I find it odd that the man page for HTML::LinkExtor appears to be shorter and simpler than the one for HTML::SimpleLinkExtor -- I'm not sure what "Simple" is supposed to refer to in the latter module.	[reply]

In Section Seekers of Perl Wisdom

Domain Nodelet^?

www.com | www.net | www.org

Node Status^?

node history
Node Type: note [id://1161484]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others surveying the Monastery: (2)

As of 2024-04-24 23:52 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Today I Learned

Voting Booth^?

No recent polls found