Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: youtube parser/scrabber

by Corion (Patriarch)
on Aug 18, 2021 at 11:35 UTC ( [id://11135922]=note: print w/replies, xml ) Need Help??


in reply to youtube parser/scrabber

I would look at what requests youtube-dl sends and replicate that. youtube-dl itself has an option to output all the data it scrapes as JSON, but if you don't want it as external dependency, that's out.

jwz maintains youtubedown, which you can look at to find what/how it scrapes the information. At one time I looked at converting that to a module to make it accessible from other programs, but that wasn't as easy as I thought either.

Replies are listed 'Best First'.
Re^2: youtube parser/scrabber
by igoryonya (Pilgrim) on Aug 18, 2021 at 13:08 UTC
    Big one - 145KB without the comments!
    And it supports more, then just the youtube.
    A lot of work was put into it!

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11135922]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others cooling their heels in the Monastery: (3)
As of 2024-04-26 02:24 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found