PerlMonks
Re: Solving possible missing links
by Corion (Pope) on Jul 16, 2019 at 18:33 UTC ( #11102938=note )
Going through the links that point to external sites is also a useful way to scan for spam links, so this is indeed desirable. I currently lack the time to do it myself, and database access is somewhat scarce, but the relevant DB schema is (roughly):
And real, working SQL to query these tables is (also at Replies with outbound links, but that node is accessible to gods only):
This SQL should be refined to also catch https:// links, and then some Perl code needs to be written to verify that the matched text is an actual link. Test cases for text with links would be, for example:
Negative test cases would be:
Ideally, we will be able to refine this code later to highlight outbound links that are not on the whitelist of PerlMonks links.
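As a starting point for the Perl side, here is a minimal sketch of the link check and the whitelist filter. It is not the site's actual code. It assumes that a "real" link is either a bracket link (`[http://...]` or `[https://...|title]`) or an `<a href="...">` anchor, and that a URL merely mentioned in plain text does not count. The sub names (`outbound_links`, `host_of`, `non_whitelisted_links`), the whitelist contents and the sample strings are all made up for illustration.

```perl
use strict;
use warnings;

# Hypothetical whitelist of hosts that count as "our own" links.
# The real whitelist is not shown in this thread.
my %whitelist = map { $_ => 1 } qw( perlmonks.org www.perlmonks.org );

# Return every http:// or https:// URL that appears as an actual link.
# Assumption: only [url], [url|title] and <a href="url"> count as links.
sub outbound_links {
    my ($text) = @_;
    my @links;

    # Bracket links: [http://host/path] or [https://host/path|title]
    push @links, $1 while $text =~ m{\[\s*(https?://[^\]\|\s]+)}gi;

    # HTML anchors: <a href="http://host/path">
    push @links, $1
        while $text =~ m{<a\s[^>]*?href\s*=\s*["']?(https?://[^"'\s>]+)}gi;

    return @links;
}

# Extract the lowercased host part of a URL, or '' if there is none.
sub host_of {
    my ($url) = @_;
    return $url =~ m{^https?://([^/:?#]+)}i ? lc $1 : '';
}

# Keep only the links whose host is not on the whitelist.
sub non_whitelisted_links {
    my ($text) = @_;
    return grep { !$whitelist{ host_of($_) } } outbound_links($text);
}

# Made-up sample texts, not the test cases from the post.
my @samples = (
    'See [https://example.com/page|this page]',
    'Docs at [http://www.perlmonks.org/?node_id=1]',
    'The string http://example.com without brackets is not a link',
);

for my $text (@samples) {
    my @flagged = non_whitelisted_links($text);
    print scalar(@flagged), " flagged in: $text\n";
}
```

The sketch deliberately ignores `<code>` sections, where a bracketed URL would not be rendered as a link, so a real implementation would have to strip or skip those first. Matching on the host rather than on the whole URL keeps the whitelist short, at the cost of not distinguishing between paths on the same host.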
In Section: Perl Monks Discussion