Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Re^2: How to get raw node content

by roboticus (Chancellor)
on Sep 15, 2017 at 20:56 UTC ( [id://1199480]=note: print w/replies, xml ) Need Help??


in reply to Re: How to get raw node content
in thread How to get raw node content

LanX:

Thanks, I just gave that a try, and it worked nicely for a non-reaped node. That'll give me what I'm looking for, 95% of the time.

However, I can't seem to make it work for a reaped node. I tried:

http://perlmonks.org/?node_id=1199455;displaytype=xml

But in return I got:

<node id="1199455" title="Reaped: Re: .pl to .exe" created="2017-09-15 + 09:28:14" updated="2017-09-15 09:28:14"> <type id="11">note</type> <author id="52855">NodeReaper</author> <data> <field name="doctext"> This node was taken out by the [NodeReaper] on [localtime://2017-09-15 + 16-15-40]<BR>Reason: &#91;[hippo]]: Unformatted, without context and + apparently off-topic<p>You may view [href://?op=viewreaped;node_id=1 +199455|the original node and the consideration vote tally].</p> </field> <field name="root_node">288628</field> <field name="parent_node">288628</field> </data> </node>

So I tried changing it to:

http://perlmonks.org/?op=viewreaped;node_id=1199455;displaytype=xml

And got:

<node id="53641" title="Visit Reaped Nodes" created="2001-01-23 00:08: +15" updated="2005-08-22 15:36:03"> <type id="14">superdoc</type> <author id="937169">InGoodGraces</author> </node>

Edit: on re-reading your reply, I looked for and found the XML link you mentioned. I hadn't noticed it. Unfortunately, it also fails on a reaped node.

...roboticus

When your only tool is a hammer, all problems look like your thumb.

Replies are listed 'Best First'.
Re^3: How to get raw node content
by LanX (Saint) on Sep 15, 2017 at 23:43 UTC
    > However, I can't seem to make it work for a reaped node.

    I think this is intentionally so (or simply nobody cared to provide this xml interface here)

    Reaped nodes are meant to become invisible to discourage spammers.

    And I'm not sure if really all input text is passed thru, since this might lead to a vulnerability for the viewer.

    Anyway I once was capable to download all reaped nodes in order to train a spam filter, but I can't recall how I did it.

    Probably I only got it in the HTML form.

    Cheers Rolf
    (addicted to the Perl Programming Language and ☆☆☆☆ :)
    Je suis Charlie!

    EDIT

    Actually I see an xml-displaytype link in the original text of reaped nodes, but only in my role as pm-dev.

    And it doesn't show the raw text but the perl-code producing this node.

    In other words this "show original and vote tally" node with op=viewreaped; belongs a totally different class of nodes.

      LanX:

      Thanks for your help and follow-up, I appreciate it.

      ...roboticus

      When your only tool is a hammer, all problems look like your thumb.

Re^3: How to get raw node content
by Anonymous Monk on Sep 15, 2017 at 21:31 UTC

    you need logged in (and correct link)

    or you can use corion's backup

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1199480]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others chilling in the Monastery: (4)
As of 2024-04-20 00:25 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found