How to get raw node content

Replies are listed 'Best First'.
Re: How to get raw node content by LanX (Saint) on Sep 15, 2017 at 18:30 UTC
TL;DR, but isn't the XML link at the top of each post already sufficient? E.g. http://perlmonks.org/?node_id=1199471;displaytype=xml

How much "rawer" do you need it? :)

Cheers Rolf
(addicted to the Perl Programming Language and ☆☆☆☆ :) Je suis Charlie!

PS: When your only tool is a thumb, you wish you could hold a hammer. ;-)
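For anyone who wants to script this, here is a minimal Python sketch of building that link. The helper names `node_url` and `fetch_node` are made up for illustration; only the `node_id` and `displaytype` parameters, the host, and the `;` separator come from the URL quoted above.

```python
from urllib.request import urlopen

BASE = "http://perlmonks.org/"  # host as quoted in the post above


def node_url(node_id, displaytype="xml"):
    """Build a node URL in the same shape as the link above (hypothetical helper)."""
    return f"{BASE}?node_id={int(node_id)};displaytype={displaytype}"


def fetch_node(node_id, displaytype="xml"):
    """Download the page and return the undecoded bytes.

    Not called here; it needs network access, and the response
    encoding is left to the caller because the thread does not state it.
    """
    with urlopen(node_url(node_id, displaytype), timeout=10) as resp:
        return resp.read()


print(node_url(1199471))
# prints: http://perlmonks.org/?node_id=1199471;displaytype=xml
```

Passing another `displaytype` value only changes the last query parameter, so the same helper covers the other views mentioned in this thread.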
Re^2: How to get raw node content by roboticus (Chancellor) on Sep 15, 2017 at 20:56 UTC
LanX:

Thanks, I just gave that a try, and it worked nicely for a non-reaped node. That'll give me what I'm looking for 95% of the time. However, I can't seem to make it work for a reaped node. I tried:

`http://perlmonks.org/?node_id=1199455;displaytype=xml`

But in return I got:

`<node id="1199455" title="Reaped: Re: .pl to .exe" created="2017-09-15 09:28:14" updated="2017-09-15 09:28:14"> <type id="11">note</type> <author id="52855">NodeReaper</author> <data> <field name="doctext"> This node was taken out by the [NodeReaper] on [localtime://2017-09-15 16-15-40]<BR>Reason: [[hippo]]: Unformatted, without context and apparently off-topic<p>You may view [href://?op=viewreaped;node_id=1199455|the original node and the consideration vote tally].</p> </field> <field name="root_node">288628</field> <field name="parent_node">288628</field> </data> </node>`

So I tried changing it to:

`http://perlmonks.org/?op=viewreaped;node_id=1199455;displaytype=xml`

And got:

`<node id="53641" title="Visit Reaped Nodes" created="2001-01-23 00:08:15" updated="2005-08-22 15:36:03"> <type id="14">superdoc</type> <author id="937169">InGoodGraces</author> </node>`

Edit: On re-reading your reply, I looked for and found the XML link you mentioned. I hadn't noticed it. Unfortunately, it also fails on a reaped node.

...roboticus

When your only tool is a hammer, all problems look like your thumb.
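The two responses quoted above can be told apart programmatically. Below is a rough Python sketch, not an official client: the function name `summarize` is invented, the sample is a shortened copy of the first response, and treating `NodeReaper` as the author as a sign of a reaped node is an assumption drawn only from that one response. It also assumes the real XML escapes any markup inside `doctext`.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the reaped-node response quoted above (doctext trimmed).
SAMPLE = """<node id="1199455" title="Reaped: Re: .pl to .exe" created="2017-09-15 09:28:14" updated="2017-09-15 09:28:14">
<type id="11">note</type>
<author id="52855">NodeReaper</author>
<data>
<field name="doctext">This node was taken out by the [NodeReaper]</field>
<field name="root_node">288628</field>
<field name="parent_node">288628</field>
</data>
</node>"""


def summarize(xml_text):
    """Pull the interesting bits out of a displaytype=xml response."""
    node = ET.fromstring(xml_text)
    # Collect every <field name="..."> into a dict keyed by its name.
    fields = {f.get("name"): (f.text or "") for f in node.iter("field")}
    author = node.findtext("author", default="")
    return {
        "id": node.get("id"),
        "title": node.get("title"),
        "author": author,
        "doctext": fields.get("doctext"),   # None when there is no doctext field
        "reaped": author == "NodeReaper",   # heuristic, see the note above
    }


info = summarize(SAMPLE)
print(info["reaped"], info["doctext"])
```

Run against the second response (the "Visit Reaped Nodes" superdoc), the same function would return `doctext` as `None`, since that node carries no `<data>` section at all.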
Re^3: How to get raw node content by LanX (Saint) on Sep 15, 2017 at 23:43 UTC
> However, I can't seem to make it work for a reaped node.

I think this is intentional (or simply nobody cared to provide this XML interface here). Reaped nodes are meant to become invisible to discourage spammers. And I'm not sure whether all input text is really passed through, since this might lead to a vulnerability for the viewer.

Anyway, I was once able to download all reaped nodes in order to train a spam filter, but I can't recall how I did it. Probably I only got them in HTML form.

Cheers Rolf
(addicted to the Perl Programming Language and ☆☆☆☆ :) Je suis Charlie!

EDIT: Actually, I see an xml-displaytype link in the original text of reaped nodes, but only in my role as pm-dev. And it doesn't show the raw text but the Perl code producing this node. In other words, this "show original and vote tally" node with `op=viewreaped;` belongs to a totally different class of nodes.
Re^4: How to get raw node content by roboticus (Chancellor) on Sep 16, 2017 at 01:01 UTC
Re^3: How to get raw node content by Anonymous Monk on Sep 15, 2017 at 21:31 UTC
You need to be logged in (and use the correct link), or you can use Corion's backup.
Re: How to get raw node content by huck (Prior) on Sep 15, 2017 at 19:53 UTC
When I want to do that, I just use the "view page source" option in my browser. If that is still too much, you may want to try adding displaytype=print, i.e. http://www.perlmonks.org/?node_id=1199455&displaytype=print and then "view page source".
Re^2: How to get raw node content (whitespace and nodelet hack) by LanX (Saint) on Sep 16, 2017 at 15:06 UTC
Actually, your approach is better for seeing whitespace and line breaks directly inside the browser, since the XML view in FF doesn't show them.

FWIW, here is a nodelet hack which opens a JS alert() with the HTML of the post you are trying to reply to.

`<script><!-- function show_quote() { alert(document.querySelector("div.preview").innerHTML.match(/^[^]*?(?=<hr> <div class="editnodetext">)/)[0]); } --></script> <a href='javascript:show_quote()'> show_quote</a>`

(You need to be in a "comment on" node to make it work.)

It was part of my plans to extend my wiki syntax with comfortable quoting of a user's post...

... of course, milking the XML displaytype would be more reasonable here.

Cheers Rolf
(addicted to the Perl Programming Language and ☆☆☆☆ :) Je suis Charlie!
Re^2: How to get raw node content by LanX (Saint) on Sep 15, 2017 at 20:44 UTC
Sorry for nitpicking ...

... it's very close, but that's not the raw input of the poster. For instance, `[links]` are expanded and code tags have an extra download link.

Cheers Rolf
(addicted to the Perl Programming Language and ☆☆☆☆ :) Je suis Charlie!
Re^2: How to get raw node content by roboticus (Chancellor) on Sep 15, 2017 at 21:02 UTC
huck:

Thanks, but that still has the HTML entity encoding and bracket ([ ]) munging.

...roboticus

When your only tool is a hammer, all problems look like your thumb.
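If the entity encoding is the only obstacle, the standard library can undo that part. This is a small Python sketch with an invented sample string and a hypothetical helper name `decode_entities`; it assumes the brackets show up as numeric entities (`&#91;` and `&#93;`) in the page source, which this thread does not confirm, and it does nothing about expanded links or the extra download links on code blocks.

```python
import html


def decode_entities(text):
    """Turn HTML entities such as &lt; or &#91; back into literal characters."""
    return html.unescape(text)


# Invented example line, roughly what escaped Perl might look like in page source.
escaped = "if ($a &lt; $b &amp;&amp; $c) { print &quot;ok&quot; } &#91;id://1199471&#93;"
print(decode_entities(escaped))
# prints: if ($a < $b && $c) { print "ok" } [id://1199471]
```

That gets the text closer to what was typed, though it is still a reconstruction rather than the stored node content.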
Re^3: How to get raw node content by karlgoethebier (Abbot) on Sep 16, 2017 at 14:38 UTC
How raw can we get?

`lynx -dump http://perlmonks.org > dump.txt`

Lynx?

Best regards, Karl

«The Crux of the Biscuit is the Apostrophe»

`perl -MCrypt::CBC -E 'say Crypt::CBC->new(-key=>'kgb',-cipher=>"Blowfish")->decrypt_hex($ENV{KARL});'` Help
Re^4: How to get raw node content by LanX (Saint) on Sep 16, 2017 at 14:43 UTC
Re^5: How to get raw node content by karlgoethebier (Abbot) on Sep 16, 2017 at 14:47 UTC

