Comment a block that match a keyword

yorkwu has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Comment a block that match a keyword by BrowserUk (Patriarch) on Aug 16, 2007 at 12:37 UTC
This is very dependant upon the correct and logical formatting of the input. It could probably be simplified through refactoring and needs a lot of testing, but the principles it uses might help: ##! perl -slw use strict; while( <DATA> ) { chomp; print, next unless m[TIMINGCHECK]; my $count = tr[(][(] - tr[)][)]; { s[^][//]; print; last unless defined( $_ = <DATA> ); chomp; $count += tr[(][(] - tr[)][)]; redo unless $count < 0; } print; } __DATA__ ... (CELL ... (TIMINGCHECK .... .... ) ) (CELL (CELLTYPE "SEDFQD1") (INSTANCE uTrigger/TrcInclCtrlReg_reg[13]) (DELAY (ABSOLUTE (IOPATH CP Q (0.10:0.15:0.25)(0.09:0.15:0.24)) ) ) (TIMINGCHECK (SETUP (posedge SI) (posedge CP) (0.14:0.23:0.41)) (SETUP (negedge SI) (posedge CP) (0.09:0.16:0.30)) ....(random lines) (HOLD (negedge SI) (posedge CP) (0.00:0.00:0.00)) (HOLD (negedge D) (posedge CP) (0.00:0.00:0.00)) ) ) [download] Output: `c:\test>junk6 ... (CELL ... // (TIMINGCHECK // .... // .... // ) ) (CELL (CELLTYPE "SEDFQD1") (INSTANCE uTrigger/TrcInclCtrlReg_reg[13]) (DELAY (ABSOLUTE (IOPATH CP Q (0.10:0.15:0.25)(0.09:0.15:0.24)) ) ) // (TIMINGCHECK // (SETUP (posedge SI) (posedge CP) (0.14:0.23:0.41)) // (SETUP (negedge SI) (posedge CP) (0.09:0.16:0.30)) // ....(random lines) // (HOLD (negedge SI) (posedge CP) (0.00:0.00:0.00)) // (HOLD (negedge D) (posedge CP) (0.00:0.00:0.00)) // ) )` [download] Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error. "Science is about questioning the status quo. Questioning authority". In the absence of evidence, opinion is indistinguishable from prejudice. "Too many [] have been sedated by an oppressive environment of political correctness and risk aversion."	[reply] [d/l] [select]
Re^2: Comment a block that match a keyword (did) by tye (Sage) on Aug 17, 2007 at 02:51 UTC
Here is a simple demonstration that Perl does indeed have a "post-condition loop construct" (contrary to the claim nearby in this thread), and that it is useful. Here is a trivial refactoring of the above code to make use of that. I don't repeat the input nor output, but if you provide it with the former, then it will duplicate the latter. `#!/usr/bin/perl -lw use strict; while( <DATA> ) { chomp; if( ! m[TIMINGCHECK] ) { print; next; } my $count= tr[(][(] - tr[)][)]; do { s[^][//]; print; last # Leaves the while( <DATA> ) loop if ! defined( $_= <DATA> ); chomp; $count += tr[(][(] - tr[)][)]; } while( 0 <= $count ); print; }` [download] Note that the last jumps out of both "loops", which can be considered an improvement since the original code would warn about trying to print an undefined value (when given incomplete input). And, here is one way I might refactor this (in order to avoid repeating the code used to read input and the code to count parens): `#!/usr/bin/perl -lw use strict; my $count= -1; while( <DATA> ) { chomp; $count= 0 if m[TIMINGCHECK]; $count += tr[(][(] - tr[)][)] if 0 <= $count; s[^][//] if 0 <= $count; print; }` [download] Which also duplicates the above output, though it might not always agree on all inputs (it doesn't warn on incomplete input, certainly). - tye	[reply] [d/l] [select]
Re^2: Comment a block that match a keyword by FunkyMonk (Chancellor) on Aug 17, 2007 at 00:01 UTC
There's a couple of things That smell bad to me about this. They're personal things; I don't have any doubts that the code works. It's just things that trouble me: a block pretending to be a loop, and the use of redo and last therein. (this is definatley just me) chomp that isn't the first thing done in the "loop". I'd have put it first, even if any subsequent print (etc) had to include `\n`. I'd be happier with a do block, rather than a bare block. Perhaps it's just me :) Like I said, personal things. update: added stuff about do after further thought	[reply] [d/l] [select]
Re^3: Comment a block that match a keyword by BrowserUk (Patriarch) on Aug 17, 2007 at 01:45 UTC
I wasn't particularly enamoured with it, hence my "could be refactored" comment. The reason I didn't refactor it at the time was I couldn't see a nice way how to. Noting your "personal preferences" emphasis, I hope you don't mind if I respond with my take on things? a block pretending to be a loop, and the use of redo and last therein. <Reveal this spoiler or all in this thread> And that repetition of 'do stuff' is a problem. In this case, having read a line at the top of the while loop, we need to do some stuff (initialise out parens count from the first line) enter the loop construct do some more stuff (prepend the comment card and print) read the next line, check for eof. chomp the line we just read. Do some more stuff (adjust the parens count from the new line) decide whether to loop or not And the only way I know how to do that in perl (without artificial means like setting flags and/or double condition tests) is redo. You said: I'd be happier with a do block, rather than a bare block., but that doesn't work: `#! perl -slw use strict; my $i =0; do{ print ++$i; redo if $i < 5; }; __END__ c:\test>junk2 1 Can't "redo" outside a loop block at c:\test\junk2.pl line 7.` [download] You'd have to do `#! perl -slw use strict; my $i =0; do{{ print ++$i; redo if $i < 5; }}; __END__ c:\test>junk2 1 2 3 4 5` [download] That is, embed a bare block within the do block, and that is redundant and very obscure. You could adopt a Perl 6 like construct: `LOOP:{ ... redo LOOP; }` [download] which could be construed as clearer. But frankly, redo in a bare block is a perfectly valid and useful construct and, I think, it is better to just become familiar with it than to obscure it. Indeed, it is actually the most flexible looping construct. It can be used to construct all many other looping constructs Perl has. Even the much decried but extremely flexible C-style for loop with its otherwise unique ability to vary multiple indexes concurrently. `## draw the diagonals for( my $x=0, my $y=0; $i < $xMax; $x++, $y++ ) { draw( $x, $y ); +} for( my $x=0, my $y=$yMax; $i < $xMax; $x++, $y-- ) { draw( $x, $y ); +}` [download] It's a little used feature, but when you need it, you need it: chomp that isn't the first thing done in the "loop". I'd have put it first, even if any subsequent print (etc) had to include \n. Hm. I'm not sure what the position of chomp has to do with the loop construct. The chomp has to follow the readline. The readline has to occur in the middle of the loop. I'm still not happy with the construction I posted, but I haven't come up with a better one. Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error. "Science is about questioning the status quo. Questioning authority". In the absence of evidence, opinion is indistinguishable from prejudice. "Too many [] have been sedated by an oppressive environment of political correctness and risk aversion."	[reply] [d/l] [select]
Re^4: Comment a block that match a keyword (do) by tye (Sage) on Aug 17, 2007 at 02:12 UTC
Re^4: Comment a block that match a keyword by BrowserUk (Patriarch) on Aug 17, 2007 at 10:33 UTC
Re^2: Comment a block that match a keyword by yorkwu (Novice) on Aug 16, 2007 at 13:51 UTC
Wow! what a cool way to find out the balanced bracketed block. Thank you, BrowserUK! I really learn a lot. York	[reply]
Re^2: Comment a block that match a keyword by moritz (Cardinal) on Aug 17, 2007 at 10:52 UTC
I quite like that solution, but it will fail if there are unbalanced brackets in quoted strings, like `"foo("`. If you want to consider that, you have to purge all quoted strings first. You can achieve this with the regexes from Regexp::Common::delimited. Perl 6 in German	[reply] [d/l]
Re^3: Comment a block that match a keyword by BrowserUk (Patriarch) on Aug 17, 2007 at 11:29 UTC
Yes. Hence my "This is very dependant upon the correct and logical formatting of the input." caveat. If there are any errors in the balancing of parens, it will fail horribly, but as the text is obviously source to some parser somewhere, it's a reasonable, pragmatic, economic ROI decision to say: This 'comment out timing checks script' is only usable on source that parses correctly using a.n.other tool. A pragmatic decision to save having to reverse engineer that a.n.other tool's parser from scratch and without the originial specs. It will also fail in many cases that would (probably; no spec!) be successfully parsed by that other parser. For example, if the close parens placements are coalesed on a single line, rather than laid out in a logically structured way as per the OPs example: `(CELL (TIMINGCHECK (SETUP (posedge SI) (posedge CP) (0.14:0.23:0.41)) (SETUP (negedge SI) (posedge CP) (0.09:0.16:0.30)) ....(random lines) (HOLD (negedge SI) (posedge CP) (0.00:0.00:0.00)) (HOLD (negedge D) (posedge CP) (0.00:0.00:0.00)) ))` [download] In this case, the close paren of the `(CELL` block will also be commented out and the result will fail to parse with that other tool. In an ideal world one would go back to the authors of a.n.other tool, request a copy of their parser, or the specifications from which it was drawn, and produce a 'proper parser' script that understood all the rules of the input language and performed the required operation. But this isn't an ideal world, and time is money, and performing ad-hoc text munging tasks like this are exactly what Perl was invented for. (Amongst other things. :) But then again, it seems that the authors of a.n.other tool were either on-the-ball or responsive to in-use experience of using their tool, because it seems they already may have provided an option that make this entire thread redundant. It's just a shame that post hasn't received the attention and votes it deserves. It's a non-perl solution, but, assuming the poster has correctly recognised the nature of the data and OP is using the correct a.n.other tool, by far the best solution to the OPs problem. Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error. "Science is about questioning the status quo. Questioning authority". In the absence of evidence, opinion is indistinguishable from prejudice. "Too many [] have been sedated by an oppressive environment of political correctness and risk aversion."	[reply] [d/l] [select]
Re^4: Comment a block that match a keyword (Verilog SDF) by toolic (Bishop) on Aug 17, 2007 at 14:35 UTC
Re: Comment a block that match a keyword by NetWallah (Canon) on Aug 16, 2007 at 14:05 UTC
Same result in 3 lines of code (Use BrowserUk's __DATA__ block): `while( <DATA> ) { m/$TIMINGCHECK/ .. m/^\s$\s$/ or print , next; print "//$_" }` [download] This leverages the "Toggle" nature of the ".." operator (perldoc perlop). "An undefined problem has an infinite number of solutions." - Robert A. Humphrey "If you're not part of the solution, you're part of the precipitate." - Henry J. Tillman	[reply] [d/l]
Re^2: Comment a block that match a keyword by BrowserUk (Patriarch) on Aug 16, 2007 at 14:26 UTC
Try it with this slight variation of data: `... (CELL ... (TIMINGCHECK .... .... ) ) (CELL (CELLTYPE "SEDFQD1") (INSTANCE uTrigger/TrcInclCtrlReg_reg[13]) (DELAY (ABSOLUTE (IOPATH CP Q (0.10:0.15:0.25)(0.09:0.15:0.24)) ) ) (TIMINGCHECK (OTHERTEST (SETUP (posedge SI) (posedge CP) (0.14:0.23:0.41)) (SETUP (negedge SI) (posedge CP) (0.09:0.16:0.30)) ....(random lines) (HOLD (negedge SI) (posedge CP) (0.00:0.00:0.00)) (HOLD (negedge D) (posedge CP) (0.00:0.00:0.00)) ) ) )` [download] Your code is relying on th happy fortuity of the sample data, that the close of the block is a single ')' on a line by itself. Mine actually counts the parens to determine when the block is closed. Its a hack, but a useful one. Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error. "Science is about questioning the status quo. Questioning authority". In the absence of evidence, opinion is indistinguishable from prejudice. "Too many [] have been sedated by an oppressive environment of political correctness and risk aversion."	[reply] [d/l]
Re^3: Comment a block that match a keyword by NetWallah (Canon) on Aug 16, 2007 at 18:38 UTC
OK - here is updated code - one line added, and some tweaking to count parens, but still using the flip-flop. This works with your variation of the data... I'm nor arguing your point that my original code was a hack - I simply wanted to illustrate the value of the flip-flop operator for this type of situation, including the fact that it CAN be use properly in production level code, and that it simplifies code. ##! perl -slw use strict; my $p=0; # Counts number of unmatched parens while( <DATA> ) { ##m/$TIMINGCHECK/ .. do {$p++ for m/\(/g; $p-- for m/$/g;$p==0} o +r print , next; ## Faster+simpler version, plagerizing BrowserUK's paren counting m +echanism m/\(TIMINGCHECK/ .. ($p+=tr[(][(] - tr[)][)])==0 or print ,next; print "//$_" } __DATA__ (CELL ... (TIMINGCHECK .... .... ) ) (CELL (CELLTYPE "SEDFQD1") (INSTANCE uTrigger/TrcInclCtrlReg_reg[13]) (DELAY (ABSOLUTE (IOPATH CP Q (0.10:0.15:0.25)(0.09:0.15:0.24)) ) ) (TIMINGCHECK (OTHERTEST (SETUP (posedge SI) (posedge CP) (0.14:0.23:0.41)) (SETUP (negedge SI) (posedge CP) (0.09:0.16:0.30)) ....(random lines) (HOLD (negedge SI) (posedge CP) (0.00:0.00:0.00)) (HOLD (negedge D) (posedge CP) (0.00:0.00:0.00)) ) ) ) [download] "An undefined problem has an infinite number of solutions." - Robert A. Humphrey "If you're not part of the solution, you're part of the precipitate." - Henry J. Tillman	[reply] [d/l]
Re^4: Comment a block that match a keyword by BrowserUk (Patriarch) on Aug 16, 2007 at 18:43 UTC
Re^2: Comment a block that match a keyword by clinton (Priest) on Aug 16, 2007 at 15:00 UTC
This leverages the "Toggle" nature of the ".." operator You learn a new thing every day. Interesting. Range operators in perlop Clint	[reply]
Re: Comment a block that match a keyword by shoness (Friar) on Aug 16, 2007 at 16:19 UTC
Are your library cells missing the checks? You could just leave these checks in place during SDF backannotation and just add "+notimingchecks" at compile or runtime to your simulator. VCS, MTI and NC all support that switch. You wouldn't have any Perl fun though!	[reply]
A reply falls below the community's threshold of quality. You may see it by logging in.


go ahead... be a heretic
	PerlMonks