Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid

Re: Programatically reparagraphinating text

by ww (Archbishop)
on Feb 16, 2007 at 14:45 UTC ( #600446=note: print w/replies, xml ) Need Help??

in reply to Programatically reparagraphinating text

Interesting project!

blahblahblah ++ re use of a dictionary. Coupled with the regex in the OP (or, perhaps, one that's rather more specific and insistent on the presence of periods), you may have something of a start on that part of the problem.

It does seem to me that reflowing text (horizontally) around ascii art will be problematic, at best. Perhaps it would also be well to accept a less design-oriented target and accept leaving anything determined to be ascii art as an inline item (takeout box, dropin, for a couple of terms that may clarify my intent), with the reformatted text above and below.

eg, NOT:

test here yada ya da   0000 01 02 03 04...
ya da'in continues       0010 0f 0e 0d...

but rather:

test here yada ya da

0000 01 02 03 04...
0010 0f 0e 0d...

ya da'ing continues

My next notion may be unmanageable, but might be worth exploring: Would creation of a second dictionary containing such common elements as the address fragments at the beginning of each line of a hex dump (2nd example) and the multiple spaces initiating each line in the BBS logo be worth the effort?

and <big grin> while use of a dictionary might not have returned this result; the mouthful of the title might have been reduced by using "reparagraphing"?

  • Comment on Re: Programatically reparagraphinating text

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://600446]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others exploiting the Monastery: (6)
As of 2020-08-15 08:18 GMT
Find Nodes?
    Voting Booth?
    Which rocket would you take to Mars?

    Results (78 votes). Check out past polls.