Hi Alex, On Fri, Apr 14, 2023 at 13:26:59 +0200, Alejandro Colomar wrote: > Hi Oskari, > > On 4/14/23 05:19, Oskari Pirhonen wrote: > > Remove leading whitespace and collapse multi-line declarations into a > > single line using (g)awk. > > I can't reak awk(1) :( > Awww man, but I even left the optional semicolons in... > But I like the idea. I implemented the same using sed(1) after your > suggestion. Does the below patch look good to you? > I actually had an earlier version with sed(1), but it used looping/branching to handle the multi-line bits, so I figured it was a bit ugly and didn't send it. I didn't think to try `-z`. It seems to do the same thing, so LGTM. > Cheers, > Alex > > P.S.: I forgot about writing a man page. I'll start now. > I was about to say "and license file and appropriate blurb" but then I saw your commit. I've got some suggestions for the man page, so I'll send some patches sometime soon. - Oskari Since you said you can't read awk, then just to satisfy your curiosity, here's what was going on: BEGIN { RS = ";\n" ORS = RS } This block is run at the start before any records are processed. The default Record Separator is "\n", but here we set it to ";\n". In (g)awk, a value of `RS` that is >1 char is actually a regex, but we only need to match a literal string. The Output Record Separator is by default also "\n". { gsub(/\n/, " ") sub(/^ +/, "") gsub(/ +/, " ") print } This block is run on all records, since it doesn't have any patterns for contitional execution attached to it. `gsub()` does an in-place global regex replace on a string, similar to the `s/regex/replace/g` you're familiar with. It takes an optional third arg, but if it's left out then it has an implicit `$0` which means the entire current record. `sub()` is like `gsub()`, but only does the first match, similar to `s/regex/replace/`. `print`, without any args, prints the current record followed by `ORS`. This was set earlier because the `RS` is consumed from the input as it is being processed record by record, but we want to keep the output looking intact. Hopefully not so bad after all. Awk is pretty nice IMO (and gawk in particular), and I would recommend checking it out if you find yourself borded one day :)
Attachment:
signature.asc
Description: PGP signature