How to delete prefix, suffix in a string matching a pattern and split on a character using sed?

Question

I have the following string, which is the output of a cassandra query in bash I want to split this string so as to remove the string in the beginning till the last + symbol and then remove the tail end, which is (XYZ rows). So, the string becomes A|1|a B|2|b C|3|c D|4|d. Now, I want to split this string

Accepted Answer

$ sed 's/[^+]*[+]*(.*[^ ]) *(.*)$/1/;y/ |/n /' <<< 'col1|col2|col3+++++++++++A|1|a B|2|b C|3|c D|4|d  (3 rows)'A 1 aB 2 bC 3 cD 4 dThe substitution does the following (hat tip to potong for pointing out how to get rid of one more substitution):s/    [^+]*      # Match non-plusses    [+]*       # Followed by plusses    (         # Capture the next group        .*     # Any characters (greedily)        [^ ]   # that end with a non-space    )         # End of capture group     *         # Spaces    (.*)       # Followed by whatever in parentheses$/1/          # Replace all that by the capture groupresulting in this intermediate stage:$ sed 's/[^+]*[+]*(.*[^ ]) *(.*)$/1/' <<< 'col1|col2|col3+++++++++++A|1|a B|2|b C|3|c D|4|d  (3 rows)'A|1|a B|2|b C|3|c D|4|dThe transformation (y///) turns all spaces into newlines and pipes into spaces.Spaces other than the ones separating rowsIf there are spaces within column and we assume that each entry has the format[spaces]entry[spaces]i.e., exactly two sets of spaces per entry, we have to replace the transformation y/// with another substitution,s/([^ |])( +[^ |])/1n2/gThis looks for spaces following not a space or pipe and followed by not a space or pipe, and inserts a newline before those spaces. Result:$ var='col1 | col2 | col3 +++++++++++ A | 1 | a B | 2 | b C | 3 | c D | 4 | d (3 rows)'$ sed 's/[^+]*[+]*(.*[^ ]) *(.*)$/1/;s/([^ |])( +[^ |])/1n2/g' <<< "$var" A | 1 | a B | 2 | b C | 3 | c D | 4 | d

Advertisement

Answer

Spaces other than the ones separating rows