Move all rows in a tsv with a certain date to their own file

Question

I have a TSV file with 4 columns in this format The 4th column is a date string Example : 2020-12-09 12:34:22 I want every row with the same date to go into its own file For example, file 20201209 should have all rows that start with 2020-12-09 in the 4th column file 20201210 should have all rows that start

Accepted Answer

With GNU awk to allow potentially large numbers of concurrently open output files and gensub():awk '{print > gensub(/-/,"","g",$(NF-1))}' fileWith any awk:awk '{out=$(NF-1); gsub(/-/,"",out); if (seen[out]++) print >> out; else print > out; close(out)}' fileThere&#8217;s ways to speed up either script by sorting the input first if that&#8217;s an issue.

Advertisement

Answer