Awk script to sum multiple columns if the value in column 1 is duplicated

Question

I need help with the following. I want to sum the values in columns 3, 5, 6, 7, 9 and 10 when the value in column 1 is duplicated. Duplicate rows should also be collapsed into a single row in the output file, and the value of column 1 should be written to column 8 of the output file. Input file and output file: (samples not included). I tried the code below, but it

Accepted Answer

    $ cat tst.awk
    BEGIN {
        FS=OFS="|"
    }
    NR==1 {
        print $0, "h"
        next
    }
    {
        keys[$1]
        for (i=2; i<=NF; i++) {
            sum[$1,i] += $i
        }
    }
    END {
        for (key in keys) {
            printf "%s", key
            for (i=2; i<=NF; i++) {
                printf "%s%s", OFS, sum[key,i]
            }
            print OFS key
        }
    }

    $ awk -f tst.awk file
    a|b|c|d|e|f|g|h
    IN27201800023963|10|11|72|11|62|62|IN27201800023963
    IN27201800024098|80|67|6|0|1|765|IN27201800024098
    IN27201800024099|11.01|190|66|18|3|20.45|IN27201800024099

The above outputs the lines in random order. If you want them output in the same order as the key values were read in, it's just a couple more lines of code:

    $ cat tst.awk
    BEGIN {
        FS=OFS="|"
    }
    NR==1 {
        print $0, "h"
        next
    }
    !seen[$1]++ {
        keys[++numKeys] = $1
    }
    {
        for (i=2; i<=NF; i++) {
            sum[$1,i] += $i
        }
    }
    END {
        for (keyNr=1; keyNr<=numKeys; keyNr++) {
            key = keys[keyNr]
            printf "%s", key
            for (i=2; i<=NF; i++) {
                printf "%s%s", OFS, sum[key,i]
            }
            print OFS key
        }
    }

    $ awk -f tst.awk file
    a|b|c|d|e|f|g|h
    IN27201800024099|11.01|190|66|18|3|20.45|IN27201800024099
    IN27201800023963|10|11|72|11|62|62|IN27201800023963
    IN27201800024098|80|67|6|0|1|765|IN27201800024098
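Since the original input file is not shown, here is a small self-contained sketch of the order-preserving approach run against made-up data. The sample rows (keys `K1` and `K2`), the function name `sum_by_key`, and the variable `nf` are all assumptions for illustration and are not part of the original answer. Saving the field count in `nf` instead of reading `NF` in the `END` block is a defensive choice of this sketch; the scripts above read `NF` directly.

```shell
#!/bin/sh
# Sketch: sum fields 2..NF per distinct value of field 1, keeping first-seen order.
# sum_by_key is a hypothetical wrapper name; it reads stdin or the files given as arguments.
sum_by_key() {
    awk '
    BEGIN { FS = OFS = "|" }
    NR == 1 { print $0, "h"; next }            # pass the header through, adding one column
    !seen[$1]++ { keys[++numKeys] = $1 }       # remember each key the first time it appears
    {
        nf = NF                                # field count of the latest record
        for (i = 2; i <= NF; i++) sum[$1, i] += $i
    }
    END {
        for (k = 1; k <= numKeys; k++) {
            key = keys[k]
            printf "%s", key
            for (i = 2; i <= nf; i++) printf "%s%s", OFS, sum[key, i]
            print OFS key                      # repeat the key as the last column
        }
    }' "$@"
}

# Hypothetical sample input: K2 and K1 each appear twice.
printf '%s\n' \
    'a|b|c|d|e|f|g' \
    'K2|1|2|3|4|5|6' \
    'K1|10|20|30|40|50|60' \
    'K2|4|5|6|7|8|9' \
    'K1|1|1|1|1|1|1' | sum_by_key
```

With this made-up input, the script prints the header `a|b|c|d|e|f|g|h`, followed by `K2|5|7|9|11|13|15|K2` and `K1|11|21|31|41|51|61|K1`, that is, one row per key in the order the keys first appeared.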


Answer