Remove duplicates and keep line which contains max value from one column – LINUX

Question

everyone! I'd like to remove duplicates and keep lines with the highest value from one column (4th column) in a file with 4 fields. I must do this in a Linux server. Before After Thank you so much and I'm sorry if I asked something repeated! But I didn't find an answer for my problem. Answer You can try this,

Accepted Answer

You can try this, if it is no problem to get the output without the header:tail -n +2 file.txt | sort -k1,1 -k4,4rn | sort -uk1,1Explanation:tail -n +2 file.txtwill remove the headers so they don&#8217;t get involved in all the sorting.sort -k1,1 -k4,4rnwill sort by column 1 first (-k1,1) and then by column 4 numerically and in reverse order (-k4,4rn)Finally: sort -uk1,1Will remove duplicates taking into account just the first column.Be aware that -k1,1 means from column one to column one, hence -k4,4 is from column 4 to column 4. Adjust to fit your columns.

Advertisement

Answer