Uniq skipping middle part of the line when comparing lines

Question

Sample file aabbccddeeffgghhiijj aabbccddeellgghhiijj aabbccddeeffgghhiijj I want to skip 6th field &#8216;ff&#8217; when comparing for an unique line, also I want the count of # of duplicate lines in front. I tried this, without any luck: Expected output 3 aabbccddee*gghhiijj Answer Discussion With &#8211;un…

Accepted Answer

$ awk -F'' -v OFS='' '{$6="*"} 1' xslin1 | sort | uniq -c      3 aabbccddee*gghhiijjDiscussionWith --unique, sort outputs only unique lines but it does not count them.  One needs uniq -c for that.  Further, sort outputs all unique lines, not just those that sort to the same value.The above solution does the simple approach of assigning the sixth field to *, as you wanted in the output, and then uses the standard pipeline, sort | uniq -c, to produce the count of unique lines.

Advertisement

Answer

Discussion