Using awk to count the number of occurrences of patterns from another file

Question

I am trying to take a file containing a list and count how many times the items in that list occur in a target file. I have co-opted some code to get the counts for the values that are present in target.txt, but the output does not include the desired items that are in list.txt but not in target.txt.

Accepted Answer

    awk '
      NR==FNR{a[$0]; next}
      {
        for(i=1; i<=NF; i++){
          if ($i in a){ a[$i]++ }
        }
      }
      END{
        for(key in a){ printf "%s %d\n", key, a[key] }
      }
    ' list.txt target.txt

NR==FNR{a[$0]; next}
The condition NR==FNR is only true for the first file, so the keys of array a are the lines of list.txt.

for(i=1; i<=NF; i++)
Now for the second file, this loops over all its fields.

if ($i in a){ a[$i]++ }
This checks if the field $i is present as a key in the array a. If yes, the value (initially zero) associated with that key is incremented.

At the END, we just print the key followed by the number of occurrences a[key] and a newline (\n).

Output:

    blonde 2
    red 0
    black 0

Notes:

Because of %d, the printf statement forces the conversion of a[key] to an integer in case it is still unset. The whole statement could be replaced by a simpler print key, a[key]+0. I missed that when writing the answer, but now you know two ways of doing the same thing. 😉

In your attempt you were, for some reason, only addressing field 2 ($2), ignoring the other columns.
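For anyone who wants to try the print key, a[key]+0 variant mentioned in the notes, here is a minimal sketch. The sample contents of list.txt and target.txt are assumptions made up for illustration (the question's original files are not shown); they were chosen only so that the counts match the output above. Since awk does not guarantee any order for for (key in a), the result is piped through sort to make it stable.

```shell
# Hypothetical sample data: NOT the asker's real files.
tmp=$(mktemp -d)
printf '%s\n' blonde red black > "$tmp/list.txt"
printf '%s\n' 'anna blonde tall' 'bob blonde short' 'carl grey tall' > "$tmp/target.txt"

# Same counting logic as the answer, but printing with a[key]+0
# instead of printf "%d". Adding 0 turns an unset value into 0.
result=$(awk '
  NR==FNR { a[$0]; next }                              # first file: register each line as a key
  { for (i = 1; i <= NF; i++) if ($i in a) a[$i]++ }   # second file: count matching fields
  END { for (key in a) print key, a[key]+0 }           # print every key, even with zero hits
' "$tmp/list.txt" "$tmp/target.txt" | sort)

echo "$result"
rm -rf "$tmp"
```

With this made-up data, red and black never appear in target.txt, yet both are still printed with a count of 0 because their keys were created while reading list.txt.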

Answer