Unix shell script to search for error codes in thousand files then print the count in text file

Question

I need to find both 150+ eventType and errorCodes in 1700 files each day. That means i have to loop over 1700 files to find the occurrence count of 150+ eventType/errorCode and put those counts in a text file as a daily report. I have placed those eventType/errorCode values in a text file separated by commas:…

Accepted Answer

Here is a GNU awk script (as its own script file, for reusability) that parses the event types and error codes the log file and reports the counts of matching event types and error codes for each date.#!/usr/bin/awk -f/^[0-9]+,[0-9]+$/ {    # this line contains event type and error code    split($0, data, ",");    keys[data[1]][data[2]] = 0;}match($0, "EventType=([0-9]+).*ErrorCode=([0-9]+)", key) {    # this line is from the log file    if (key[1] in keys && key[2] in keys[key[1]]) {        match($0, "OrigEventTime=([0-9-]+)", date);        datecount[date[1]][key[1]][key[2]]++;    }}END {    for (d in datecount) {        for (k1 in datecount[d]) {            for (k2 in datecount[d][k1]) {                printf("%st%s/%st%dn",                        d, k1, k2, datecount[d][k1][k2]);            }        }    }}Running it (note thot this requires GNU awk):$ awk -f script.awk codes.txt run.logThe output is not quite in the format that you wanted, but I&#8217;m hoping it&#8217;s close enough:2016-06-11  10008/4569  12016-06-21  10008/4569  42016-06-21  40000/4006  1(I duplicated the data that you gave us a few times and change a date and one of the event types and error codes).UPDATE: I reworked the script for GNU awk versions older than 4.0 (that do not understand arrays of arrays):#!/usr/bin/awk -f/^[0-9]+,[0-9]+$/ {    # this line contains event type and error code    split($0, data, ",");    keys[data[1],data[2]] = 1;}match($0, "EventType=([0-9]+).*ErrorCode=([0-9]+)", key) {    # this line is from the log file    if (keys[key[1],key[2]] == 1) {        match($0, "OrigEventTime=([0-9-]+)", date);        count[date[1],key[1],key[2]]++;    }}END {    for (comb in count) {        split(comb, field, SUBSEP);        printf("%st%s/%st%sn", field[1], field[2], field[3], count[comb]);    }}

Advertisement

Answer