Convert column to matrix format using awk

I have a gridded data file in column format as:

JavaScript

I would like to convert it to matrix format as:

JavaScript

The top row (20.5 21.5 22.5) gives the y values, the leftmost column gives the x values, and the inner values are the corresponding grid values.

I found a similar question, "Convert a 3 column file to matrix format", but the script from it does not work in my case.

The script is

JavaScript

Answer

The following awk script handles:

  • a matrix of any size
  • row and column indices that are unrelated to each other, so it keeps track of them separately
  • missing entries: if a certain row/column index pair does not appear, the value defaults to zero

It is done as follows:

JavaScript

How it works:

  • PROCINFO["sorted_in"] = "@ind_num_asc" makes every for (i in array) loop traverse the array in ascending numeric order of its indices.
  • (NR==1){next} skips the first line.
  • {row[$1]=1;col[$2]=1;val[$1" "$2]=$3} processes each remaining line by recording the row index, the column index and the accompanying value.
  • The END block does all the printing.

This outputs:

JavaScript

Note: PROCINFO is a gawk (GNU awk) feature.
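
The steps described in "How it works" can be sketched roughly as follows. This is a minimal sketch, not necessarily the exact script from the answer. It assumes gawk is installed, the input is whitespace-separated with three columns (row index, column index, value), and the first line is a header. The function name `to_matrix` and the 8-character column width are illustrative choices.

```shell
# Hypothetical wrapper around gawk; reads the named file(s) or stdin.
to_matrix() {
  gawk '
    # gawk only: traverse arrays in ascending numeric index order
    BEGIN { PROCINFO["sorted_in"] = "@ind_num_asc" }
    # skip the first line (assumed to be a header)
    NR == 1 { next }
    # remember which row and column indices occur, and the value of each pair
    { row[$1] = 1; col[$2] = 1; val[$1 " " $2] = $3 }
    END {
      # header row: empty corner cell, then every column index
      printf "%8s", ""
      for (j in col) printf "%8s", j
      printf "\n"
      # one output line per row index; missing pairs print as 0
      for (i in row) {
        printf "%8s", i
        for (j in col) {
          key = i " " j
          printf "%8s", ((key in val) ? val[key] : 0)
        }
        printf "\n"
      }
    }
  ' "$@"
}
```

It would be called as `to_matrix ifile.txt`, where `ifile.txt` is a placeholder file name. Testing `key in val` explicitly avoids creating empty array elements for missing pairs.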

However, if you make a few assumptions, the script can be much shorter:

  • the file contains all possible entries, with no missing values
  • you do not want the row and column indices printed
  • the entries are sorted in column-major order

Then you can use the following short versions:

JavaScript

which outputs

JavaScript

or for the transposed:

JavaScript

This outputs

JavaScript
User contributions licensed under: CC BY-SA