Bash: Read in file, edit line, output to new file

Question

I am new to Linux and new to scripting. I am working in a Linux environment using bash. I need to do the following things:
1. read a txt file line by line
2. delete the first line
3. remove the middle part of each line after the first
4. copy the changes to a new txt file

Each line

Accepted Answer

Updated answer assuming tab delimiter

Since there is a tab delimiter, this is a cinch for awk. Borrowing from my originally deleted answer and @geek1011's deleted answer:

    awk -F"\t" '{print $1, $NF}' infile.txt

Here awk splits each record in your file by tab, then prints the first field $1 and the last field $NF, where NF is the built-in awk variable for the record's Number of Fields; prepending a dollar sign means "the value of the last field in the record". To drop the header line as well, put the condition NR>1 in front of the braces, as in the original answer below.

Original answer assuming space delimiter

Leaving this here in case someone has space delimited nonsense like I originally assumed.

You can use awk instead of using bash to read through the file:

    awk 'NR>1{firstRec=""; for(i=1; $i!~/pdf/; ++i) firstRec=firstRec" "$i} NR>1{print firstRec,$i,$NF}' yourfile.txt

awk reads files line by line and processes each record it comes across. Fields are delimited automatically by white space. The first field is $1, the second is $2 and so on. awk has built-in variables; here we use NF, which is the Number of Fields contained in the record, and NR, which is the record number currently being processed.

This script does the following:
1. If the record number is greater than 1 (not the header), reset firstRec so that text from the previous record does not carry over.
2. Loop through each field (separated by white space here) until we find a field that has "pdf" in it ($i!~/pdf/). Store everything we find up until that field in a variable called firstRec, separated by a space (firstRec=firstRec" "$i). The loop assumes every data line has a field containing "pdf"; without one it never terminates.
3. Print out firstRec, then whatever field we stopped iterating on (the one that contains "pdf"), which is $i, and finally the last field in the record, which is $NF (print firstRec,$i,$NF).

You can direct this to another file:

    awk 'NR>1{firstRec=""; for(i=1; $i!~/pdf/; ++i) firstRec=firstRec" "$i} NR>1{print firstRec,$i,$NF}' yourfile.txt > outfile.txt

sed may be a cleaner way to go here: awk splits on runs of white space and the fields are rejoined with single spaces, so if your pdf file name has more than one space in a row, the extra spaces are lost.
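For the tab-delimited case, here is a minimal end-to-end sketch that covers all four steps from the question: it skips the header, keeps only the first and last columns, and writes the result to a new file. The sample rows and column names are made up for illustration, since the real file layout is not shown; only the file names infile.txt and outfile.txt come from the commands above.

```shell
# Hypothetical sample input: one header line, then tab-separated records.
# The column names and values are assumptions made for this example.
printf 'name\tpath\tsize\tdate\n'                  >  infile.txt
printf 'report one\t/tmp/a.pdf\t12\t2015-01-02\n'  >> infile.txt
printf 'notes\t/tmp/b.pdf\t7\t2015-03-04\n'        >> infile.txt

# NR>1 skips the header line, -F'\t' splits each record on tabs,
# OFS='\t' keeps the output tab-separated, and $1 / $NF are the
# first and last fields of the record.
awk -F'\t' -v OFS='\t' 'NR>1 {print $1, $NF}' infile.txt > outfile.txt

cat outfile.txt
```

Because the split is on tabs only, spaces inside a field (such as "report one") survive untouched, which is the main advantage over the whitespace-splitting version.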

