extract substrings starting with same pattern in a file

Question

I have a fileA.txt which contains strings: I would like to extract all the substring which start with RS Output: I tried something like this: However I only get the first string RS0247 printed out when I do echo Answer Given the three sample lines pasted above in the file f... Assuming a fixed format: Assuming a flexible format (and

Accepted Answer

Given the three sample lines pasted above in the file f&#8230;Assuming a fixed format:awk '{print $2"_"$4}' fRS0247_RS0255RS0332_RS0451RS0332_RS0247Assuming a flexible format (and that you&#8217;re only interested in the first two occurrences of fields that start with RS:awk '{f=1;for (i=1;i<=NF;i++){if($i~/^RS/){a[f]=$i;f++}}print a[1]"_"a[2]}' fRS0247_RS0255RS0332_RS0451RS0332_RS0247Edit 1:And assuming that you want your own script patched rather than an efficient solution:#!/bin/bashwhile read linedo        str=""        for word in $line        do                if [[ "$word" =~ ^RS ]]                then                        if [[ -z $str ]]                        then                                str=$word                        else                                str+="_${word}"                        fi                fi        done        echo "$str"done < fileA.txtEdit 2:In terms of efficiency; I copied and pasted those 3 lines into fileA.txt 60 times (180 lines total).  The runtimes for the three attempts above in the same order are:real    0m0.002sreal    0m0.002sreal    0m0.011s

Advertisement

Answer