The sed command is not working with regex

Question

I'm parsing the output of an HTTP GET request with sed to retrieve the contents of a given HTML tag. The result of that request is like this:

    "Hello!<p>v1.0.4-b</p>"

I want to retrieve the version number inside the p element. However, sed seems to have a bug in regex parsing. When I use:

    sed 's/.*<p>//'

It correctly replaces

Accepted Answer

You need to use

    sed -n 's~.*<p>\([^<]*\)</p>.*~\1~p'

or, with POSIX ERE enabled,

    sed -n -E 's~.*<p>([^<]*)</p>.*~\1~p'

See the online demo:

    #!/bin/bash
    sed -n 's~.*<p>\([^<]*\)</p>.*~\1~p' <<< "Hello!<p>v1.0.4-b</p>"
    ## => v1.0.4-b

The sed 's/.*<p>(.*)<p>.*/\1/' command would not work because:

- You are using a POSIX BRE pattern, where the unescaped ( and ) are treated as literal parenthesis characters, not as a capturing group. In POSIX BRE, you need \(...\) to define a capturing group (this is why you get the "invalid reference \1" error).
- If you add the -E option to enable POSIX ERE, you can use (...) to define a capturing group.
- You are not matching </p>; you have <p> in the pattern.
- As there are slashes in the pattern, it is more convenient to choose a regex delimiter other than /. I chose ~ here.

Also, I used the -n option to suppress the default line output and the p flag to print only the result of the substitution.
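To make the difference between the three forms concrete, here is a minimal sketch that runs them side by side. It assumes bash (for the <<< here-string) and a sed that accepts -E, which both GNU and BSD sed do. The sample input and the variable names (html, bre_result, ere_result, broken_status) are illustrative only and are not taken from the original request.

```shell
#!/bin/bash
# Illustrative input: an assumed single-line response containing one <p> element.
html='Hello!<p>v1.0.4-b</p>'

# POSIX BRE (sed's default): a capturing group is written as \( ... \).
bre_result=$(sed -n 's~.*<p>\([^<]*\)</p>.*~\1~p' <<< "$html")

# POSIX ERE (enabled with -E): a capturing group is written as ( ... ).
ere_result=$(sed -n -E 's~.*<p>([^<]*)</p>.*~\1~p' <<< "$html")

# The form from the question: in BRE the unescaped ( ) are literal characters,
# so no group exists for \1 to refer to and sed rejects the script.
if sed 's/.*<p>(.*)<p>.*/\1/' <<< "$html" 2>/dev/null; then
  broken_status="ok"
else
  broken_status="failed"
fi

echo "BRE: $bre_result"
echo "ERE: $ere_result"
echo "unescaped group in BRE: $broken_status"
```

Both working forms should print v1.0.4-b, while the last command should report a failure because sed refuses the script before reading any input. Using [^<]* instead of .* keeps the capture from running past the closing tag if the line contains more markup after it.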
