How to grep for a pattern in the files in tar archive without filling up disk space

Question

I have a tar archive which is very big ~ 5GB. I want to grep for a pattern on all files (and also print the name of the file that has the pattern ) in the archive but do not want to fill up my disk space by extracting the archive. Anyway I can do that? I tried these, but

Accepted Answer

Here&#8217;s my take on this:while read filename; do tar -xOf file.tar "$filename" | grep 'pattern' | sed "s|^|$filename:|"; done < <(tar -tf file.tar | grep -v '/$')Broken out for explanation:while read filename; do &#8212; it&#8217;s a loop&#8230;tar -xOf file.tar "$filename" &#8212; this extracts each file&#8230;| grep 'pattern' &#8212; here&#8217;s where you put your pattern&#8230;| sed "s|^|$filename:|"; &#8211; prepend the filename, so this looks like grep. Salt to taste.done < <(tar -tf file.tar | grep -v '/$') &#8212; end the loop, get the list of files as  to fead to your while read.One proviso: this breaks if you have OR bars (|) in your filenames.Hmm.  In fact, this makes a nice little bash function, which you can append to your .bashrc file:targrep() {  local taropt=""  if [[ ! -f "$2" ]]; then    echo "Usage: targrep pattern file ..."  fi  while [[ -n "$2" ]]; do        if [[ ! -f "$2" ]]; then      echo "targrep: $2: No such file" >&2    fi    case "$2" in      *.tar.gz) taropt="-z" ;;      *) taropt="" ;;    esac    while read filename; do      tar $taropt -xOf "$2"        | grep "$1"        | sed "s|^|$filename:|";    done < <(tar $taropt -tf $2 | grep -v '/$')  shift  done}

Advertisement

Answer