How to delete lines that match elements from another file

Question

I am in the process of learning Perl and I am trying to figure out how to do this task. I have a folder with a bunch of text files and I have a file ions_solvents_cofactors that contains bunch of three letters list. I wrote a script that opens and reads each file in a folder and should delete those

Accepted Answer

I would make use of theTie::Filemodule, which allows you to tie an array to the module so that any changes you make to the array are reflected in the fileI’ve used glob to find all the .txt files, with the option :bsd_glob so as to support spaces in the file pathsThe first job is to build a hash %matches that maps all the values in ions_solvents_cofactors to 1. This makes it trivial to test the PDB files for the required valuesThen it’s just a matter of using tie on each .txt file, and testing each line to see whether the value in column 4 is represented in the hashI use variable $i to index into the @file array which maps the on-disk file. If a match is found then the array element is deleted with splice @file, $i, 1. (This naturally leaves $i indexing the next element in sequence without incrementing $i.) If there is no match then $i is incremented to index the next array element, leaving the line in placeuse strict;use warnings 'all';use File::Glob ':bsd_glob';use Tie::File;my %matches = do { open my $fh, '<', 'ions_solvents_cofactors.txt'; local $/; map { $_ => 1 } split ' ', <$fh>;};for my $pdb ( glob '*.txt' ) { tie my @file, 'Tie::File', $pdb or die $!; for ( my $i = 0; $i < @file; ) { next unless my $col4 = ( split ' ', $file[$i] )[3]; if ( $matches{$col4} ) { printf qq{Removing line %d from "%s"n}, $i+1, $pdb; splice @file, $i, 1; } else { ++$i; } } }

Advertisement

Answer