Dear users, I have a large file with the following columns
chr10_46938 EXON=28/28 STRAND=-1 ENSP=ENSGALP00000004070 SIFT=tolerated(0.38)
chr10_46966 EXON=28/28 STRAND=-1 DOMAINS=Low_complexity_(Seg):Seg SIFT=tolerated(0.66) ENSP=ENSGALP00000004070
chr10_46987 EXON=28/28 STRAND=-1 SIFT=tolerated(0.93) ENSP=ENSGALP00000004070
chr10_47071 ENSP=ENSGALP00000004070 SIFT=tolerated(0.97) EXON=28/28 STRAND=-1
chr10_47164 EXON=28/28 STRAND=-1 DOMAINS=Low_complexity_(Seg):Seg SIFT=tolerated(0.37) ENSP=ENSGALP00000004070
chr10_47466 ENSP=ENSGALP00000004070 SIFT=tolerated(0.11) STRAND=-1 EXON=28/28 DOMAINS=PROSITE_profiles:PS50196,Pfam_domain:SSF50729
I want to select only the first column and the SIFT=tolerated(..)
information, but this is not in fixed columns, example column 2. How to select only this information I want to have for example the following output :
chr10_46938 SIFT=tolerated(0.38)
chr10_46966 SIFT=tolerated(0.66)
chr10_46987 SIFT=tolerated(0.93)
chr10_47071 SIFT=tolerated(0.97)
chr10_47094 SIFT=tolerated(1)
chr10_47164 SIFT=tolerated(0.37)
chr10_47466 SIFT=tolerated(0.11)
What command to use on UNIX to get this list?