Parsing Coding Seq w grep
1
0
Entering edit mode
6.2 years ago

Hello,

I have downloaded the E.coli ref strain K-12 MG1655 coding sequence (https://www.ncbi.nlm.nih.gov/nuccore/NC_000913.3). I want to extract certain sequences of genes like gyrA, 23SrRNA,... etc

when using grep like

grep "[gene=gyrA]" EColiK12MG1655_cds.fasta it simply prints everything starting from gyrA (and the letters turn red and White for unknown reasons).

Best regards

genome gene • 1.0k views
ADD COMMENT
0
Entering edit mode
6.2 years ago

I assume you want to extract sequence from the GenBank file. If this is true, there are dedicated libraries that will make this a whole lot easier, See this post

ADD COMMENT

Login before adding your answer.

Traffic: 1472 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6