downloading protein sequences
1
0
Entering edit mode
7.0 years ago

i have a list of genes and i want to download the protein sequences of the all genes of mouse at a time using ncbi tool kit please help me in this issue?

R RNA-Seq sequence gene • 1.7k views
ADD COMMENT
1
Entering edit mode
ADD REPLY
0
Entering edit mode
7.0 years ago
Bill Pearson ★ 1.0k

Take a look at the slides in this link:

http://fasta.bioch.virginia.edu/biol4230/lects/biol4230_9_web_resources.pdf

Slides 4 - 7 show a general strategy for using NCBI's esearch/efetch sites to (1) look up the refseq protein accession for your genes using a search term (your gene name) and (2) download the protein sequences using the accession. Slide 7 does the entire process for a search term; you would like to substitute the line:

s_args = "db=protein&term=GSTM*+AND+human[organism]"+ \
               "+AND+srcdb_refseq[prop]&idtype=acc&retmax=1000";

for something where:

 "term="db=protein&term="+your_mouse_gene_name+"AND mouse[organism]" + ...

Other slides in the lecture describe other strategies from other resources, but if you want to use NCBI, you will need to do the two step esearch/efetch process.

ADD COMMENT

Login before adding your answer.

Traffic: 1709 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6