Batch download UniProt proteomes with list of proteome IDs
2
4
Entering edit mode
6.3 years ago
JMac ▴ 40

Is there any way to batch download UniProt proteomes based on a list of proteome IDs (e.g. UP000011712).

I have a list of about 1,000 proteome IDs from UniProt. I know it is possible to do this with taxon IDs but I do not have these.

So, is it possible to batch download proteomes based on proteome IDs, or is there any way to convert proteome ID to taxon identifier?

Thanks for any help.

uniprot proteome script • 4.5k views
ADD COMMENT
5
Entering edit mode
6.3 years ago

You can do this programmatically using URLs like https://www.uniprot.org/uniprot/?query=proteome:UP000005640&format=fasta (FASTA format)

Use format=txt, format=xml etc for other download formats.

There is a related FAQ for programmatic access with a code example to download the UniProt reference proteomes for all organisms below a given taxonomy node in compressed FASTA format: https://www.uniprot.org/help/api_downloading

ADD COMMENT
0
Entering edit mode

Thanks, this is exactly what I was looking for. I rewrote the example code to accept proteome IDs instead of taxa IDs.

ADD REPLY
0
Entering edit mode

But only the index html file is getting downloaded instead of the fasta file. How can i retrieve the fasta of th ereference proteome ID ??

ADD REPLY
0
Entering edit mode

This RESTful URL should work: https://www.uniprot.org/uniprot/?query=proteome:UP000005640&format=fasta Please contact the UniProt helpdesk if the problem persists. If you include your code, we can look into your problem in detail.

ADD REPLY
1
Entering edit mode
6.3 years ago

The README file for reference proteomes has UniProt ID mapping to taxon ID. You can also download files by taxonomic division. Locate the file(s) containing the species you're interested in and filter them based on species.

ADD COMMENT

Login before adding your answer.

Traffic: 2517 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6