International Protein Index: No Entries Found
12.0 years ago

I've been asked to retrieve the FASTA sequences of a set of proteins identified by their International Protein Index (IPI) identifiers. I've been using http://www.ebi.ac.uk/Tools/dbfetch/dbfetch to retrieve the sequences; however, for a few IPIs the sequences were not found.

Examples:

http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=IPI&id=IPI00000178&format=fasta&style=raw

http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=IPI&id=IPI00000495&format=fasta&style=raw

http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=IPI&id=IPI00896455&format=fasta&style=raw

Could it be a typo in the identifiers, or could these be something like 'deprecated' identifiers?

Thanks

protein sequence fasta • 1.9k views

Production of the IPI database ceased last year (http://www.ebi.ac.uk/IPI/IPIhelp.html). IPI has been replaced by datasets provided by UniProt, which is extending equivalent complete proteome coverage to many other species. Where possible, use of IPI should be replaced by use of the equivalent UniProt entries. If you are only interested in the sequences, I suggest searching UniParc, since it holds all the IPI sequences and provides details of identical sequences in other data sources.

12.0 years ago

On the IPI page you can also query the "IPI History", which finds a record for IPI00000178 and also for the other two.


The IPI identifier tracking database "IPI History" is also available in dbfetch/WSDbfetch, so Pierre can just tweak the URLs he is already using to find out why the entries are missing. Replace the database name (db) with 'ipihistory' and either set the format to 'default' or remove the format specifier. For example:

http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=IPIhistory&id=IPI00000178

http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=IPIhistory&id=IPI00000495

http://www.ebi.ac.uk/Tools/dbfetch/dbfetch?db=IPIhistory&id=IPI00896455
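If you have more than a handful of identifiers, it may be easier to generate these URLs than to edit them by hand. Here is a minimal Python sketch that only builds the query strings in the form shown in this thread; the helper name `dbfetch_url` is my own invention, and whether the service still answers these queries is not something the code checks.

```python
from urllib.parse import urlencode

# Base URL taken from the examples in this thread.
DBFETCH_BASE = "http://www.ebi.ac.uk/Tools/dbfetch/dbfetch"


def dbfetch_url(db, entry_id, fmt=None, style=None):
    """Build a dbfetch URL (hypothetical helper, not part of any EBI client).

    'format' and 'style' are only added when given, which mirrors the
    advice to drop the format specifier for the IPI History database.
    """
    params = {"db": db, "id": entry_id}
    if fmt is not None:
        params["format"] = fmt
    if style is not None:
        params["style"] = style
    return DBFETCH_BASE + "?" + urlencode(params)


# The three identifiers from the question.
ids = ["IPI00000178", "IPI00000495", "IPI00896455"]

for ipi in ids:
    # History lookup: no format specifier, as suggested above.
    print(dbfetch_url("IPIhistory", ipi))
```

Calling `dbfetch_url("IPI", "IPI00000178", "fasta", "raw")` gives back the original sequence query from the question, so the same helper covers both cases.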

The returned tab-delimited table shows which versions of IPI the identifier appeared in and why an entry was removed (see http://www.ebi.ac.uk/IPI/HistoryFormat.html). If you need the specific entry/sequence data the IPI identifier referred to, you can either look up the identifier in UniParc (it helps to have the full identifier including the sequence version), which is also available in dbfetch/WSDbfetch, or use the IPI release information from "IPI History" to identify the required release files on the FTP site (ftp://ftp.ebi.ac.uk/pub/databases/IPI/).
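Since the result is tab-delimited, it can be split into rows and fields with the standard library. The sketch below assumes only that rows are newline-separated and fields are tab-separated; it does not assume any column names or meanings (those are described on the HistoryFormat page), and the function name `parse_history_table` and the sample text are made-up placeholders, not real IPI History output.

```python
import csv
import io


def parse_history_table(text):
    """Split tab-delimited text into a list of rows (lists of strings).

    Blank lines are skipped. Quoting is disabled so that any quote
    characters in the fields are kept literally.
    """
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    return [row for row in reader if row]


# Placeholder input only, to show the shape of the result.
sample = "field1\tfield2\tfield3\n\nvalueA\tvalueB\tvalueC\n"

for row in parse_history_table(sample):
    print(row)
```

In practice you would pass in the text fetched from one of the IPI History URLs and then pick out the columns you need according to the format documentation.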
