How is it possible that multiple proteins maps to single transcript ids ?
2
0
Entering edit mode
6.4 years ago
jack ▴ 960

Hi all,

I've downloaded transcript mapping to PDB annotation from GENECODE

When I look at the table, I see that one transcript id maps to multiple proteins. I was pondering about the reason behind of it. Here are some examples:

                  V1   V2
1  ENST00000567584.1 1UJU
2  ENST00000567584.1 1WHA
3  ENST00000567584.1 1X5Q
4  ENST00000567584.1 2W4F
5  ENST00000569114.1 1UJU
6  ENST00000569114.1 1WHA
7  ENST00000569114.1 1X5Q
8  ENST00000569114.1 2W4F
9  ENST00000564122.1 1UJU
10 ENST00000564122.1 1WHA
protein RNA-Seq sequencing ChIP-Seq • 1.2k views
ADD COMMENT
0
Entering edit mode
6.4 years ago
Hussain Ather ▴ 990

A single transcript can have multiple ORFs that can give rise to multiple protein sequences.

ADD COMMENT
0
Entering edit mode
6.4 years ago

It helps to look up the definitions for each:

https://www.ensembl.org/Homo_sapiens/Transcript/Idhistory?t=ENST00000569114

https://www.ensembl.org/Homo_sapiens/Transcript/Idhistory?t=ENST00000567584

Note how in this case both are marked as "retired".

In summary the data mapping stores all mappings, even historical ones that are now obsolete.

ADD COMMENT
0
Entering edit mode

This problem should be addressed in next Ensembl release: C: Ensembl ID converter not working?

ADD REPLY

Login before adding your answer.

Traffic: 1965 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6