Question

How is it possible that multiple proteins maps to single transcript ids ?

0

Entering edit mode

6.4 years ago

jack ▴ 960

Hi all,

I've downloaded transcript mapping to PDB annotation from GENECODE

When I look at the table, I see that one transcript id maps to multiple proteins. I was pondering about the reason behind of it. Here are some examples:

                  V1   V2
1  ENST00000567584.1 1UJU
2  ENST00000567584.1 1WHA
3  ENST00000567584.1 1X5Q
4  ENST00000567584.1 2W4F
5  ENST00000569114.1 1UJU
6  ENST00000569114.1 1WHA
7  ENST00000569114.1 1X5Q
8  ENST00000569114.1 2W4F
9  ENST00000564122.1 1UJU
10 ENST00000564122.1 1WHA

protein RNA-Seq sequencing ChIP-Seq • 1.2k views

ADD COMMENT • link updated 11 months ago by Ram 43k • written 6.4 years ago by jack ▴ 960

score 0 · Answer 1 · 2017-11-28

0

Entering edit mode

6.4 years ago

Hussain Ather ▴ 990

A single transcript can have multiple ORFs that can give rise to multiple protein sequences.

ADD COMMENT • link 6.4 years ago by Hussain Ather ▴ 990

score 0 · Answer 2 · 2017-11-28

0

Entering edit mode

6.4 years ago

Istvan Albert 100k

It helps to look up the definitions for each:

https://www.ensembl.org/Homo_sapiens/Transcript/Idhistory?t=ENST00000569114

https://www.ensembl.org/Homo_sapiens/Transcript/Idhistory?t=ENST00000567584

Note how in this case both are marked as "retired".

In summary the data mapping stores all mappings, even historical ones that are now obsolete.

ADD COMMENT • link 6.4 years ago by Istvan Albert 100k

0

Entering edit mode

This problem should be addressed in next Ensembl release: C: Ensembl ID converter not working?

ADD REPLY • link 6.4 years ago by GenoMax 141k