Biostar Beta. Not for public use.
Question: Repeated product tag in a GenBank file

While parsing all annotation records from the Pongo abelii (Sumatran orangutan) genome, I noticed that some annotations are repeated: same location, same strand, even the same note tag, but (in some cases) a different product tag. Here's an example from a log:

Species: PONAB | 42202927:42211982(+) | Product: zinc finger protein 155 isoform X1 | Note: By Gnomon.
Species: PONAB | 42202927:42211982(+) | Product: zinc finger protein 155 isoform X1 | Note: By Gnomon.
Species: PONAB | 42202927:42211982(+) | Product: zinc finger protein 155 isoform X1 | Note: By Gnomon.
Species: PONAB | 42202927:42211982(+) | Product: zinc finger protein 155 isoform X1 | Note: By Gnomon.
Species: PONAB | 42204537:42211982(+) | Product: zinc finger protein 155 isoform X2 | Note: By Gnomon.

What does this mean? Are they redundant? Should I remove repeated annotations or not?

Best regards, Marcos

Duplicated features do happen from time to time, and they can often be safely ignored, depending on what you're doing. They may be an artifact of a reannotation process or similar.

It's worth noting that one of those entries has a different start coordinate, so it may be something subtly different (since those are isoforms, it may be the same entity but with a legitimately different start site).
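
If you decide to collapse the exact repeats, a minimal sketch along these lines might help. It assumes each annotation has already been reduced to a (start, end, strand, product, note) tuple; the `Annotation` type and `dedupe` helper are hypothetical names for illustration, not part of any parser. With Biopython, for instance, those fields would typically come from each feature's `location` and `qualifiers`. Because the key includes the coordinates and the product, the isoform X2 entry with the different start is kept as a separate record.

```python
from collections import Counter
from typing import NamedTuple


class Annotation(NamedTuple):
    """Hypothetical record holding the fields shown in the log above."""
    start: int
    end: int
    strand: str
    product: str
    note: str


def dedupe(annotations):
    """Return unique annotations in first-seen order, plus how often each occurred."""
    counts = Counter(annotations)
    # dict.fromkeys keeps the first occurrence of each key and preserves order.
    unique = list(dict.fromkeys(annotations))
    return unique, counts


# The five entries from the log: four identical X1 records and one X2 record.
x1 = Annotation(42202927, 42211982, "+", "zinc finger protein 155 isoform X1", "By Gnomon.")
x2 = Annotation(42204537, 42211982, "+", "zinc finger protein 155 isoform X2", "By Gnomon.")
records = [x1, x1, x1, x1, x2]

unique, counts = dedupe(records)
for ann in unique:
    print(f"{ann.start}:{ann.end}({ann.strand}) | {ann.product} | seen {counts[ann]}x")
```

Keeping the counts around, rather than silently dropping the repeats, makes it easy to log which features were duplicated before deciding whether they matter for your analysis.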

9 months ago
Joe

This may be a glitch from Gnomon (NCBI's eukaryotic gene prediction tool). Let NCBI know by emailing the help desk.

9 months ago
genomax