Genbank: Gi Numbers Vs Accession Numbers?
2
2
Entering edit mode
13.2 years ago
Bio_X2Y ★ 4.4k

Some documentation I found suggests that a GenBank GI number will change each time the sequence changes - even if only one base is affected. The Accession number, on the other hand, remains the same.

However, the accession number is usually qualified with a version number suffix, e.g. "GL000191.1". As far as I know, this version number also increments each time the sequence changes.

Does this mean that the relationship between GI number and the Accession.Version pair is one-to-one, and so either would be equally suitable as a unique identifier for a sequence?

Thanks.

genbank identifiers • 15k views
ADD COMMENT
3
Entering edit mode

Yes, that's right. Accession.Version is probably preferred by humans, and GI number by machines.

ADD REPLY
2
Entering edit mode

Sorry, I can't resist :-P http://lmgtfy.com/?q=NCBI+gi

ADD REPLY
0
Entering edit mode

@Pierre, hmmmm, while I can see the funny side, I had read that page, but I still wasn't 100% clear if the relationship was always one-to-one... :) e.g. I once thought that the version number would increment if the sequence changed OR the meta-information of the sequence changed, e.g. the gene symbol. I now know that isn't the case now, but I thought maybe GI would increment on a meta change....

ADD REPLY
1
Entering edit mode
13.2 years ago

Yes, you're right: see http://www.ncbi.nlm.nih.gov/Sitemap/sequenceIDs.html

The two systems of identifiers run in parallel to each other. That is, when any change is made to a sequence, it receives a new GI number AND an increase to its version number.

ADD COMMENT
1
Entering edit mode
13.2 years ago

Your observation is true. They run in parallel, but GI system is older. So, there's a lot of sequences tagged with version 1 but with many GIs in its history. For example, check the history of L42023 and its subsequences at the NCBI Sequence Revision History. You'll see a many GI changes without a version number associated with it.

Besides that, GI and versions change only when the sequence itself changes, not the annotations. Changes in annotations can be traced only by modification date. So, GIs and versions are mostly useless.

ADD COMMENT

Login before adding your answer.

Traffic: 3077 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6