GI identifiers lost
0
0
Entering edit mode
6.6 years ago

I have a set of proteins from diverse Bacteria. I'm trying to analyze them using DAVID but it cannot recognize more than 80% of my GI numbers. Here you have one example:

gi|386865221|ref|YP_006278169.1| membrane-anchored cell surface protein [Burkholderia pseudomallei 1026b]

So, the GI numbers is:

386865221

But DAVID (with identifier as PROTEIN_GI_ACCESSION) cannot recognize it. I also tried with NCBI to find the GI number but, there is nothing. Finally, I did a protein blast with the original protein sequence (see below) and I obtained a 100% match protein:

AFI69771.1 membrane-anchored cell surface protein [Burkholderia pseudomallei 1026b]

Strikingly, I tried to make an ID conversion with biodbnet but, again, I didn't found anything.

GI identifier without any AFI associated

My question is, is there any easy way to relate my GI and AFI identifiers?

gi|386865221|ref|YP_006278169.1| membrane-anchored cell surface protein [Burkholderia pseudomallei 1026b] MNKIYKTIWCETTRSWVAVSEHANGKRGGATAAATTSARPIWTRLRGISLAALAAFGLGLFASPAAFAQS NSVMCANYNNGILPTYTGYGASPSLTSPCTTGIGSWAGGVTPGSTTNWIGLSADDTQIVLNGSTGNIYFR AGGTNGNTLTMSNVAGSGPTGGVLLSGVAAGAVTATSSQAINGSQLYSLSTSASTGIGSLSSSMSTFNSS ISSLSTGLSSTNSGLTSLSTSASTGLSSANSSIASLSSGLSSTNSSLTSLSTSASSGISTAQSGVNSLST GLSTTNSTVASLSTSTSTGIGSLSTGLSSTNSSLTSLSTSASSGISSANSSVASLSTSTSTGIGSLSTGL ISTNSSLTSLSTSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGIGSLSTGLSSTNSSLTSLSTS ASSGISSANSSVASLSTSTSTGIGSLSTGLSSTNSSLTSLSTSASSGISSANSSVASLSTSTSTGIGSLS TGLSSTNSSLTSLSTSASSGISTAQSGVNSLSTGLSTTNSTVASLSTSTSTGIGSLSTGLSSTNSSLTSL STSTSSGISSANSSIASLSTSTSTGISSLSTGLSTTDSNLASLSTSTSTGLSSATSSIASLSTSTSTGIG SLSTGLSTTNSNLTSLSTSTSTGLSSANSSITSLSSGLSTTNSNVASLSSGLSSTNSSLTSLSTSTSSGI SSANSSVASLSTSTSTGIGSLSTGLSTTNSNLTSLSTSTSTGLSSANSSITSLSSGLSTTNSNVASLSTG LSSTNSSLTSLSTSTSSGISSANSSIASLSTSTSTGISSLSTGLSTTDSNLTSLSTSTSTGLSSANSSIT SLSSGLSTTNSNVASLSSGLSATNSSLTSLSTSASSGISSANSSVASLSTSTSTGIGSLSTGLSTTNSNL TSLSTSTSTSLSSANSSITSLSSGLSTTNSNVASLSTGLSSTNSSLTSLSTSASSGISSANSSVASLSTS TSTGLSSANSSITSLSSGLSTTNSNVASLSSGLSTTNSSLTSLSTSASSGISSANSSIASLSTSTSTGIS SLSTGLSTTDSNLASLSTSTSTGLSSAASSIASLSTSTSTGISSLSTGLSTTDSNLTSLSTSTSTGLSSA NSSITSLSSGLSTTNSNVASLSSGLSTTNSSVASLSTGLSTTDSNLASLSTSTSTGLSSTTSSIASLSTS TSTSVDSLSTGLSTTNSSVASLSTGLSTTDSSLASLSTSTSTGLSSTTSSIASLSTSTSTSFSSALSSIG SLSTGLATTNSNLASLSTSTLTSVSSLSTGLSATNSSVASLSTSASTGLAATNSTVASLSTSTSTAVGSL STGLSTTNSNVASLSTSTSTAVGSLSTSLSTTNSNVASLSTSTSTSVNSLSTGLSTTNTSVASLSTSVTN LNTQLTSLSTTIVNSTNNVIRALPASTGIAADMSAPNAAAPSVTAGSNSVALGANSTDGGRSNVVSVGSA TQQRQITNVAAGTEGTDAVNVNQLNALSTSMSQSLAGQQGQINNLGSQLTQTQQALQQTDTMARQGIAAA TALTMLPQVEPGKTINVAVGVARFAGQSGMAFGASAHVTTNGILKLGIGVSGQNKTFGAGYGYSW

identifier GI protein conversion DAVID • 1.3k views
ADD COMMENT

Login before adding your answer.

Traffic: 2596 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6