How to get a strain name if I know only an assembly ID?
1
0
Entering edit mode
5.6 years ago
little_more ▴ 70

Suppose I have a tree with assembly IDs of different E.coli strains in it (all from NCBI). Is there any common way to replace each assembly ID with the corresponding strain name? I tried using BioPython (efetch) but it raises an error.

assembly • 1.7k views
ADD COMMENT
0
Entering edit mode

Always post sample ID's when asking a question like this. Posting your code/actual error is also beneficial if you don't want an alternate answer.

ADD REPLY
0
Entering edit mode
5.6 years ago
vkkodali_ncbi ★ 3.7k

You can try something like this using Entrez Direct (https://www.ncbi.nlm.nih.gov/books/NBK179288/)

esearch -db assembly -q '854998' | esummary | xtract -pattern DocumentSummary -def "NA" -element Taxid Genbank RefSeq Organism

You can replace the id 854998 (which is a RefSeq assembly ID) with either assembly UID, Genbank assembly ID, or an NCBI assembly accession.

ADD COMMENT

Login before adding your answer.

Traffic: 2303 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6