Automatic download for large quantity of genes for Metagenome database creation
1
0
Entering edit mode
6.5 years ago

I need to create a database of 400 different genes for my simulated metagenome, but it takes too much time to download one by one. So, is there any form to download then all at once or any tool to download them for me?

sequence genes • 1.3k views
ADD COMMENT
0
Entering edit mode

You are referring to genomes in the title of this post but then genes in the body. Which of the two is it since the answer would change based on that critical information.

ADD REPLY
0
Entering edit mode

It's genes, I wrote wrong on the title, thanks.

ADD REPLY
1
Entering edit mode
6.5 years ago
GenoMax 141k

Using NCBI eUtils. Change the gene name as needed.

esearch -db gene -query "dnaK" | efilter -organism bacteria | efetch -format docsum | xtract -pattern GenomicInfoType -element ChrAccVer ChrStart ChrStop | xargs -n 3 sh -c 'efetch -db nuccore -format fasta  -id "$0" -chr_start "$1" -chr_stop "$2"' > dnaK_seq.fa

Sequences truncated for brevity.

>NC_017448.1:c213043-211142 Fibrobacter succinogenes subsp. succinogenes S85, complete genome
ATGGGTAAGATTATTGGTATTGACTTGGGTACGACAAACAGCTGCGTTGCCGTGATGGAAGGTGGCAAGC
CGGTCGTCATCGCCAATGCAGAAGGTTTCCGCACCACGCCGTCTATTGTCGCTTTCGGCAAGAACGGTGA
>NC_016610.1:c2966116-2965736 Tannerella forsythia 92A2, complete genome
ATGGCTGAGAAGACAAGATACTCGGATGCGGAGCTGGAAGAATTTCGCGCCATTATCATGGAGAAGCTCG
ATAAGGCAAAGAAAGATTACGACTTGTTGAGATCGGGGATTACGAATGCCGACGGCAACGACGTTTCCGA
ADD COMMENT

Login before adding your answer.

Traffic: 2387 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6