Retrieving several bacteria CDS
0
0
Entering edit mode
5.1 years ago

Hi. I want to retrieve several bacteria exomes with the condition that those bacteria has to be (ideally) homogeneously separated in evolution (I need big biological diversity among the species I choose).

Also, it would be positive to pick that bacteria species relying on some insteresting trait (they are related to human, cattle disease, they can survive at extreme conditions...). So, the files has to be annotated or the browser has to have an option where I can filter by this kind of traits.

Is there a way of doing it without writting a scrip? Which database do you recommend me (for bacterias only)?

Thanks for your time.

R sequence database retrieving • 754 views
ADD COMMENT
1
Entering edit mode

Retrieving bacterial proteomes from the NCBI is easy (you can use the search in this forum to find out more about that). However, coming up with a list that meets your criteria sounds like a lot of manual work, I don't know how you think that some script would help you there..

ADD REPLY
0
Entering edit mode

I was thinking about doing a script using some kind of evolutive distance parameter. I've been reading about that and it exit. The problem is that I dont have time or informatics skill enough for doing that script, and I though maybe there is some library that can help me.

ADD REPLY
1
Entering edit mode

You can download bacterial proteomes from NCBI using various methods. Following is one for RefSeq geneomes.

  1. Get this file: wget ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/assembly_summary_refseq.txt
  2. Create download URL's for genomes: cat assembly_summary_refseq.txt | awk 'BEGIN{FS="\t"}{print $20}' | awk 'BEGIN{OFS=FS="/"}{print $0,$NF"_genomic.faa.gz"}' > urls.txt
  3. Edit the file to remove genomes you are not interested in (no automated way to do this based on traits you are interested in).
  4. Use a simple loop to download data: for i incat urls.txt; do wget $i; done
ADD REPLY
0
Entering edit mode
wget -i urls.txt

;)

ADD REPLY
0
Entering edit mode

👍

ADD REPLY

Login before adding your answer.

Traffic: 2566 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6