Can You Explain The Differences Among These Blast Databases?
1
1
Entering edit mode
12.1 years ago
Alby ▴ 90

I am trying to blast my sequence against primate databases remotely. But there are so many databases, and most are in abbreviation. Can you explain what they are in the below example list?

chimp AACZ contigs
chimp AADA contigs
chimp bes
chimp est
chimp htgs
chimp all contig
chimp refseq contig

and which one should I use for NGS reads?

blast database next-gen sequencing command-line • 2.3k views
ADD COMMENT
1
Entering edit mode

you shouldn't use blast for aligning the reads.

ADD REPLY
3
Entering edit mode
12.1 years ago
Vikas Bansal ★ 2.4k

It will be good, if you can give more details about what kind of reads you have and what are you going to do with them using blast. If you are going to align reads using blast, then Pierre's suggestion is right. For the databases (as far as I know)-

chimp AACZ contigs

chimp AADA contigs

AACZ and AADA are prefix of accession numbers. Both of them contains sequences from whole genome shotgun and have different entries. Look at this and you can find information for both of them.

chimp bes

This are the entries from BAC end sequences and you can read little bit about the genome here and here(this one is not for chimp but give you little idea about bac)

chimp est and chimp htgs stands for express sequence tags and high throughput genome sequences

chimp all contig will include all the contigs (different assembled versions and entries) and chimp refseq contig will align against reference sequences only.

ADD COMMENT
0
Entering edit mode

The whole genome shotgun (WGS) projects are summarized in:

The AACZ corresponds to GenBank Assembly GCA_000001515, which is used as the basis for the genome build used in Ensembl: http://www.ensembl.org/Pan_troglodytes/

ADD REPLY

Login before adding your answer.

Traffic: 1696 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6