What's the difference between generating all possible genotypes from a MSA and extracting a consensus sequence?
2
0
Entering edit mode
9.9 years ago

I have seen examples of getting a consensus sequence by using software like UGENE or JalView. Now what would be an example output of generating all possible genotyopes from a Multiple Sequence Alignment result?

msa consensus genotypes • 2.5k views
ADD COMMENT
0
Entering edit mode
9.7 years ago
madk00k ▴ 360

I suppose the answer depends on the data you are working with. For example, one should use specialized genotype callers when working with virus populations. However, in general there are a lot of genotype callers available: GATK, FreeBayes, VcfTools. To use one of these tools one has to provide reference sequence and alignment (in BAM format) or a list of mutations (usually in VCF format).

I would select one sequence from multiple alignment as a reference sequence (this could be the consensus sequence, if no reference is available). Then I would remap the sequences forming the alignment to the selected reference using bwa-sw or bowtie2. The resulting BAM file can be used to generate VCF file for genotyping.

ADD COMMENT
0
Entering edit mode
9.7 years ago
Josh Herr 5.8k

I guess I am wondering why you would want to do this? You'll be able to easily identify a majority of genotypes by consensus, but rare variants may be hard to discern from sequencing errors.

You didn't provide us any information in regards to a research question or rationale for this question, so it's hard to provide some input on this basis.

ADD COMMENT

Login before adding your answer.

Traffic: 2566 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6