Making a custom reference genome
1
1
Entering edit mode
9.0 years ago
biogirl ▴ 210

Hi all,

For certain reasons, I need to make a custom reference genome to align my Illumina sequence reads against, and call SNPs. I've made my custom reference by using SPAdes to create de novo assembly from sequence reads, and indexing for the programs I'm going to use for alignment and subsequent SNP calling (so indexed for BWA, GATK and Samtools).

I only want to call SNPs, so I'm not interested in gene annotation. Basically, what I want to know is, am I missing anything crucial in the approach to making a custom reference sequence?

Thanks

genome snp sequence • 5.7k views
ADD COMMENT
0
Entering edit mode

When you map reads back to your assembly, how the alignment stats looks ?

ADD REPLY
0
Entering edit mode

Well, according to flagstat, alignment stats look alright: 96% of reads mapped, and 94% properly paired, and low percentage (0.9%) of singletons.

ADD REPLY
0
Entering edit mode

How about multi-mapped reads?

ADD REPLY
0
Entering edit mode

Not really sure - how do I check?

ADD REPLY
2
Entering edit mode
9.0 years ago
h.mon 35k

There is not a lot of detail to provide a good answer, I think. You did not detail how you assembled your genome, and if / how you evaluated your assembly for completeness and correctness, or contamination. What kind of organism, number of samples? Are you assembling the genome with only one strain, or several? Assembly quality will depend on many factors, if you are not aware of them you may be calling SNPs on artifacts.

ADD COMMENT
0
Entering edit mode

I think biogirl is not bothered about assembly stats or anything. Just want to makes representative sequences for a bunch of sequences, then align them back and call snps.

ADD REPLY
0
Entering edit mode

Geek_y is correct - just interested in a representative sequence. But to provide more detail, if it'll help, I assembled my genome using SPAdes using paired end reads from one strain. I appreciate that paired-end reads are the best to use for de novo, but it's all I have at the moment.

ADD REPLY

Login before adding your answer.

Traffic: 1780 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6