Long Runs Of A'S, G'S, C'S Or T'S In A Scaffolded Assembly (Abyss-1.3.4/Soapdenovo2)
1
0
Entering edit mode
10.9 years ago
jomaco ▴ 200

Hi,

As I understand, prior to scaffolding using long mate-pair reads, runs of N's or A/G/C/T's cannot exceed the length of the reads used for assembly (in this case 101bp). After scaffolding there are long series of N's as a result of contigs being joined together.

Can there also be long runs of A/G/C/T's after scaffolding? This is what I am seeing. I thought perhaps that joining contigs together ending in homopolymer repeats might, instead of an N, result in an A being used for example if those two contigs ended with long runs of A.

ABySS-1.3.4 was used for assembly and SOAPdenovo2 was used for scaffolding.

Thanks,

Jom.

scaffolding • 2.1k views
ADD COMMENT
1
Entering edit mode
10.8 years ago
Rayan Chikhi ★ 1.5k

Did you enable the gap-filling option during Soapdenovo2 scaffolding? I recall that Soapdenovo gapfiller does not hesitate to fill gaps with the same k-mer many times, to resolve tandem repeats.

ADD COMMENT

Login before adding your answer.

Traffic: 1978 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6