Submission of a draft genome with or without repetitive contigs?
1
0
Entering edit mode
5.5 years ago
minions-b • 0

I have a draft fungal genome assembled/scaffolded with spade (ca. 1,400 scaffolds), which I plan to submit to EMBL. However, ca. 150 of them with very short length (200-500 bp) are identified as repetitive contigs when I used nucmer/funannotate for sanity check before gene prediction. Should I exclude the repetitive scaffolds from the submission?

All suggestions are highly appreciated!

genome • 1.1k views
ADD COMMENT
0
Entering edit mode

I would hazard a guess that EMBL would reject those contigs anyway.

I think you’re best off removing them as contigs less than a kilobase or more are uninformative and most likely junk

ADD REPLY
0
Entering edit mode

Thanks! I will remove them. :)

ADD REPLY
1
Entering edit mode
5.5 years ago
Juke34 8.5k

it depends, they don’t care if it’s repeat element or not, they just care about the length of sequences. What I remember it’s that they ask to motivate your choice when you want to keep short sequences/contigs <100 bp. The best approach would be to keep all of them then when you have your embl flat file you launch the embl flat file validator and it will tell you what you have to remove (none I guess in your case).

ADD COMMENT

Login before adding your answer.

Traffic: 2712 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6