Well-resolved Phylogenetic Dataset
1
1
Entering edit mode
8.1 years ago
jnf3769 ▴ 40

Hello all,

I am looking to test an alignment-free phylogenetic tree building algorithm I wrote. It can perform both gene and species trees. I have already tested it on a single gene primate tree, but I need some more data to further characterize the algorithm. I know there is a lot of data on TreeBASE, but I am having a hard time pulling data down. Additionally, I am generally unaware of which trees are considered well-resolved.

Any info would help greatly

phylogenetics dataset phylogeny data • 2.4k views
ADD COMMENT
2
Entering edit mode
8.1 years ago
kloetzl ★ 1.1k

You might want to use data sets already used in other papers on alignment-free comparisons. Here you can download the data from andi (shameless self-plug). I also have the roseobacter data set from the spaced words paper. Send me a mail, if you are interested.

ADD COMMENT
0
Entering edit mode

I have a followup question about the 109 E. coli ST131 strains. In the assemblies (ordered or not), there are multiple nodes per fasta file. Am I right in assuming that that means there are more than one contig per file (that is, the genome was not closed)?

ADD REPLY
0
Entering edit mode

Yes, those are contigs. A lot of genome projects stay in this state.

ADD REPLY

Login before adding your answer.

Traffic: 2941 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6