Does anyone know good open data sources for metagenomic data linked to a given condition?
2
0
Entering edit mode
6.8 years ago

I'm looking for either a database (like SRA) or even a study that provides its data that has labels associated with the data. Ideally, this would be metagenomic data (either sequences or abundance tables) in a study that has a strong link between a feature like a species and the condition being studied.

Just reaching out because I haven't been able to find any studies that have enough data for my application (implementing machine learning algorithms) - so ideally we are talking about at least 100 samples for the condition being studied (controls, maybe the same).

Any help is appreciated. Thanks guys

metagenomics databases open data • 1.7k views
ADD COMMENT
1
Entering edit mode
6.8 years ago
Tm ★ 1.1k

Hi Edward,

You can check below mentioned post. It may solve your purpose http://github.com/gjospin/PhyloSift/issues/59.

ADD COMMENT
0
Entering edit mode

thanks! I'll look through that post... of course I'm probably being a little too nitpicky with my search... we'll never really find ideal data in the real world, will we?

ADD REPLY
0
Entering edit mode
6.8 years ago
erictleung ▴ 110

The closest databases I can think of are

The American Gut project has the quantity of data, so that might be the most interest to you for machine learning purposes. Good luck.

ADD COMMENT

Login before adding your answer.

Traffic: 3211 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6