Gold standard datasets
6.0 years ago
pinn ▴ 210

Hi

I'm currently looking to build a robust genome analysis pipeline. I have come across a lot of aligners, variant callers, structural variant callers and copy number variation (CNV) callers. But where can I find the raw datasets for all 5 genomes (HG001, HG002, HG003, HG004 and HG005)?

Can anyone provide me with a link? Thanks!

genome • 2.3k views

Hello pinninti1991reddy!

Please follow up on your earlier questions. See "C: Readgroups for a bam file ?"

For this reason we have closed your question.

If you disagree please tell us why in a reply below, we'll be happy to talk about it.

Cheers!


Thanks for your response.


Please provide feedback to d-cameron by responding to their answer, and I'll reopen this question. Thank you!


What does that mean, OP? Do you intend to follow up on your past posts?

6.0 years ago
d-cameron ★ 2.9k

"I can find the raw datasets for all 5 genomes"

The 'raw' data are physical cell lines. Genome in a Bottle genomes are literally a genome in a bottle that you can use to test your entire sequencing pipeline, from sample prep all the way to the end of the bioinformatics pipeline.

That said, there is plenty of reference sequence data at the GiaB website: http://jimb.stanford.edu/giab-resources/


Thanks, I found the new benchmark datasets published by Zook et al. 2018 (Dr. Justin Zook).
