downloading dataset for snp analysis
1
0
Entering edit mode
8.1 years ago

Hi,

I am new to Bioinformatics,I am from Computer Science background

I want to perform comparison of tools for snp. Therefore I need some raw data files. How do I get the data files? I searched a lot on internet but couldn't find a download button on any website which when clicked will give me the file in the proper format. Example I searched SRA, GEO, NCBI. But it is so confusing for the person who is a beginner in this field. Even my friend tried but in vain.

Please help

SNP • 1.9k views
ADD COMMENT
0
Entering edit mode

By "SNP analysis", you mean SNP calling analysis? Which type of file are you looking for? fastq format, fasta format, bam format?

ADD REPLY
0
Entering edit mode

Hi, Ya SNP calling analysis.I want in fasta format.

ADD REPLY
2
Entering edit mode

Well, SRA will give you all kind of datasets that you need. Using the ftp you will find the sequencing files from a wide range of organisms. Furthermore you'll find data becoming from all the sequencing plataforms, such as Illumina, 454 ... So you'll be able to find and download whatever you are interested in.

Here is the link: SRA

You'll need to convert .sra format to fastq (or fasta). To do that there is a nice easy tool called fastq-dump.

I forgot to mention. Here there is a nice tutorial covering this issue.

ADD REPLY
1
Entering edit mode
8.1 years ago
GenoMax 141k

You could make synthetic datasets based on real genome sequence. You can get the human chromosomes here. Get *.fa.gz files.

ADD COMMENT
0
Entering edit mode

Thank you very much.I think it is useful.At least I now know where to get the source file from.I will try it.

ADD REPLY

Login before adding your answer.

Traffic: 1523 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6