Whole exome processing using python or R
0
0
Entering edit mode
2.7 years ago
Ankita ▴ 70

How to Download a single sample from ENA/SRA for Whole Exome Sequencing platform (e.g. SRR)

Convert the Raw format for sample to VCF (both somatic and germline mutations) using simple python code or R CODE OR ANY.

SRA • 1.2k views
ADD COMMENT
1
Entering edit mode

I dont think you can do that with any single line of code. Since the sequence is submitted in raw format, You have to download, perform alignment and process the samples manually according to the reads.

ADD REPLY
0
Entering edit mode

If you manage how to download SRR file (fastq), you can do the analysis without any coding by using public/ published workflows on Galaxy and an example WES workflow is here (https://galaxyproject.github.io/training-material/topics/variant-analysis/tutorials/exome-seq/tutorial.html). Please read the manual or search on google how to download files from SRA/ENA. There are plenty of tutorials and automated scripts to address your query on downloading files from ENA/SRA. If you have preliminary coding experience, I would suggest to use either NF core prebuilt workflows or snakemake prebuilt workflows.

ADD REPLY
0
Entering edit mode

You can download files directly to Galaxy from the ENA Browser. e.g. Go to https://www.ebi.ac.uk/ena/browser/view/SRR212430?show=reads Click on "Show Column Selection" and check "fastq_galaxy" (assuming it's the generated fastq file you want) "Generated FASTQ files: Galaxy" column will now be visible in the table below. Click on a link in this column to start a download to Galaxy.

ADD REPLY

Login before adding your answer.

Traffic: 1322 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6