Biostar Beta. Not for public use.
ENA data download : difference between "FASTQ" and "Submitted" files ?
0
Entering edit mode
13 months ago
guillaume.rbt • 590
France

Hi all,

I'm currently downloading public datasets on the ENA platform.

When I want to download files of a specific study (for example here : https://www.ebi.ac.uk/ena/data/view/PRJEB23709 ), for a given sample I have the choice between "FASTQ file" or "Submitted File".

I was wondering what is the difference between those two files (the "submitted" one being sligthly bigger that the corresponding "FASTQ" one)

Thanks

ENA EBI fastq • 164 views
ADD COMMENTlink
1
Entering edit mode

If I remember well you can also upload (un)aligned BAM files , which ENA will then convert back to fastq I think, but as ATpoint I as well suggest to always go for the fastq version.

ADD REPLYlink
1
Entering edit mode

In this specific case Submitted file appears to contain the actual sample name. If you get the ENA fastq files then you may need to keep track of metadata for the sample names. So in this case I suggest that you download a sample ENA and Submitted files. Compare them (they should be identical) and the probably get the Submitted files instead.

ADD REPLYlink
0
Entering edit mode

I've checked and they are indeed the same files. The difference in size was only due to the modification of reads name in the "FastQ" files.

ADD REPLYlink
1
Entering edit mode
11 weeks ago
ATpoint 17k
Germany

I do not know what Submitted Fileis. Use FASTQ file. There are ways to speed up the download, see my tutorial Fast download of FASTQ files from the European Nucleotide Archive (ENA)

ADD COMMENTlink
0
Entering edit mode

Ok ! (thanks for the tutorial, it is very useful)

ADD REPLYlink

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.1