Biostar Beta. Not for public use.
Viewing and editing FASTQ files
1
Entering edit mode
17 months ago

I have a FastQ file which appears to be a truncated one, due to which there is a formatting error, and I'm neither able to upload it in FASTQC software nor use that for aligning it with a reference Genome. The size of the file is around 16 GB. I want to view where the formatting style error occurred (most probably at the end of the file) and edit in such a way that it follows the correct format.

Doing it in a text editor is nearly impossible because the machine crashes if I try to open such a large file. Is there any other tool I can use to edit a small part of the FastQ file?

0
Entering edit mode

You can also try validateFiles from Jim Kent's utils to see where the problem is. Add execute permission chmod u+x validateFiles after you download.

0
Entering edit mode

Yes, I tried this and located where the error has possibly occurred. I still need a way to correct it.

0
Entering edit mode

If you tell us what the error says we can try to help you fix it.

0
Entering edit mode

As far as I know, the BAM file which I have was generated from a different pipeline and when I try to convert to fastq, it results in a truncated file having format errors like:

1) ^B symbol is present in some places instead of breaking into a new line 2) few incomplete records 3) RNA sequence having characters like B, F, @ apart from ATCG

1
Entering edit mode
7 months ago
ATpoint 17k
Germany

Use repair.sh from BBMap and do not even try text editors. That is both impossible and not reproducible.

0
Entering edit mode

I tried this too, it starts running and shows this error finally:

Invalid maximum heap size: -Xmxm Error: Could not create the Java Virtual Machine. Error: A fatal exception has occurred. Program will exit.