Hi, I have difficulty in calculation for average insert size and standard deviation of RNA-seq NGS data to submit GEO database.
I generated two fastq files of paired end read(Read1, Read2) using illumina NGS sequencer.
And, I used bacteria genome as reference which has 3 chromosomes. For analysis, I used bowtie2 pipeline to align paired end data to each chromosomes. To get average insert size, such as 'samtools stats' is good using alignment.
But I think using generated each sam file ,which is divided from fastq data, for calculation is not accurate. Is there any way to get the average insert size and standard deviation from total fastq?
Thank you in advance
Oh, thanks a lot. But my fastq files are not overlap between reads because read length is shorter than inner distance. So in my case, what is the best option for bbmap? Is it right to calculate after merging the three chromosomses fasta file?
You should ideally do it using the original fastq data if you have it. But if not, fasta should work (I think).