5.4 years ago
Ric
I have two folders, and each of them contains the same file names.
ls -1
10_S0_L001_R1_001.fastq.gz
10_S0_L001_R2_001.fastq.gz
11_S0_L001_R1_001.fastq.gz
11_S0_L001_R2_001.fastq.gz
Is there a way to extract the flowcell name from each dataset and use it to make the filenames unique?
Thank you in advance.
Thank you, is there a script for it?
You can use standard unix tools (such as cut, awk, tr, grep, etc.) or non-standard ones (e.g., bioawk) to extract metadata from your fastqs. Like genomax said, you'd need to extract some other identifier(s) in order to make your filenames unique.
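As a rough sketch of the cut approach: the snippet below assumes your reads have Illumina (Casava 1.8+) style headers, where the flowcell ID is the third colon-separated field of the first line (@instrument:run:flowcell:lane:...). The function names flowcell_of and new_name_for are made up for illustration. If both folders came from the same flowcell, this prefix alone will not make the names unique, so you may need to add another field (run number, lane) or the folder name.

```shell
#!/usr/bin/env bash
# Sketch only: assumes Illumina (Casava 1.8+) headers of the form
#   @instrument:run:flowcell:lane:tile:x:y read:filtered:control:index
# Check one header by eye before trusting the field number.

# Print the flowcell ID (3rd colon-separated field) of the first read.
flowcell_of() {
    gzip -dc "$1" | head -n 1 | cut -d ':' -f 3
}

# Hypothetical helper: build a new path with the flowcell ID as a prefix.
new_name_for() {
    local f="$1"
    local fc
    fc=$(flowcell_of "$f")
    printf '%s/%s_%s\n' "$(dirname "$f")" "$fc" "$(basename "$f")"
}

# Dry run: print the mv commands for every file given on the command line.
# Remove the "echo" once the output looks right.
for f in "$@"; do
    [ -f "$f" ] || continue
    echo mv "$f" "$(new_name_for "$f")"
done
```

Saved as, say, rename_by_flowcell.sh (a hypothetical name), it could be run as: bash rename_by_flowcell.sh folder1/*.fastq.gz folder2/*.fastq.gz. It only prints the mv commands, so nothing is renamed until you drop the echo.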