how to get percent identity between two RNA-seq fastq files?
1
0
Entering edit mode
5.4 years ago
lkianmehr ▴ 100

Hello, Does anybody have an idea about the way to get percent identity between two or more RNA-seq fastq files ? or to show how many of the sequences in one of them are found in another one?

thanks in advance

percent identity RNA-seq • 1.2k views
ADD COMMENT
1
Entering edit mode
5.4 years ago
 comm \
     <(gunzip  -c 11.fq.gz | paste - - - - | cut -f 2 | sort)  \
     <(gunzip  -c f2.fq.gz | paste - - - - | cut -f 2 | sort) | \
     awk -F '\t' '{if($1=="" && $2=="") {C+=1.0;};} END {print C/NR;}'
ADD COMMENT
0
Entering edit mode

I have run that command on two fastq files that are sequenced on the same library, I got one value = 0.119817, what does it mean? it means they have just 0.1 percent difference?

ADD REPLY
1
Entering edit mode

1/10 of reads in common.

ADD REPLY

Login before adding your answer.

Traffic: 3185 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6