I am new will dealing with the SSMethylSeq data and when I want to further analyse a sample, I found that different from other samples which the sequenced fragments were targeted at the bottom strand, that sample have a quite large number of reads from the top strand. Just shown as below:
Number of sequence pairs with unique best (first) alignment came from the bowtie output:
CT/GA/CT: 5495873 ((converted) top strand)
GA/CT/CT: 0 (complementary to (converted) top strand)
GA/CT/GA: 0 (complementary to (converted) bottom strand)
CT/GA/GA: 11278214 ((converted) bottom strand)
I think it is really unusual for SSMethylSeq data as the data should be "biased" to the bottom strand so I want to check with the targeted CpG sites for the SSMethylSeq. However, all the things I can find it that SSMethylSeq is targeting at "over 3.7 million individual CpG dinucleotide sequences for their methylation state". I would like to know whether there is a full list for the CpG sites/ sequences this protocol targeted at. Thank you very much!! :)