Hi All,
I downloaded the data from GEO (GSE17972). It seems the data format is quite special one: qmap. Anyone have the script to transfer qmap bis-seq format To bigwig/wig/bedgraph.
GEO: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE17972 OR the question is same as: how to understand the qmap bis-seq format?
*Please ignore all the answer below. Since the reads were aligned to YH Genome, rather than GRCh37/38. I suggestion all the user who try to use this data should download the fastq and re-mappint to human genome, so that the result could be comparable with other dataset.*
* Download Finished, Alignment Finished, BW finished *
Okay. After all the guys' help. This problem has been totally solved.
1, this dataset is hg18 based.
2, qmap format has been listed by Devon Ryan..
3, in the qmap file, every base have been shown not only CpG, but every base.
4, therefore, when you calculate methylation level of CpG, you need extract the value for CpG sites.
Thanks.
ENA appears to have the fastq files in case you want to get the original data.
Yes. Thanks. I notice it: http://www.ebi.ac.uk/ena/data/view/PRJNA119949 However, I don't want to download the huge fastq data again. I think I can trust the mapping from BGI, right?
Long as you can understand the format, which you say is special :)
Okay. let change to "how to understand the qmap bis-seq format". Thanks genomax2! by the way, who is genomax1?