Question

Off topic:Unsupervised selection of inter-cluster highly variable genes

0

Entering edit mode

5.5 years ago

elb ▴ 250

Hi guys, I have a big data.frame of RNA-Seq counts in which rows are genes while columns are samples. I clustered this big matrix and I identified 6 major clusters. They have some common genes, i.e. genes that do not show a huge variation between the samples (around 100 patients) and some genes that characterize each cluster because the expression is different between the clusters. For example: in one cluster 10 genes are highly expressed while in all the other clusters the same genes are poorly expressed and do not change substantially comparing to the first cluster. Is there a way to select the highly "significant" or variable genes that characterize each cluster with respect to the others in order to end up with a list of cluster-specific genes whose expression is peculiar of that cluster? I know that a way is to perform a log2 (fold change) but I would like to performe this analysis in an unsupervised way without to select the comparisons for the fold change calculation. Can anyone help me with some idea or references so that I can select the cluster-specific relevant genes?

Thank you in advance

e.

RNA-Seq R variance • 886 views

ADD COMMENT • link updated 5.5 years ago by GenoMax 141k • written 5.5 years ago by elb ▴ 250