Off topic: Unsupervised selection of inter-cluster highly variable genes
5.5 years ago
elb ▴ 250

Hi guys, I have a large data.frame of RNA-Seq counts in which rows are genes and columns are samples (around 100 patients). I clustered this matrix and identified 6 major clusters. The clusters share some genes that do not vary much between samples, while other genes characterize a cluster because their expression differs between clusters. For example, in one cluster 10 genes are highly expressed, while in all the other clusters the same genes are poorly expressed and vary little among themselves relative to the difference from the first cluster. Is there a way to select the most "significant" or variable genes that characterize each cluster with respect to the others, so that I end up with a list of cluster-specific genes whose expression is peculiar to that cluster? I know that one way is to compute a log2 fold change, but I would like to perform this analysis in an unsupervised way, without having to choose the comparisons for the fold change calculation myself. Can anyone help me with some ideas or references so that I can select the relevant cluster-specific genes?

Thank you in advance

e.
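
A minimal sketch of one possible way to frame this, not a definitive method: score every gene in every cluster by comparing that cluster against all remaining samples at once ("one-vs-rest"), so no comparison has to be chosen by hand. The function name `cluster_marker_scores`, the pseudocount of 1, and the toy data are all assumptions made for illustration, and the counts are assumed to be already normalized for library size.

```python
import math
from statistics import mean


def cluster_marker_scores(counts, labels, pseudocount=1.0):
    """Rank genes per cluster by a one-vs-rest difference in mean log2 expression.

    counts: dict mapping gene name -> list of (normalized) counts, one per sample
    labels: list with the cluster label of each sample, same order as the counts
    Returns a dict mapping cluster -> list of (gene, score), highest score first.
    """
    result = {}
    for cluster in sorted(set(labels)):
        inside = [i for i, lab in enumerate(labels) if lab == cluster]
        outside = [i for i, lab in enumerate(labels) if lab != cluster]
        scored = []
        for gene, row in counts.items():
            # log2 transform with a pseudocount so zero counts are defined
            logs = [math.log2(x + pseudocount) for x in row]
            # mean log2 expression inside the cluster minus mean in all other samples
            score = mean(logs[i] for i in inside) - mean(logs[i] for i in outside)
            scored.append((gene, score))
        scored.sort(key=lambda pair: pair[1], reverse=True)
        result[cluster] = scored
    return result


# Toy data (hypothetical): 3 genes, 6 samples, 2 clusters
counts = {
    "geneA": [100, 120, 90, 5, 4, 6],    # high only in cluster c1
    "geneB": [50, 55, 45, 52, 48, 50],   # similar everywhere
    "geneC": [3, 2, 4, 80, 95, 70],      # high only in cluster c2
}
labels = ["c1", "c1", "c1", "c2", "c2", "c2"]

scores = cluster_marker_scores(counts, labels)
for cluster, ranked in scores.items():
    print(cluster, ranked[0])  # top-scoring gene for each cluster
```

The score here is still a log2 fold change, but it is computed automatically for every cluster against the rest. With real data, the same one-vs-rest contrasts could probably be set up in R with an established differential expression package such as limma, edgeR or DESeq2, which would also provide significance estimates that this sketch does not.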

RNA-Seq R variance • 886 views
This thread is not open. No new answers may be added