Hierarchical clustering evaluation
1
0
Entering edit mode
5.8 years ago
druggable ▴ 60

Hi,

I clustered more than 10,000 genes based on gene expression profile under different treatment conditions. I used the dynamic tree cut package in R. However, it produced around 90 clusters. How do I evalute the clustering results with such a high number of clusters?

clustering gene expression • 1.5k views
ADD COMMENT
0
Entering edit mode

How do you know it is a high number of clusters? And I assume you mean 10,000 sequences.

If you know which genes it are you can check if different genes are in the same cluster. If so, this is not what you want so you need to change parameters.

ADD REPLY
0
Entering edit mode

The output of the dynamictree cut package in R is more than 90 clusters. I am clustering more than 10,000 genes based on their gene expression across various treatments.

I cannot check if the genes in the same cluster are different because that is my objective, to learn which genes behave the same way

ADD REPLY
1
Entering edit mode
5.8 years ago

The way to deal with large numbers of anything is to have a computer do the work. The question you should ask yourself is what are you trying to assess ? There are many ways to evaluate a clustering result but they depend on the context and the goal of the clustering. If you have anything resembling a ground truth, then use it. You can also use feature enrichment analysis (e.g. any annotation you may have) to assess whether the clusters make sense in the context of the experiments. Otherwise you can compute various indices such as the silhouette.

ADD COMMENT

Login before adding your answer.

Traffic: 3221 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6