Clusters moving for same dataset and code using tSNE
1
3
Entering edit mode
5.2 years ago
chilifan ▴ 120

I am clustering data using Satija's Seurat package. Link: https://satijalab.org/seurat/pbmc3k_tutorial.html I am using this exact code and their data, and I generate the same plots as them. However, visualiizing my clusters with tSNE, they are positioned differently compared to the example on satijas website. I haven't though much about this, I've clustered some data after that and had the feeling that the cells in the separate clusters are still the same, and it doesn't really matter how they are positioned. But now I'm presenting this for a group of biologists, and they are going to ask why they are positioned differently. How should I explain this? And more important: does it matter at all to the end result?

My clusterinig

Satija's clustering

tSNE Seurat Clustering scRNAseq • 4.7k views
ADD COMMENT
2
Entering edit mode
5.2 years ago

tSNE (or t-Distributed Stochastic Neighbour Embedding) is a stochastic technique (i.e based on randomness). We would no expect to get exactly the same result multiple times, unless we fix the random seed. Actaully, what you'll notice here is that mostly the bottom plot is pretty close to just being a rotation of the top one.

For a myriad of reasons, you shouldn't use tSNE for clustering. You derive your clusters by some other method, and then visualise the clusters using tSNE. Because most clustering algos are more stable than tSNE on repeated runs, you would expect the memberships of the clusters to be the same.

ADD COMMENT
0
Entering edit mode

Thank you, this makes perfect sense:) I am indeed using another method for clustering, so I conclude that I can trust the cell identities of my clusters then.

ADD REPLY

Login before adding your answer.

Traffic: 2819 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6