Question

Clusters moving for same dataset and code using tSNE

3

Entering edit mode

5.2 years ago

chilifan ▴ 120

I am clustering data using Satija's Seurat package. Link: https://satijalab.org/seurat/pbmc3k_tutorial.html I am using this exact code and their data, and I generate the same plots as them. However, visualiizing my clusters with tSNE, they are positioned differently compared to the example on satijas website. I haven't though much about this, I've clustered some data after that and had the feeling that the cells in the separate clusters are still the same, and it doesn't really matter how they are positioned. But now I'm presenting this for a group of biologists, and they are going to ask why they are positioned differently. How should I explain this? And more important: does it matter at all to the end result?

My clusterinig

Satija's clustering

tSNE Seurat Clustering scRNAseq • 4.7k views

ADD COMMENT • link updated 5.2 years ago by i.sudbery 19k • written 5.2 years ago by chilifan ▴ 120

score 2 · Answer 1 · 2019-02-13

2

Entering edit mode

5.2 years ago

i.sudbery 19k

tSNE (or t-Distributed Stochastic Neighbour Embedding) is a stochastic technique (i.e based on randomness). We would no expect to get exactly the same result multiple times, unless we fix the random seed. Actaully, what you'll notice here is that mostly the bottom plot is pretty close to just being a rotation of the top one.

For a myriad of reasons, you shouldn't use tSNE for clustering. You derive your clusters by some other method, and then visualise the clusters using tSNE. Because most clustering algos are more stable than tSNE on repeated runs, you would expect the memberships of the clusters to be the same.

ADD COMMENT • link 5.2 years ago by i.sudbery 19k

0

Entering edit mode

Thank you, this makes perfect sense:) I am indeed using another method for clustering, so I conclude that I can trust the cell identities of my clusters then.

ADD REPLY • link 5.2 years ago by chilifan ▴ 120