How to process gene expression data from Affymetrix U133 Plus 2.0 in Python
4.9 years ago
nkabo ▴ 80

Hello,

I have a gene expression dataset from ArrayExpress that was generated on the Affymetrix U133 Plus 2.0 array. Is there a workflow that describes how to complete the following steps?

1) Replacing probe IDs with gene names
2) Removing probes that are not specific to a single gene
3) Taking the mean expression value of multiple probes that correspond to the same gene

Thanks in advance

python affymetrix gene expression data human • 2.7k views
4.9 years ago
ATpoint 81k

Don't do that in Python. Use established R packages to analyse microarrays. See this tutorial.


Thank you for the reply. I used the workflow and expected approximately 26k data points, but ended up with over 48k probes. Is that expected, or did I do something wrong during processing?


The Affymetrix U133 Plus 2.0 array covers a broad range of transcript types, both coding and non-coding, so ending up with more probes than genes is not surprising.

To figure out coding versus non-coding, perhaps you can get some ideas from here: A: Affymetrix Human Genome U133 Plus 2.0 Array
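If the probe-level matrix still has to be collapsed to genes outside R, the three steps from the original question can be sketched in pandas. This is only a minimal illustration, not a replacement for the R workflow recommended above. It assumes a hypothetical layout: `expr` is a DataFrame indexed by probe ID with one column per sample, and `annot` is a Series mapping probe ID to a gene symbol string. It also assumes that probes matching several genes list them separated by " /// ", as in Affymetrix annotation files; adjust `sep` if your annotation uses something else. The function name `collapse_probes_to_genes` is made up for this example.

```python
import pandas as pd


def collapse_probes_to_genes(expr, annot, sep=" /// "):
    """Collapse a probe-level expression matrix to gene level.

    expr  : DataFrame indexed by probe ID, one column per sample (assumed layout).
    annot : Series mapping probe ID -> gene symbol string (assumed layout).
    sep   : separator assumed to mark probes annotated to several genes.
    """
    # Look up the gene symbol for every probe in the matrix;
    # probes without any annotation are dropped.
    symbols = annot.reindex(expr.index).dropna()
    symbols = symbols[symbols.str.strip() != ""]

    # Step 2: remove probes that are annotated to more than one gene.
    symbols = symbols[~symbols.str.contains(sep, regex=False)]

    # Steps 1 and 3: label the remaining probes by gene symbol and
    # average all probes that belong to the same gene.
    return expr.loc[symbols.index].groupby(symbols).mean()


# Toy data, invented purely for illustration.
expr = pd.DataFrame(
    {"s1": [1.0, 3.0, 5.0, 7.0], "s2": [2.0, 4.0, 6.0, 8.0]},
    index=["p1", "p2", "p3", "p4"],
)
annot = pd.Series({"p1": "TP53", "p2": "TP53", "p3": "BRCA1 /// NBR2", "p4": None})

gene_level = collapse_probes_to_genes(expr, annot)
print(gene_level)
```

In the toy data, `p1` and `p2` are averaged into a single `TP53` row, while `p3` (two genes) and `p4` (no annotation) are discarded. The same collapsing step is why a gene-level matrix has far fewer rows than the probe-level one.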
