Univariate testing before or after feature selection
1
0
Entering edit mode
5.8 years ago
ab123 ▴ 50

Hi there,

Wondering if univariate have to be run before features are selected or can be applied to the selected features? The latter gives me better results.

I understand that we select features to build the best predictive models, but in my case they also seem to bring out the most significant variables.

Am I missing something? Sorry couldn't find the answer googling this so asking you guys.

Thanks!

ttest stats • 1.2k views
ADD COMMENT
3
Entering edit mode
5.8 years ago

There is no standard way to do it. Obviously your method of feature selection is important, as it is undoubtedly already applying some kind of statistical test to your data. I gave a long answer on this topic, here: A: What is the best way to combine machine learning algorithms for feature selectio

However, my answer should not be regarded as the 'gold standard' by any means.

One could easily make the argument that 'unbiased' feature selection, i.e., performing feature selection on your entire unfiltered data, is best, whilst, in other situations, it may prove beneficial to first filter out genes based on some comparison, e.g., between cases and controls.

Kevin

ADD COMMENT
0
Entering edit mode

Great answer! Thank you Kevin!

ADD REPLY

Login before adding your answer.

Traffic: 2018 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6