I applied bioinformatics methods to retrieve the genes associated with a cohort of patients who have a particular disease and with a cohort of healthy controls.
I then applied biostatistics and machine learning techniques to rank all the genes by their importance/significance in relation to the disease. I now have this ranked list of genes, and my next step is to validate it.
What is a precise computational way to confirm (or refute) the association between the top genes of my ranking and the studied disease?
For example, how can I demonstrate that genes A, B, and C are related to the studied disease D?
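
To make the question concrete, here is a minimal sketch of the kind of check I have in mind: an over-representation (hypergeometric) test asking whether my top-ranked genes overlap a set of already known disease genes more than chance would predict. Everything in it is an assumption for illustration: the function name `hypergeom_enrichment_p` is hypothetical, the gene identifiers are made up, and it presumes that a curated list of known genes for disease D is available from some external resource. It is only one possible approach, not necessarily the right one.

```python
from math import comb


def hypergeom_enrichment_p(top_genes, known_disease_genes, background_genes):
    """Return (overlap, p) where p = P(overlap >= observed) if the top genes
    were drawn at random, without replacement, from the background."""
    background = set(background_genes)
    # Restrict both lists to genes that were actually measured and ranked.
    top = set(top_genes) & background
    known = set(known_disease_genes) & background

    n_background = len(background)  # all ranked genes
    n_known = len(known)            # known disease genes in the background
    n_top = len(top)                # size of my top-gene selection
    overlap = len(top & known)      # top genes that are already known

    # Upper tail of the hypergeometric distribution.
    total = comb(n_background, n_top)
    tail = sum(
        comb(n_known, i) * comb(n_background - n_known, n_top - i)
        for i in range(overlap, min(n_known, n_top) + 1)
    )
    return overlap, tail / total


# Toy, made-up data (hypothetical identifiers, not real genes).
background = [f"G{i}" for i in range(1, 21)]   # 20 ranked genes
known = ["G1", "G2", "G3", "G4"]               # assumed known genes for disease D
top = ["G1", "G2", "G3", "G10", "G11"]         # top 5 of my ranking

overlap, p_value = hypergeom_enrichment_p(top, known, background)
print(f"overlap = {overlap}, p = {p_value:.4f}")
```

A small p-value here would only say that the top of the ranking is enriched for genes already linked to the disease. It would not validate genes that are absent from the known set, which is part of why I am asking what else can be done.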