I’m planning to analyse RNA-seq data that was obtained from a few different batches. I’ll use batch as a covariate in my model, but I would still like to estimate the impact of known, individual batch factors (e.g. read length, read depth, # of mapped reads, prep techniques). Initially, I thought to run an ANOVA, but a colleague mentioned PVCA. Is there a standard or preferred method for this?
Why do you not just test the batch factors in your model? Then you can see how many genes are affected by each feature. And you might want to throw GC content into the mix - it's a well known thing that changes with batch.