Comparison of variant distribution between chromosomes
1
0
Entering edit mode
5.2 years ago
misterie ▴ 110

Hi,

Do you know any idea how to do comparison of SNPs and InDels distribution between chromosomes? I know, I should take account a different chromosomes size. I have calculated number of SNPs for each chromosomes as well as number of InDels. Should I take account into size of indels? Do you have any idea?

comparison vcf distribution • 1.3k views
ADD COMMENT
1
Entering edit mode
5.2 years ago
ATpoint 81k

Beyond size, one should take into account gene density (number of genes / total chr size), number of low complexity region, nuclear localization (center of nucleus vs. laminar-associated) etc. I think this is not trivial and therefore: What would be the underlying question you want to answer?

ADD COMMENT
0
Entering edit mode

I want to compare distribution of SNP -- randomness, some patterns associated with chromosome

ADD REPLY
1
Entering edit mode

This is not perhaps what you are looking for in terms of guidance for analysis, but it help you identify what to look for? Ensembl has per chromosome summary statistics, number of coding or non coding genes and short variants (defined as <50bp in length), e.g. for chromosome 1.

I guess what you are looking for is identifying which chromosome/regions on a chromosome may have a higher/lower than expected number of variants compared to the genome frequency as a whole. Have you considered looking into regions that are shown to be highly conserved evolutionarily?

ADD REPLY
0
Entering edit mode

Yes, to further elaborate on Erin's final point, as an example, recent in silico prediction tools have compared mutation frequencies against random mutation backgrounds and/or 'derived' alleles that have become fixed (conserved) in the human lineage when compared to our recent ape ancestor. If you want ideas, I would look at some of those recent tools, such as GWAVA, FATHMM MKL, CADD, DANN, etc. Ensembl has much untapped data that can be put to great use.

As a side note: conservation is the single best predictor of functionality of a mutation / variant.

ADD REPLY
0
Entering edit mode

I have got my own VCF files that contains information about InDels and VCF (also annotated) and I want to do some comparison between chromosomes. It is not a human genome.

ADD REPLY

Login before adding your answer.

Traffic: 2047 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6