How else can one quantify a DNA sequence besides GC-content and length?
5
0
Entering edit mode
8.7 years ago
jolespin ▴ 150

I'm looking for other ways of quantifying sequences besides looking at GC-content and length? I could look at coverage and stuff but what about just from the raw sequence? The size of the sequences are gene-length to contig-length of good assemblies (1000 nt - 200 000 nt)

gc seq genome gene sequence • 2.6k views
ADD COMMENT
0
Entering edit mode

the phrasing of your question implies you are looking for quantification related to physical properties.... and probably of a relatively short DNA sequence ... is that correct?

If not, there are TONS of ways. Please disambiguate.

ADD REPLY
5
Entering edit mode
8.7 years ago
5heikki 11k

kmer content

ADD COMMENT
0
Entering edit mode

Can you use this as a single value? I know I could get the % for each kmer in the sequence but I would have multiple values. Maybe stdev of it?

ADD REPLY
0
Entering edit mode

Well, you could get a single value of e.g. how many unique tetramers there are in a given sequence. I'm not so sure how useful that would be though.

ADD REPLY
3
Entering edit mode
ADD COMMENT
2
Entering edit mode
8.7 years ago
JC 13k

Time ago I wrote some code for quantify different DNA properties (compositions, bias, complexity): https://github.com/caballero/SeqComplex

ADD COMMENT
1
Entering edit mode
8.7 years ago

Composition vectors have been used to build phylogenetic trees from sequences: http://www.aporc.org/LNOR/12/ISORA2010F02.pdf

These are somewhat related to kmers though.

ADD COMMENT
1
Entering edit mode
8.7 years ago

I hate to bang my own drum, but you can find plenty of DNA structural parameters and other relevant metrics in my work on DNA atlases from 15 years ago (Jensen et al., 1999; Pedersen et al., 2000). This includes parameters such as predicted DNA curvature, flexibility and stability.

ADD COMMENT

Login before adding your answer.

Traffic: 1860 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6