Reduce saturation by deleting the 3rd position in each codon of DNA seqs. R ape package
1
0
Entering edit mode
9.2 years ago
dago ★ 2.8k

I am performing a phylogenentic analysis using specific marker genes.

I selected the aa sequences, aligned them, used as template for aligning the respective nucleotide sequences.

I created a saturation plot with the "ape" package in R, and I can see that same seq are saturated.

In order to have a more robust results I would like, now, to delete the third position in each codon.

Is there a way/program to do this?

I guess this could be done with

seq
test3[seq(1, length(test3), by = 3)] # take only the first nucleotide in the codon

but I cannot find a way of taking only the first one and the second one.

I am also open to any suggestion for different way of doing this task

alignment phylogeny DNA R • 3.5k views
ADD COMMENT
3
Entering edit mode
9.2 years ago
David W 4.9k

In R you can use negative indices to exclude elements.

Here's an example using one of ape's datasets (not this is an alignment (matrix), not a single sequence (vector) as in your example

ADD COMMENT
0
Entering edit mode

Ok, really good thanks!

ADD REPLY
0
Entering edit mode

I might not understand the command AlignmentToFilter[, -seq(1, ncol(AlignmentToFilter), by=3) ] .. but it is apparently skipping the first codon position in my hand while command AlignmentToFilter[, -seq(3, ncol(AlignmentToFilter), by=3) ] worked for me and omitted the 3rd codon position.

ADD REPLY

Login before adding your answer.

Traffic: 1758 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6