Biostar Beta. Not for public use.
How to identify best-hit taxonomy of many gene sequences efficently
0
Entering edit mode
13 months ago
gwrathe • 0

Hello, I have many sequences, each represents a species. I have the aim of identifying these sequences using their best-hit and recording that information to find the relative diversity of my community. I could go on to NCBI refseq and grab the best-hit for each of my sequences and record them but that would be impractical and I know there is a better way! I've heard of people using eFetch or other tools.

Could anyone please help me with recommendations on programs, tools, or instructions on how I can achieve my aim?

Thank you for helping me out! Much appreciated.

ADD COMMENTlink
4
Entering edit mode

eutils won't help much unless you search for keywords or accessions. How about sequence searches that give you best hits, with the option to directly give you the taxonomic classification (blastdb5) or LCA (diamond/mmseq2)

blast

diamond

mmseqs2

ADD REPLYlink

Login before adding your answer.

Similar Posts
Loading Similar Posts
Powered by the version 2.1