I've been trying to assign the taxonomy of some blastn hits from a locally downloaded nt database by referencing the names.dmp and nodes.dmp files included in taxdump. Everything went smoothly for almost all ~300,000 hits, except for six taxids which aren't present in the taxdump files.
These taxids do return a match when searching NCBI's online taxonomy browser (https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cg), but the match they return doesn't show that taxid on the page (ie when searching taxid 1796529, it returns a beetle subfamily with a taxid of 2558035). Four of the taxids even return the same subfamily (each search returns a page with the same taxonomy, but a slightly different taxid which is also perplexing). Here are the six:
1859523
1796529, 1796531, 1796534, 1796527
1796546
Not sure if it's some kind of aliasing or these taxids are deprecated, but does anyone know what's going? Any help is much appreciated!
Thanks for looking that up! Any idea why those taxids specifically are aliased?
I also found that these aliases are present in the merged.dmp file included in taxdump so that should make automation fairly simple!