I have metatranscriptomic data for several communities. When I use HISAT2 to align the reads from some of the pairings to a single genome I typically get near 50% of the reads aligning. There are a couple cases where near 100% of the reads align to a single genome. In these cases I am assuming that the genomes are very similar. Unfortunately I only have the whole genome sequence for one of the strains.
Is there a way to limit the amount of reads aligning to the wrong genome in HISAT2? My instinct is to increase the stringency of the alignment using the --score-min option. However I would like to hear from the community in case there is a more commonly used solution.
You have noted the main problem yourself. There are no magic bullets/programs here especially since one is looking at relatively small snippets of sequence that could be very similar.
There is BBSplit from BBMap suite which can allow fine grained control over how reads are binned/classified that you can try but at the end of the day limitations imposed by technology are going to prevail.