How do I choose a reference genome to order the contigs once after being assembled? in my case, I have no information about clinical isolate of the microorganism. Therefore, based on what do I choose a reference for conting ordering using Mauve. My sample is "ERR209055" downloaded from EBI and task is "antimicrobial resistance gene identification".
In the above case, if I have a human sample with E.coli, how to choose the reference?
How do I do that?
Depends on what your goal is I think. But if it is only human and E.coli you could use human and E.coli. You could also only use E.coli if you are not interested in the human genes. I personally think that if the goal is to find antimicrobial resistane genes you dont have to worry to much about human genes. But you said you already had done a assembly.
For predicting genes there are many tools: https://en.wikipedia.org/wiki/List_of_gene_prediction_software
How do I choose raw data from ENA, to find anti-microbial resistance genes and multi-loci sequence typing. Can I select human isolate containing single microorganism or multiple microorganism species? I choose "SRR1060710" for both the above task. Am I doing right?
Just to test or to try to find those genes it does not matter if you choose data from a single microorganism or multiple microorganism (metagenome). It helps if you know upfront that the organism has shown antimicrobial resistance in other studies. Globally:
Most resistance proteins consist of multiple domains. You often find a gene that codes for one domain but for resistance multiple domains are needed.