Entering edit mode
8.5 years ago
ilbiotecnologo
•
0
Hi. I'm an absolute beginner to programming and work with linux terminal with bioinformatic file
I have a multifasta (database) file (Trascritti.fa
) with thousand of transcripts in fasta format.
Like this:
>VIT_201s0011g03530.1
AATTAAGCATAAATACTCACTCTTACCCCCTTATTTTCTTATCTCTCATCACTTTTGGTGCGAAGAATTG
GACCATGAGAACAAGCTGCAATGGGTGTAGGGTTCTTCGCAAGGCATGCAGCCAAGACTGCATCATCAAA
CCTTGCCTTGAGTGGATCAAAAATCCTGATTTCCAAGCCAATGCTACTCTCTTCCTTGCCAAATTCTATG
>VIT_201s0011g03540.1
CAGGTAGCGTGAAGTTAAACCCTAGCGCTTTAGACAAACAGCTGTAGTCACCGCCCACAAACACCCCCAC
AGCCTCTGAGACACCACCTCAAACCTTTCCACTTAAATACACATCCCTCACACCCTTTTCAATTCCGTAC
TATAAATCTCTCTGCAACAACGGCAGCAACGCCCTACAGCCATGAGGATGAGCTGCAATGGCTGTAGGGT
>VIT_201s0011g03550.1
CATGCAAAGCTGAACGCGATGCTGTGATTGGTGGTAAGTGGTAGTTGAGTAAATTTGACAGTGAAGGAAG
GCCGAAATGGTAAAAGACTAAGGCTAGAAGTAGAATACCACTGTTCTTCTCATCACGTGGGCCCATGAAA
TACTGCATGACCCATGAGGCTCCCTCTCCTGCTCACTCTCTCTATCATTCGCTCTCGCCCAAAATAGCCT
>VIT_201s0011g03560.1
TTCGCCTTCTCTTTCTCTCTGAAACCCTCTCTTTCTCTCTCTAGACCAAGAGATGGGAGAAGGAAAGGGT
TCCACTCTAGTCCATCTAGTTGTGGTGGTTCTGAGTCTCGTCGCCTTTGGCTTTGCCGTTGCTGCTGAGC
GCCGCAGAAGCGTCGGTACAATAGTTACAGATGATCGAAATGCTACCTACTGTGTTTACAACTCTGATGT
>VIT_201s0011g03570.1
ATGGTGAAAGTTCCCAAGTCGAGAGATGGAGAGAATCACGTTAAAGTGCACAAGTATGGGGTGGGGAAGA
CGAAGAAGAGAGTGAAGGAGGAGGTGGGGAAGGTGGAAGACGAGAAGAGGGATGACAGAGAATCCATTGC
>VIT_201s0011g03580.2
TGATGCGATATTTATCAGATTTTTATTTATTTTATTTTATTAATACTTTGTTAAGGAGTTGGTGCCAAAA
CCGATGAGACTTCTCGCGGGACGCACCGCCTGTGTGAGGGAGTAAAAAAAATAATTAAAAATAAATAAAG
And an other ID.txt
file that have only a list of id like this
VIT_201s0011g03540
VIT_201s0011g03550
VIT_201s0011g03560
The two file are in the same directory
I need a simple way to extract the ID with sequence from the multifasta (database)
Your contribution will be appreciate with the best regards