Hi all friends,
I used TRAPID for GO annotation of a plant de novo assembled transcriptome, the output is a large text file (about 16 MB) like below:
ab29703 GO:0019219
ab29703 GO:0009059
ab29707 GO:0044446
ab29707 GO:0006810
ab29707 GO:0044424
Contig6742 GO:0044260
Contig6742 GO:0003824
Contig6742 GO:0016772
I plan to use WEGO (enter link description here) for categorizing and plotting the GO annotation results, but the input of WEGO is something like:
ab29703 GO:0019219 GO:0009059
ab29707 GO:0044446 GO:0006810 GO:0044424
Contig6742 GO:0044260 GO:0003824 GO:0016772
Could you please help me out how I can change the original output format to accept by WEGO? Any comments and suggestion for using other tools for plant GO annotation and plotting would be highly appreciated.
Thanks for your reply. However, my question differed with yours; actually, I would like to have all GO term from one gene in every row of a text file. Please let me know if you have any suggestion.
Best
Yes, but the principle is the same right? Switch the genes with the go terms?
Sorry, I am not much familiar with R. Could you please put the appropriate command for using "dplyr" to get GO term from one gene in every row of a text file?
Thanks for your help,
Assuming you have a tab-del txt file named "exampl.txt"
Thank you very much b.nota. I try it.
I tried your suggested command, it worked well.
again thanks