Hello!
I need to create a data base or data repository to store like 110 GB of information related to assemblies, genome sequences with a lot of different format files (mdm5, .txt, .fastqc, xls) from an insect. My questions are: which are the best platforms to introduce these files and making accessible to all members of the group and if is it better normalize all the formats to .csv for example to have all the information in an unique format or not.
Thanks!
It depends, and I am afraid there is no single, simple answer. Are the members of the group biologists and / or wet-lab researchers with no bioinformatics skills, or are they bioinformaticians or wet-lab researchers with bioinformatics training? Will they be doing large-scale usage of the data, or the data will be used to support their more focused wet-lab research? Everybody will have access to the whole data, or there will be restrictions on what each people can use and access?
This is a nebulous question in its current form. One could imagine a simple web page hosting all of this information (hopefully organized into folders etc), a SharePoint repository or a proper LIMS (there are a few options). So the spectrum is rather wide.
Can you be more specific? Do you envision having
roles
of different types for different users or it would be afree for all
?