reseq data curation question
1
0
Entering edit mode
8.2 years ago

I am working with virus data and I was wondering what the difference is between viral.1.protein.faa, viral.2.protein.faa and viral.nonredundant_protein.1.protein.faa? See ftp.ncbi.nlm.nih.gov/refseq/release/viral/ . If I am making a database to blast against all the viruses, should I use all three files or just viral.1.protein.faa + viral.2.protein.faa. Thanks.

refseq • 1.2k views
ADD COMMENT
0
Entering edit mode
8.2 years ago
GenoMax 141k

From NCBI RefSeq FAQ:

RefSeq transcript and protein records that are not yet annotated on the corresponding genome, and autonomous non-redundant proteins (WP_ accession prefix) that are not yet directly annotated on a genome.

You will have to decide.

ADD COMMENT

Login before adding your answer.

Traffic: 2555 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6