Rosetta make_fragments.pl fails: "Cannot initialize readdb for nr database"
1
0
Entering edit mode
4.0 years ago
Alex_I • 0

I'm trying to run Rosetta to do a fold and dock of the C-terminal of SARS-CoV-2 spike protein.

When I run make_fragments.pl for the first time, it downloads a bunch of nr*.gz files, and then fails with the following error:

[fastacmd] ERROR: ERROR: Cannot initialize readdb for nr database

This is a new Rosetta install of the latest version.

Digging into this a bit, make_fragments.pl runs install_dependencies.pl which runs fastacmd -D 1, like this:

        my $cmd = "$Bin/blast/bin/fastacmd -D 1 > $datdir/nr";

In my case the data is in /mnt/data/rosetta/tools/fragment_tools/databases so I tried changing to that directory and running /mnt/data/rosetta/tools/fragment_tools/blast/bin/fastacmd -D 1 directly; same error.

The databases directory is rather large, 217GB. It contains nr.00.phd nr.00.phi nr.00.phr nr.00.pin nr.00.pog nr.00.ppd nr.00.ppi nr.00.psq nr.00.tar.gz.md5 and so on, for nr.00 through nr.38; also nr.pal nr.pdb nr.pos nr.pot nr.ptf nr.pto. It doesn't seem like there were any issues with downloading and unpacking the nr database.

I'm not sure why fastacmd produces an error or how to fix it. What is it looking for? Why is the thing it's looking for not there?

Environment: Rosetta 2020.08.61146 (rosetta_bin_linux_3.12_bundle.tgz), Ubuntu 18.04 on AWS instance

software-error nr rosetta • 2.7k views
ADD COMMENT
2
Entering edit mode
4.0 years ago
GenoMax 141k

nr database is now in v5 format by default. I am not sure if rosetta has been updated to account for that fact. You may need to download v4 format database from here, if rosetta needs the older format.

ADD COMMENT
0
Entering edit mode

Thanks! How recent of a change is this? My Rosetta seems to be from March 9, 2020.

It looks like the nr download is handled by ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/2.10.0/ncbi-blast-2.10.0+-src.tar.gz in c++/src/app/blast/update_blastdb.pl (it was expecting ncbi-blast-2.9 but couldn't find it so I had to bump the version) and fastacmd is from ftp://ftp.ncbi.nih.gov/blast/executables/legacy.NOTSUPPORTED/2.2.17/blast-2.2.17-x64-linux.tar.gz

ADD REPLY
1
Entering edit mode

This change was done on Feb 4th, 2020: New layout for NCBI BLAST FTP database site starting February 4, 2020

You may need to use blastdbcmd instead (sounds like that is retrieving fasta sequences).

ADD REPLY
0
Entering edit mode

@genomax - That was totally the right answer; I swapped v4 data in and everything else worked. Feel free to post that as an answer. Rosetta patch coming up :)

ADD REPLY
0
Entering edit mode

Thanks for the confirmation. I moved my comment to an answer. You can accept it (green check) to provide closure to this thread.

ADD REPLY
0
Entering edit mode

Hi, could you please describe how did you sapped to v4? I'm having the same problem.

Thanks,

ADD REPLY
0
Entering edit mode

Hi, could you please describe how did you sapped to v4? I'm having the same problem.

ADD REPLY
0
Entering edit mode

Hello Diego,

I did something like this: In a clean rosetta install, edit main/tools/fragment_tools/install_dependencies.pl to comment out these four lines

    # system("perl $datdir/update_blastdb.pl nr");
    # system("perl $datdir/update_blastdb.pl nr");
    # system("perl $datdir/update_blastdb.pl nr");
    # (system("perl $datdir/update_blastdb.pl nr") == 0) or do { &clean_nr_tgz; };

Run these commands to download v4 (if you don't have wget, install it first)

mkdir main/tools/fragment_tools/databases/
cd main/tools/fragment_tools/databases/
wget 'ftp://ftp.ncbi.nlm.nih.gov/blast/db/v4/nr*gz'

and NOW you can run install_dependencies.pl as usual

ADD REPLY
0
Entering edit mode

Thank you very much.

I'm going to try this.

Best regards

ADD REPLY
0
Entering edit mode

I ran into this issue today and I was wondering if this was ever patched in Rosetta, or should I still be falling back to the v4 format data?

I've been using the Linux release version "2021.16+release.8ee4f02"

ADD REPLY

Login before adding your answer.

Traffic: 2241 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6