On Tue, Aug 28, 2012 at 4:10 AM, Mark Gibbins <xiy...@gmail.com> wrote:
> I couldn't determine from your message, but the alias file and database
> parts must be in the same path.
Right. But the FAQ[1] entry "can I use preformatted BLAST database"
contradicts this. So most likely Halocaridina's alias file is in a
separate directory, while the volumes are in the database directory.
That is probably the issue: SS ignores the volumes and never sees the
alias file, because it is in a separate directory.
I think we should remove that FAQ entry. For one, the alternative
solution is confusing and less preferable since SS handles it out of
the box (thanks to Mark). Second, the FAQ makes it sound like an edge
case when it is not. Third, let's not suggest to users that modifying
alias files themselves is OK. It's not. It incurs additional
administrative burden. Yannick?
> The specifications of the multipart alias
> file from NCBI force this.
I don't think so. Alias files are plain text files with pointers to
the volumes. If the pointers are absolute paths, BLAST+ can find the
volumes regardless of whether the alias file and the volumes are in
the same directory or not. Yannick's solution (the FAQ) suggests the same.
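A minimal sketch of how one could check where an alias file's pointers lead, assuming the plain-text `DBLIST` layout described in this thread. The helper names `read_dblist` and `resolve_volumes` are hypothetical, `.nin` is used as the index extension for a nucleotide database, and resolving relative entries against the alias file's own directory is an assumption rather than something confirmed here.

```python
import os
import shlex


def read_dblist(alias_path):
    """Return the DBLIST entries of a BLAST alias file (.nal/.pal)."""
    entries = []
    with open(alias_path) as handle:
        for line in handle:
            line = line.strip()
            # Skip blank lines and comment lines such as "#GILIST".
            if not line or line.startswith("#"):
                continue
            key, _, value = line.partition(" ")
            if key == "DBLIST":
                # shlex keeps quoted entries containing spaces together.
                entries.extend(shlex.split(value))
    return entries


def resolve_volumes(alias_path, index_ext=".nin"):
    """Pair each DBLIST entry with whether its index file exists.

    Assumption: relative entries are resolved against the directory
    that holds the alias file; absolute entries are used as written.
    """
    alias_dir = os.path.dirname(os.path.abspath(alias_path))
    result = []
    for entry in read_dblist(alias_path):
        base = entry if os.path.isabs(entry) else os.path.join(alias_dir, entry)
        result.append((base, os.path.exists(base + index_ext)))
    return result
```

Calling `resolve_volumes("/path/to/alias.nal")` and looking for entries paired with `False` shows which pointers do not lead to an index file. For a protein alias (`.pal`), pass `index_ext=".pin"`.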
[1]: http://www.sequenceserver.com/#faq
--
Anurag Priyam
Hi Mark and Anurag,
Thanks for the quick reply. Answers to your questions are:
1) The .nal file is in the same directory as the DBs created using "sequenceserver format-databases".
2) Each DB has six files associated with it having extensions: .nhr .nin .nog .nsd .nsi .nsq
3) For each DB in that directory, there is a symlink to the fasta file that was used to build the database and the symlink shares the same name as each DB. Thinking that having the symlinks and DBs sharing the same name in the same directory might be the problem, I moved the symlinks out of the directory, restarted Apache and revisited the SS page. Same "- ignoring it" message.
4) The .nal file has the following structure:
DBLIST /home/data_processed/db_4_blast/Abarenicola_pacifica /home/data_processed/db_4_blast/Alciopa_spp [absolute paths to the other 43 DBs, each separated by a space]
5) I'm using the basenames of each DB (i.e., no extensions) with an absolute path in the .nal file, which should be correct syntax as Anurag pointed out.
Hi Mark,
I appreciate the effort. Unfortunately, the error still persists using the alias that you provided. I actually tried the same thing yesterday (removing the absolute paths) and got the "- ignoring it" message.
I noticed one interesting behavior yesterday, which started me down the path of trying to use an alias file and might shed some light on what's going on.
The individual FASTA files that I'm building the DBs from have anywhere from 40K-200K entries. The DBs created from them using "sequenceserver format-databases" are picked up by SS and work fine from the web interface.
Yesterday morning, I concatenated all 45 FASTA files into a single FASTA file (headers have taxon-specific tags, so easy to track) and ran "sequenceserver format-databases" against that ~4.5GB file to generate a "mega" DB. Once it was created in the SS DB directory where all of the other DBs are, I restarted Apache and revisited the SS page, and the "mega" DB wasn't listed while everything else was.
The error was exactly the same as with the alias file: "Found a multi-part database volume at /path/to/SS/db_directory - ignoring it"
--
You received this message because you are subscribed to the Google Groups "sequenceserver" group.
To post to this group, send email to sequenc...@googlegroups.com.
To unsubscribe from this group, send email to sequenceserve...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msg/sequenceserver/-/P0ZisuIEk-kJ.
For more options, visit https://groups.google.com/groups/opt_out.
Hi Mark,
Thanks again for the help. I eventually got everything worked out and the alias file that you sent is now working.
I feel a little stupid since it came down to a simple renaming of the .nal file. Originally, I had just used the same .nal file name and pasted the alias information you provided into it. It dawned on me last night after going through this thread that this might be the problem.
Specifically, I changed the filename from "All_taxa_08_27_12.nal" to "All_taxa.nal" and the alias was picked up and added to the SS list of available DBs (I also checked that it indeed worked by doing a BLAST). To verify that the old "All_taxa_08_27_12.nal" name was really the problem, I renamed the now working "All_taxa.nal" back to "All_taxa_08_27_12.nal", restarted SS and it was ignored/disappeared from the DB list. Renamed it again to "All_taxa.nal", restarted and again the alias works.
Is this expected behavior?
Cheers, thanks again and apologies for wasting your time,
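One hypothesis that fits the renaming experiment above, offered only as an illustration: a name-based check that treats any database name ending in two digits as a multi-part volume would flag "All_taxa_08_27_12" but not "All_taxa". The pattern and the function name `looks_like_volume` below are hypothetical. Nothing in this thread confirms that SS uses this rule.

```python
import re

# Hypothetical heuristic (an assumption, not SS's confirmed logic):
# a database name whose last two characters are digits is taken
# to be one volume of a multi-part database.
VOLUME_LIKE = re.compile(r"\d{2}$")


def looks_like_volume(db_name):
    """Return True if the name ends in two digits."""
    return bool(VOLUME_LIKE.search(db_name))


for name in ["All_taxa_08_27_12", "All_taxa", "Abarenicola_pacifica"]:
    print(name, looks_like_volume(name))
```

Under this assumed rule only "All_taxa_08_27_12" is classified as a volume, which would match the alias being ignored under the dated name and accepted once the date suffix is dropped.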
$ cat md5nr.pal
#
# Alias file created Sun Apr 1 16:21:26 2012
#
#
TITLE md5nr
#
DBLIST md5nr.00 md5nr.01 md5nr.02 md5nr.03 md5nr.04 md5nr.05 md5nr.06 md5nr.07
#
#GILIST
#
#OIDLIST
#
$ ls
build_env md5nr.00.psi md5nr.01.psi md5nr.02.psi md5nr.03.psi md5nr.04.psi md5nr.05.psi md5nr.06.psi md5nr.07.psi
md5nr md5nr.00.psq md5nr.01.psq md5nr.02.psq md5nr.03.psq md5nr.04.psq md5nr.05.psq md5nr.06.psq md5nr.07.psq
md5nr.00.phr md5nr.01.phr md5nr.02.phr md5nr.03.phr md5nr.04.phr md5nr.05.phr md5nr.06.phr md5nr.07.phr md5nr_blast.tar.gz
md5nr.00.pin md5nr.01.pin md5nr.02.pin md5nr.03.pin md5nr.04.pin md5nr.05.pin md5nr.06.pin md5nr.07.pin md5nr.pal
md5nr.00.psd md5nr.01.psd md5nr.02.psd md5nr.03.psd md5nr.04.psd md5nr.05.psd md5nr.06.psd md5nr.07.psd
[2013-08-28T16:36:46.204690 #6471] INFO -- : Found a multi-part database volume at /home/jfk/Software/MD5nr/md5nr.01 - ignoring it.
D, [2013-08-29T09:13:12.854989 #10647] DEBUG -- : method: blastp
D, [2013-08-29T09:13:12.855219 #10647] DEBUG -- : sequence: MTGTTGATAWR
D, [2013-08-29T09:13:12.855494 #10647] DEBUG -- : database: ["69d7ff233621b78e5ef844130befbae9"]
D, [2013-08-29T09:13:12.855640 #10647] DEBUG -- : advanced:
top - 09:15:03 up 1 day, 16:50, 3 users, load average: 0,77, 0,31, 0,15
Tasks: 144 total, 2 running, 142 sleeping, 0 stopped, 0 zombie
%Cpu(s): 20,9 us, 7,7 sy, 0,0 ni, 26,0 id, 45,1 wa, 0,0 hi, 0,3 si, 0,0 st
KiB Mem: 1017684 total, 950224 used, 67460 free, 31272 buffers
KiB Swap: 2074620 total, 370312 used, 1704308 free, 602548 cached
D, [2013-08-29T09:14:38.228354 #10707] DEBUG -- : sequence: MTGTTGATAWR
10742 jfk 20 0 2301m 308m 306m R 50,4 31,1 0:12.15 blastp
23 root 20 0 0 0 0 S 9,3 0,0 0:54.39 kswapd0
9096 root 20 0 0 0 0 S 0,3 0,0 0:01.67 kworker/0:2
10737 jfk 20 0 23300 1644 1136 R 0,3 0,2 0:00.12 top