Hi Samantha,
If you look at denovo_map.log file you will see all the individual ustacks commands. Instead of running denovo_map.pl, you can queue up the individual ustacks runs independently on your cluster. Once that is complete, you can run cstacks, sstacks, tsv2bam, gstacks, and populations. If you add the “--dry-run” flag to denovo_map.pl it will print out all the commands it would run (without running them). You may consider writing your sysadmins and asking them to override the maximum walltime. You may also consider down-sampling those samples with >40million reads as that will significantly reduce your run time as well.
Best,
julian
Hi Paige,
1) You are able to run ustacks independently on your samples. If you do so, these outputs should all be in a single, stacks output directory.
2) You can then run cstacks on all of these samples, or a subset (if you have a large number of samples you may choose to build your catalog from a representative subset, trying to get at least one copy of each locus/allele into the catalog), but it must be a single cstacks run.
3) You can run sstacks in a single run on all samples, or you can run it independently on each sample (either way you supply the catalog you generated in the previous step).
All of these outputs go into the same stacks output directory.
4) Finally, you have to run tsv2bam and gstacks, one time each on the whole data set.
(Populations can be run as many times as you like, once the core pipeline steps are complete, and you can vary the samples that are processed using different population maps.)
Hope this helps—
julian
Yes, you need to have unique IDs for your samples. If you ran denovo_map.pl with the --dry-run option, it would show you each ustacks command demonstrating the unique IDs.
You can either re-run ustacks with different IDs for each sample, or you can use some UNIX commands to search and replace the ID field in each of the output files. The former is simpler (if you don’t know UNIX well), but the latter is faster (particularly if you know UNIX).