Hi all,
Though there have already been some posts relating to salmon, I realized that it had not yet been officially announced on this group. Salmon is a new piece of software on which we've been working hard for a while now. In many ways, salmon is a successor to Sailfish. Though there are some significant methodological differences, the goal remains largely the same; provide rapid and accurate estimates of isoform abundance. With salmon, we're working on increasing both accuracy and speed, as well as providing more options that might be useful in different scenarios.
For example, salmon supports both alignment-free and alignment-based modes of operation. The interface for the alignment-free mode of operation is much like that of sailfish; you first build a salmon index, and then quantify abundance for different experiments by providing salmon directly with the reads. Salmon uses a fundamentally different type of index from Sailfish and thus, indices will have to be rebuilt. However, the index construction step is fairly efficient (it should actually be much faster than that of sailfish for very large transcriptomes).
The alignment-based mode of salmon allows one to use existing alignments, provided as a SAM or BAM file (CRAM files should also work but we haven't tested this yet), to quantify transcript abundance. This might be particularly useful in scenarios where alignments have already been computed, when they must be computed anyway for some other purpose, or when one has specific requirements / desires about alignment scoring and details.
All development for salmon is currently happening in the
develop branch of the Sailfish github repository. This means that if you clone / download from this branch and run the build process, your 'bin' directory will contain salmon as well as sailfish. We're also providing pre-built executables for salmon under the
"releases" portion of the repository (the most current release is v0.2.0, available
here). These are updated whenever significant improvements are made or features are added, but they are typically a little bit behind the "bleeding-edge" version that is available when building from source. Currently, the binaries are only being built for Linux, but we hope to provide prebuilt binaries for OSX in the future. Since, for the time being, there's not a separate source repository or user group, we welcome questions and discussion of salmon here and on the github repo. The documentation is available
here on
ReadTheDocs.
We're currently writing the manuscript describing the methods and algorithms behind salmon. As with sailfish, we plan to make a pre-print available. However, the purpose of developing salmon out in the open is for people to test and use it as early as possible and to give us feedback and suggestions, so please give it a try if you have a chance! More details are available
in the docs.
--Rob