Hello experts,
I am having issues with X! Tandem in combination with PeptideProphet and iProphet and I was wondering if someone could help, please.
As a background, we (our lab) have been using X!Tandem on its own using its expectation value cut-off for the past years, which seemed to work fine with our Sciex Triple-TOF instrument. More recently, we have switched to using Orbitrap instruments which perform a lot better and I have realized that Comet, whether alone or combined with other TPP peptide validation tools (PeptideProphet/iProphet), gives us great coverage. But I have also noticed that X!Tandem results suffer a huge loss when processed with PeptideProphet/iProphet, and I am wondering if I am doing something wrong. Here is an example:
First, I understand that Tandem must be set up to output all search results for the Prophets to work. So I make sure to set up the
output, results parameter to "all", which should override the max valid e-value:
This results in Tandem reporting ~40k valid models:
Running the Prophets, it seems that PeptideProphet fails to properly capture and fit the two peaks of false and valid model distributions, especially for charge +2 and +3 ions (btw, is there a way to extend the graphs to fval > 10?). Maybe because of this, I get only ~17k PSMs at 1% error rate:
On the other hand, if I analyze the same data using Comet, I get well-separated score dists and over 47k validated PSMs:
Some more surprising observations:
- When I set "output, results" to "valid", X!Tandem still reports about the same number of valid models (39935). This indicates that X!Tandem does not actually underperform by 64% (17k vs 47k), but rather only 15% (40k vs 47k).
- If I set "output, results" to "all" and change "output, maximum valid expectation value" to "1000.0", X!Tandem reports ~104k valid models. This is confusing because it suggests that the "all" option does not override the max e-value cut-off. Either way, the PeptideProphet output (score distributions and num correct) still looks the same with these settings.
I am concerned that I am not setting up the Tandem workflow correctly, or that the tools are not actually working right. I would appreciate any feedback.
Here is a link to the analysis input files:
If I have forgotten something or if you want the full analysis files, please let me know.
Thank you,
Farshad