Gmail Calendar Documents Reader Web more »
Recently Visited Groups | Help | Sign in
Google Groups Home
The problem with running xinteract on OMSSA pepxml
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  18 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Ping  
View profile  
 More options Jun 30, 1:29 pm
From: Ping <yanpp...@gmail.com>
Date: Tue, 30 Jun 2009 10:29:02 -0700 (PDT)
Local: Tues, Jun 30 2009 1:29 pm
Subject: The problem with running xinteract on OMSSA pepxml
Hi,

I am trying to run the xinteract on the omssa pep.xml output files. my
omssa's version is  2.1.4, my TPP version is 4.2.1. But I couldn't get
it through. I search the old post, there is a similar post, but the
problem was solved by specifying enzyme to xinteract.

I tried it, but is still not working. InteractParse went through, but
PeptideProphetParser got stuck by a segmentation fault.

Any help would be greatly appreciated!

Many Thanks,

Ping

***** output for interactParser and PeptideProphetParser

InteractParser 'interact.pep.xml' 'omssa.pep.xml' -L'7' -E'trypsin' -C
-P
 file 1: ParoSaliv_SHAM_03.pep.xml
 processed altogether 2623 results

PeptideProphetParser 'interact.pep.xml' DECOY=DECOY MINPROB=0
NONPARAM
Using Decoy Label "DECOY".
Using non-parametric distributions
 (OMSSA) (minprob 0)
WARNING!! The discriminant function for OMSSA is not yet complete.  It
is presented here to help facilitate trial and discussion.  Reliance
on this code for publishable scientific results is not recommended.
init with OMSSA Trypsin
MS Instrument info: Manufacturer: UNKNOWN, Model: UNKNOWN, Ionization:
UNKNOWN, Analyzer: UNKNOWN, Detector: UNKNOWN

 PeptideProphet  (TPP v4.2 JETSTREAM rev 1, Build 200905131510
(linux)) AKeller@ISB
 read in 75 1+, 1790 2+, 749 3+, 0 4+, 0 5+, 0 6+, and 0 7+ spectra.
Initialising statistical models ...
WARNING: No decoys with label DECOY were found in this dataset.
reverting to fully unsupervised method.
Iterations: .........10.........20
Segmentation fault


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Zhi Sun  
View profile  
 More options Jun 30, 2:08 pm
From: "Zhi Sun" <z...@systemsbiology.org>
Date: Tue, 30 Jun 2009 11:08:56 -0700
Local: Tues, Jun 30 2009 2:08 pm
Subject: RE: [spctools-discuss] The problem with running xinteract on OMSSA pepxml
Hi all,

I just checked in two programs, and did not put the log message.

I will give a short description here:

createChargeFile.pl: this program reads the ms2 file generated by CPM
(charge prediction program), or directory which contains a list of dta file,
and creates a .charge file. The .charge file has two columns:
<spectrum_id>\t<charges>.

mergeCharges.pl: this program reads the .charges file produced by above
program and the original mzXML file, creates a new mzXML file from the old
one using the charge info in .charge file, and recreates the mzXML index.

These programs are used to process the ETD data.

Thanks,
Zhi


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Zhi Sun  
View profile  
 More options Jun 30, 2:08 pm
From: "Zhi Sun" <z...@systemsbiology.org>
Date: Tue, 30 Jun 2009 11:08:56 -0700
Local: Tues, Jun 30 2009 2:08 pm
Subject: RE: [spctools-discuss] The problem with running xinteract on OMSSA pepxml
Hi all,

I just checked in two programs, and did not put the log message.

I will give a short description here:

createChargeFile.pl: this program reads the ms2 file generated by CPM
(charge prediction program), or directory which contains a list of dta file,
and creates a .charge file. The .charge file has two columns:
<spectrum_id>\t<charges>.

mergeCharges.pl: this program reads the .charges file produced by above
program and the original mzXML file, creates a new mzXML file from the old
one using the charge info in .charge file, and recreates the mzXML index.

These programs are used to process the ETD data.

Thanks,
Zhi


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jun 30, 6:23 pm
From: Ping <yanpp...@gmail.com>
Date: Tue, 30 Jun 2009 15:23:03 -0700 (PDT)
Local: Tues, Jun 30 2009 6:23 pm
Subject: Re: The problem with running xinteract on OMSSA pepxml
Is this something else and suppose to be a new post?

YP

On Jun 30, 11:08 am, "Zhi Sun" <z...@systemsbiology.org> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 2, 1:38 pm
From: Ping <yanpp...@gmail.com>
Date: Thu, 2 Jul 2009 10:38:44 -0700 (PDT)
Local: Thurs, Jul 2 2009 1:38 pm
Subject: Re: The problem with running xinteract on OMSSA pepxml
Does any one have suggestion on this problem? I used the omssacl -op
option to generate the pepxml result. Will this cause problems?

Thanks,

Ping Yan


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
David Shteynberg  
View profile  
 More options Jul 2, 2:20 pm
From: David Shteynberg <dshteynb...@systemsbiology.org>
Date: Thu, 2 Jul 2009 11:20:28 -0700
Local: Thurs, Jul 2 2009 2:20 pm
Subject: Re: [spctools-discuss] Re: The problem with running xinteract on OMSSA pepxml
Hi Ping,

The problem is this message:
WARNING: No decoys with label DECOY were found in this dataset.
reverting to fully unsupervised method.
Iterations: .........10.........20

The OMSSA modelling relies on having a nonparameteric model which is
built from decoys IDs from the database.  Are you searching your data
against a database that contains some decoys?  Are you specifying the
correct decoy_tag to TPP?  The decoy_tag is a unique string that
identifies all your decoy sequences in the database,  all decoy names
must begin with this string.

-David


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Jimmy Eng  
View profile  
 More options Jul 2, 7:44 pm
From: Jimmy Eng <j...@systemsbiology.org>
Date: Thu, 02 Jul 2009 16:44:48 -0700
Local: Thurs, Jul 2 2009 7:44 pm
Subject: Re: [spctools-discuss] The problem with running xinteract on OMSSA pepxml
Ping,

I just downloaded OMSSA 2.1.4 and tried the direct pep.xml export
myself.  I do see a problem with the resulting pep.xml file that the
"-op" option generates that's causing the problem you're seeing.

The key error message in your output is this:
WARNING: No decoys with label DECOY were found in this dataset.

Looking at the generated pep.xml files, OMSSA seems to be placing some
number in the protein="" attribute of each search_hit element.  Whereas
PeptideProphet expects this protein attribute to contain some protein
identifier that includes the DECOY string for those decoy matches.  In
the converters we use, the value of the protein attribute is the first
word of the protein definition line.

As for a fix, we need someone at NCBI to address this and hopefully
someone here will contact them about this.  For you in the short term,
you're going to need a developer to modify you pep.xml files to replace
the value in the "protein" attribute with the first word from the
"protein_descr" attribute of each search_hit entry.

- Jimmy


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
David Shteynberg  
View profile  
 More options Jul 2, 10:15 pm
From: David Shteynberg <dshteynb...@systemsbiology.org>
Date: Thu, 2 Jul 2009 19:15:34 -0700
Local: Thurs, Jul 2 2009 10:15 pm
Subject: Re: [spctools-discuss] The problem with running xinteract on OMSSA pepxml
If you are running the tpp on the commandline you can correct this
problem in the omssa output by running the steps of xinteract
separately and enabling InteractParser option -P.

On 7/2/09, Jimmy Eng <j...@systemsbiology.org> wrote:

--
Sent from my mobile device

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 3, 1:08 am
From: Ping <yanpp...@gmail.com>
Date: Thu, 2 Jul 2009 22:08:01 -0700 (PDT)
Local: Fri, Jul 3 2009 1:08 am
Subject: Re: The problem with running xinteract on OMSSA pepxml
Jimmy,

Thanks a lot for your explanation. I am going to write a script to fix
the pep.xml file and see how it goes.

Ping

On Jul 2, 4:44 pm, Jimmy Eng <j...@systemsbiology.org> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 3, 1:12 am
From: Ping <yanpp...@gmail.com>
Date: Thu, 2 Jul 2009 22:12:11 -0700 (PDT)
Local: Fri, Jul 3 2009 1:12 am
Subject: Re: The problem with running xinteract on OMSSA pepxml
David,

Thanks a lot for your response. I generated the decoy database and
rerun
the OMSSA earlier.

I run the command line manually. And as you suggested, I used option -
p for
InteractParser, but when I run PeptideProphetParse, got this error:

ERROR: NAN probability density detected.  Please alert the
developer !!!

I will try Jimmy's suggestion and see how it goes.

Thanks again!

Ping

On Jul 2, 7:15 pm, David Shteynberg <dshteynb...@systemsbiology.org>
wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Jake W  
View profile  
 More options Jul 3, 6:46 am
From: Jake W <jrwba...@gmail.com>
Date: Fri, 3 Jul 2009 03:46:25 -0700 (PDT)
Local: Fri, Jul 3 2009 6:46 am
Subject: Re: The problem with running xinteract on OMSSA pepxml
One solution to this is to use the indexing option ("-o T") in
formatdb when formatting your BLAST database for OMSSA searching.  In
my experience, that will result in OMSSA writing the first word of the
protein definition line to the "protein" attribute in the pepXML.
Without the -o option, OMSSA just puts a number in the "protein="
attribute indicating that the hit came from the nth entry in the
database.

    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 3, 5:16 pm
From: Ping <yanpp...@gmail.com>
Date: Fri, 3 Jul 2009 14:16:25 -0700 (PDT)
Local: Fri, Jul 3 2009 5:16 pm
Subject: Re: The problem with running xinteract on OMSSA pepxml
Hi Jake,

Thanks for your response. I tried -o T option with formatdb, and rerun
the omssacl, but got this error message.

"omssacl.cpp", line 279: Fatal: Exception in COMSSA::Run: NCBI C++
Exception:
    "/net/napc02/vol/pubchem/Users/lewisg/checkin/cxx/src/serial/
serialobject.cpp", line 228: Error: NCBI-BlastDL::Blast-def-line.title

Ping

On Jul 3, 3:46 am, Jake W <jrwba...@gmail.com> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 6, 3:31 pm
From: Ping <yanpp...@gmail.com>
Date: Mon, 6 Jul 2009 12:31:27 -0700 (PDT)
Local: Mon, Jul 6 2009 3:31 pm
Subject: Re: The problem with running xinteract on OMSSA pepxml
Jimmy,

As you suggested, I replace the value in the "protein" with the first
word from the "protein_descr" in each search_hit entry. But when I run
PeptideProphetParser, it still gives the error message:

ERROR: NAN probability density detected.  Please alert the
developer !!!

Is there anything else I need to fix?

Thanks!

Ping

Here is the example of the result after I replace protein/
protein_descr

for not decoy string, it looks like this:

<search_hit hit_rank="1" peptide="YPSRPLPPPPPFGLGFVPPPPPPYGPGR"
peptide_prev_aa="R" peptide_next_aa="I" protein="gi|73975149|ref|
XP_862242.1|" num_tot_proteins="3" num_matched_ions="15"
tot_num_ions="54" calc_neutral_pep_mass="2949.569"
massdiff="0.59985990298992" is_rejected="0" protein_descr="gi|
73975149|
ref|XP_862242.1| PREDICTED: hypothetical protein XP_857149 isoform 2
[Canis familiaris]" num_tol_term="2" num_missed_cleavages="0">
<alternative_protein protein="gi|73975151|ref|XP_862273.1|"
protein_descr="gi|73975151|ref|XP_862273.1| PREDICTED: hypothetical
protein XP_857180 isoform 3 [Canis familiaris]"/>
<alternative_protein protein="gi|73975153|ref|XP_862298.1|"
protein_descr="gi|73975153|ref|XP_862298.1| PREDICTED: hypothetical
protein XP_857205 isoform 4 [Canis familiaris]"/>
<search_score name="pvalue" value="0.000002382663488"/>
<search_score name="expect" value="0.028320338214958"/>
</search_hit>

For decoy string, it looks like this:

<search_hit hit_rank="2" peptide="QESARYSAKVTVAGLEESATEAQQQIR"
peptide_prev_aa="K" peptide_next_aa="S" protein="decoy_2098
2" num_tot_proteins="1" num_matched_ions="17" tot_num_ions="52"
calc_neutral_pep_mass="2949.481" massdiff="0.9728599071700
05" is_rejected="0" protein_descr="decoy_20982">
<search_score name="pvalue" value="0.000014221885248"/>
<search_score name="expect" value="0.168173793062284"/>
</search_hit>

On Jul 2, 4:44 pm, Jimmy Eng <j...@systemsbiology.org> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Jimmy Eng  
View profile  
 More options Jul 6, 4:55 pm
From: Jimmy Eng <j...@systemsbiology.org>
Date: Mon, 06 Jul 2009 13:55:06 -0700
Local: Mon, Jul 6 2009 4:55 pm
Subject: Re: [spctools-discuss] Re: The problem with running xinteract on OMSSA pepxml
Has anyone been able to feed OMSSA native pepXML output (using -op
option) through PeptideProphet?  I don't know if anyone has tested this
or if OMSSA has only been tested using a separate converter.

Replacing the protein word was to address the error message having no
DECOY entries.  This fix hopefully addressed that one point which is
seems to have done.  So to address the follow-up problem, minimally
you'll want to include the diagnostic output from PeptideProphetParser
in case anything obvious shows up from that info.  Possibly you'll need
to have a developer, who knows more than I do, take a closer look at
your data.

- Jimmy


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 9, 1:21 pm
From: Ping <yanpp...@gmail.com>
Date: Thu, 9 Jul 2009 10:21:40 -0700 (PDT)
Local: Thurs, Jul 9 2009 1:21 pm
Subject: Re: The problem with running xinteract on OMSSA pepxml
Jimmy,

Thanks for your information.

I debug the TPP source code. There is a small bug inside the
NonParametricDistribution.cxx which causes the NAN error in the
previous tries.  I fix it and the PeptideProphetParser is working fine
now.

Also after enabling InteractParser option -P as Davis suggested, I do
not need to replace the value in the "protein" with the first word
from the "protein_descr" in each search_hit entry.

Thanks again,

Ping

On Jul 6, 1:55 pm, Jimmy Eng <j...@systemsbiology.org> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
GATTACA  
View profile  
 More options Jul 14, 11:46 am
From: GATTACA <dfer...@umich.edu>
Date: Tue, 14 Jul 2009 08:46:08 -0700 (PDT)
Local: Tues, Jul 14 2009 11:46 am
Subject: Re: The problem with running xinteract on OMSSA pepxml
Ping,

Could you post what changes you made to the
NonParametricDistribution.cxx to fix the NAN error?

Thanks.

On Jul 9, 1:21 pm, Ping <yanpp...@gmail.com> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ping  
View profile  
 More options Jul 14, 12:28 pm
From: Ping <yanpp...@gmail.com>
Date: Tue, 14 Jul 2009 09:28:40 -0700 (PDT)
Local: Tues, Jul 14 2009 12:28 pm
Subject: Re: The problem with running xinteract on OMSSA pepxml
Sure. I am not completely sure that my fix is correct.

Under two functions:

NonParametricDistribution::densityFit
NonParametricDistribution::varBWdensityFit

I changed :

(*d)[i] = (*d)[i] / k->size();

into:

if ( k->size() == 0)
  (*d)[i] = 0;
else
  (*d)[i] = (*d)[i] / k->size();

That is where I got the NAN error at the first point.

Thanks,

Ping

On Jul 14, 8:46 am, GATTACA <dfer...@umich.edu> wrote:


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
David Shteynberg  
View profile  
 More options Jul 14, 12:47 pm
From: David Shteynberg <dshteynb...@systemsbiology.org>
Date: Tue, 14 Jul 2009 09:47:52 -0700
Local: Tues, Jul 14 2009 12:47 pm
Subject: Re: [spctools-discuss] Re: The problem with running xinteract on OMSSA pepxml
Thanks for the fix.  I already checked one in like it into SVN a few
weeks back.  The upcoming release will avoid this problem.

-David


    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »

Create a group - Google Groups - Google Home - Terms of Service - Privacy Policy
©2009 Google