Hi MSstats team,
I'm running into a problem part-way through the data summarization step of my phopsphoproteomics data.
The protein-level summarization appears to complete just fine for the phosphoproteomics data, and the first 4 replicates of the proteomics data, before crashing halfway through the fifth replicate.
It gives the following error message:
`Warning in dcast.data.table(LABEL + RUN ~ FEATURE, data = input, value.var = "newABUNDANCE", :
'fun.aggregate' is NULL, but found duplicate row/column combinations, so defaulting to length(). That is, the variables [LABEL, RUN, FEATURE] used in 'formula' do not uniquely identify rows in the input 'data'. In such cases, 'fun.aggregate' is used to derive a single representative value for each combination in the output data.table, for example by summing or averaging (fun.aggregate=sum or fun.aggregate=mean, respectively). Check the resulting table for values larger than 1 to see which combinations were not unique. See ?dcast.data.table for more details.
<simpleError in .Primitive("length")(newABUNDANCE, keep = TRUE): 2 arguments passed to 'length' which requires 1>
Warning in merge.data.table(input[, colnames(input) != "newABUNDANCE", with = FALSE], :
Input data.table 'y' has no columns.
Error in merge.data.table(input[, colnames(input) != "newABUNDANCE", with = FALSE], :
The following columns listed in `by` are missing from y: [FEATURE, RUN, cen]
`
These are the same PSM and annotation files I used for TMT proteomics using `MSstatsTMT`, with no problem.
I'm using R 4.4.2, MSstastsPTM 2.8.1, MSstats 4.14.1
I'll see if I can subset and anonymize the data to upload as an example.
Many thanks,
Sam