Dear Anthony.
It seems I managed to solve the channel name in the annotation. But I'm still facing some issues:
Warning message:
In melt.data.table(mq_input, measure.vars = channels, id.vars = c("ProteinName", :
'measure.vars' [Reporterintensitycorrected1, Reporterintensitycorrected2, Reporterintensitycorrected3, Reporterintensitycorrected4, ...] are not all the same type. By hierarchy order, the resulting data value column will be of type 'double'. All variables that are not already of type 'double' will be coerced. Check the DETAILS in ?melt.data.table for more about coercion.
It is particularly strange, because in my annotation file I am using the "Reporter intensity" columns not "Reporter intensity Corrected", so probably this warning can be ignored.
Then the only real issu that I found out is that:
INFO [2025-01-23 18:02:45] ** 'Norm' information in Condition is required for normalization. Please check it. At this moment, normalization is not performed.
But in my annotation file (attached) I put the "Norm" in the 16th channel correctly. ALso I am sharing a better evidence.txt file.
My main objective with Ms Stats is to normalize the results using the reference channel, align by the meadians, and make imputation of missing values, then after that, use the data at protein level to third party applications.
This is my code:
library(MSstatsPTM)
library(readxl)
maxq_tmt_evidence <- read.table("data/evidence_subset.txt", sep = "\t", header = TRUE)
maxq_tmt_annotation <- read_excel("data/file.xlsx")
head(maxq_tmt_evidence)
head(maxq_tmt_annotation)
msstats_format_tmt = MaxQtoMSstatsPTMFormat(evidence=maxq_tmt_evidence,
annotation=maxq_tmt_annotation,
fasta=('data/Musmusculus_uniprotkb_AND_reviewed_true_AND_model_o_2024_06_25.fasta'),
fasta_protein_name="uniprot_ac",
mod_id="\\(Phospho \\(STY\\)\\)",
use_unmod_peptides=TRUE,
labeling_type = "TMT",
which_proteinid_ptm = "Proteins")
# Save the PTM data to a CSV file
#write.csv(msstats_format_tmt, file = "Data/msstats_format_tmt_PTM.csv", row.names = FALSE)
# Save the PROTEIN data to a CSV file
#write.csv(msstats_format_tmt$PROTEIN, file = "Data/msstats_format_tmt_PROTEIN.csv", row.names = FALSE)
head(msstats_format_tmt$PROTEIN, n = 40)
# Summarize data using dataSummarizationPTM_TMT
summary_data_tmt <- dataSummarizationPTM_TMT(
data = msstats_format_tmt,
method = "msstats",
global_norm = TRUE,
reference_norm = TRUE,
MBimpute = TRUE,
verbose = TRUE
)
Thank you in advance!
With the best regards,
Montoni.