I am currently working on TMT proteomics analysis.
The software does not (to my knowledge!) handle different isoforms. It does state that duplicate isoforms are added to together.
on this page of the FAQ.
My TMT data had the peculiarity of having EXACTLY the same value for many of the isoforms (e.g.
P17677-2 & P17677). I talked to the technical staff and I think the reason is that they match the datapoints against a libarary. By doing so it registers counts for any sequence/proteinID that match the identified datapoint. In my case this meant that the software added all of those togheter - in effect multiplying the number of counts by the number of isoforms.
Therefore only those isoforms showing unique sets of counts are real.
Splitting my dataset into two.
a) For each isoform, added together the unique full counts for isoforms - representing full counts for a particular protein - all isoforms.
b) I also did an "isoform" analaysis. Here I kept the unique isoform counts. Since the gene names then will be e.g. Slc1a3-2 - pathway analysis wont work, but DEG etc should (see above more accurate answer). This I only used for comparing the abundance of different isoforms between my samples.
Other things
1. My standard TMT analysis automatically excluded all proteins that didnt find counts in every sample. This was obviously a problem since it excluded a gene that I had KO in half the samples.
Therefore I recommend to use the "raw counts" as input to omicsPG. It will normalise within samples as Counts per million (for that sample) CPM.
2. Manually remove identified cRAP genes (Hs contamination) from your dataset.
3. As always make sure proteins that excel/sheets abbreviate to sep-2 or march-5 have their acutal names (Septin2 and marchf5).
Hope this is of help to someone :)
Friendly regards, Staffan