Chris,
FWIW, we took a similar file format conversion path for Bruker data,
but unfortunately I don't have experience with the X!Tandem/TPP
issues.
The processing path we took was:
- Data Analysis export of XML file (centroided "line" MS2 spectra),
followed by perl script conversion to .dta (the XML format contains
some metadata like compound indexes for each scan that was useful for
downstream processing).
- The .dta files were submitted to Sequest and Mascot, giving the
expected search results.
For your #4, do you know if the .dta to mzXML converter you are using
expect a specific .dta file name format like <sample>.<first
scan>.<last scan>.<charge>.dta? If so, and your .dta file names are
different than this format, perhaps that is contributing to the
scrambled/incorrect scan numbers in the generated mzZML.
Andrew.