I'll reply on-list to give some closure to this thread for anyone interested in the problem.
I asked Daniel to send me one of his >2GB mzXML files to take a look at myself. If it wasn't a Windows 32-bit binary issue, I suspected that the problem wasn't with Tandem, Comet or the TPP tools (as I recall we dealt with large files years ago) but rather it was with the conversion program itself.
Running "tail t1.mzXML" returned:
<offset id="41225">3</offset>
<offset id="41226">3</offset>
<offset id="41227">3</offset>
<offset id="41228">3</offset>
<offset id="41229">3</offset>
<offset id="41230">3</offset>
</index>
<indexOffset>1</indexOffset>
<sha1>23609787a67e3997d93fc3e1bcde4015474eeae6</sha1>
</mzXML>
And the first thing that jumps out is the indexOffset value is completely wrong and this would cause all tools to not be able to read this file. A quick fix is to run the TPP's "indexmzXML" tool on this file to re-index the file which will also generate a correct index offset value:
indexmzXML t1.mzXML
After running this command and re-naming the generated "
t1.mzXML.new" file, I was able to read the mzXML file using both readmzXML and Comet. Anyways, something needs to be fixed with qtofpeakpicker to write correct >2GB mzXML files. Minimally it needs to be a 64-bit binary. A feasible but poor workaround is to simply run indexmzXML on each file.
- Jimmy