splitting large files (via msconvert?)


Gautam Saxena

Jan 8, 2013, 4:26:14 AM
to spctools...@googlegroups.com
I have some large MS raw files (e.g. 500 GB or greater). Ideally, I'd like to split each one into "n" mzXML files of roughly the same size so that each X!Tandem process takes roughly the same amount of time. Is there a way to accomplish this elegantly using msconvert or some other technique? (For now, the MS files are Thermo raw files; however, they may be in other MS vendor formats that we can convert to mzXML using msconvert, or we may have the mzXML file itself as a starting point.)
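One possible way to approach the split described above is to divide the run into "n" contiguous spectrum-index ranges and convert each range separately with msconvert's "index" filter. The sketch below only plans the ranges and builds the command lines; it does not run anything. The helper names, the output file naming (part1.mzXML, ...) and the total spectrum count are assumptions for illustration, and equal spectrum counts only approximate equal file sizes or equal X!Tandem run times.

```python
def split_ranges(total_spectra, n):
    """Return n inclusive, 0-based (first, last) index ranges covering all spectra."""
    if n < 1 or total_spectra < n:
        raise ValueError("need 1 <= n <= total_spectra")
    base, extra = divmod(total_spectra, n)
    ranges = []
    start = 0
    for i in range(n):
        # The first `extra` chunks take one additional spectrum each.
        size = base + (1 if i < extra else 0)
        ranges.append((start, start + size - 1))
        start += size
    return ranges


def msconvert_commands(raw_file, total_spectra, n):
    """Build one msconvert command line per chunk (hypothetical output names)."""
    commands = []
    for part, (first, last) in enumerate(split_ranges(total_spectra, n), start=1):
        commands.append(
            f'msconvert "{raw_file}" --mzXML '
            f'--filter "index [{first},{last}]" '
            f'--outfile part{part}.mzXML'
        )
    return commands


if __name__ == "__main__":
    # Assumed example: a run with 10 spectra split into 3 chunks.
    for command in msconvert_commands("run.raw", 10, 3):
        print(command)
```

The total number of spectra has to be known beforehand (for example from a first conversion or from the mzXML index), which this sketch simply takes as an input.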

Gautam Saxena

Jan 8, 2013, 4:56:01 AM
to spctools...@googlegroups.com
Bonus: Similarly, is there a way to break a file up into "n" pieces so that each piece very roughly corresponds to a certain size (e.g. 100 GB)? Thus, if one MS file were ~220 GB, it would be split into 2 files of roughly 100 GB each; but if another MS file were ~540 GB, it would be split into 5 files of roughly 100 GB each.
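The arithmetic in the two examples above (~220 gives 2 pieces, ~540 gives 5 pieces) can be sketched as a whole-number division of the file size by the target size. Rounding down is an assumption chosen to match those two examples, and the function name is hypothetical; file size and target size just need to be in the same unit.

```python
def chunk_count(file_size, target_size=100):
    """Number of pieces for a file, rounding down but never below one piece.

    Assumption: rounding down reproduces the examples in the post
    (220 -> 2 pieces, 540 -> 5 pieces). Both arguments share one unit.
    """
    if file_size <= 0 or target_size <= 0:
        raise ValueError("sizes must be positive")
    return max(1, int(file_size // target_size))


if __name__ == "__main__":
    for size in (220, 540, 50):
        print(size, chunk_count(size))
```

With this rule the pieces come out slightly larger than the target (for example 220 / 2 = 110 each), which fits the "very roughly" in the question.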

Gautam Saxena

Jan 8, 2013, 4:59:29 AM
to spctools...@googlegroups.com
FYI: I meant to write "MB" wherever I wrote "GB".

Brian Pratt

Jan 8, 2013, 11:46:17 AM
to spctools...@googlegroups.com
Not sure how helpful an answer this is, but (as you perhaps already know) there are several parallelized versions of X!Tandem that do that for you.

Brian


Eric Deutsch

Jan 8, 2013, 11:57:40 AM
to spctools...@googlegroups.com, Eric Deutsch

There’s also a recent paper that describes such a strategy:

http://pubs.acs.org/doi/abs/10.1021/pr300561q

Magnus....@gmail.com

Feb 1, 2013, 9:47:23 AM
to spctools...@googlegroups.com, Eric Deutsch
Yes, and the mzXML decomposer is available here: http://www.ms-utils.org/decomposition/. Please let us know if you encounter any problems running it.

~Magnus