Hi Daniel:
Thank you for the interest.
There are few ways to solve the problem with clustering and classification of time-series when, as you pointing, the sequences (ordered sax words collections) or dictionaries (unordered word frequency tables) are of the different size - i.e. when obtained with the use of a sliding window and a numerosity reduction.
SaxMinDist is the function you would typically use for the numerosity reduction. I am not sure about its performance for raw classification/clustering.
One way would be to use SAX words frequency vectors - basically each of time series is converted into the numerical frequency vectors. You will have something like follows (there first column is all the words, right columns represent words occurrences in the series 1,2,...). Then you can use whatever distance you like to these vectors: Euclidean, Humming etc.
words | series1 | series 2| ...
aac | 0 | 5 |
bba | 2 | 3 |
....
ccc |5 | 0 |
Well, there is yet another way too.
The implementation of all these techniques is not very difficult, you could reuse a lot of my code for these three techniques, but it will require some time to get through it. Once you'll checkout the code and will be able to build it in eclipse, I can help with directions/issues.