Only the "Custom mode" respects the custom data split settings. The others, like "Find global model" use heuristics to pick the training and validation sets automatically based on the total number of rows. I would recommend always configuring the custom mode, but for most people the automatic options should work well.
The percent of data used for history only affects searches that use the delay() or sma() building-blocks. The setting effectively controls the maximum delay allowed in the delay() or sma() functions. These require dedicating a portion of the data for history, otherwise they would reference rows before the first row. If you don't enable the delay() or sma() building-blocks, this setting is ignored and has no effect.
The software uses a time based seed for random numbers. The only fixed seed is for the data-splitting so that starting and stopping a search respects the previous split. We could make this configurable though, it's a good idea.
Michael