The average is not really what is wanted here. And, in fact, I did not get the same results as you. For (Split, Accuracy), and rounding accuracies to the nearest percentage, I got
(10%, 89%)
(20%, 90%)
(40%, 92%)
(60%, 94%)
(80%, 97%)
(90%, 97%)
It looks like -- as one would expect -- the more training data, the better the accuracy, up to an asymptotic accuracy of around 97%. But as the amount of training data increases, the amount of test data decreases, and -- as Activity 2.2 Questions 2 and 3 imply (and as one would expect) -- if there is not much testing data then the test is unreliable.
The question asks for the "true accuracy". Of the choices given in the question -- 50%, 90%, 95% and 100% -- I would choose 95%, because it's the closest to 97%. In fact, the file segment-test.arff contains a large number (810) of test instances, independent of the training set, and if you specify this under "Supplied test set" Weka uses the whole of the training set for training, and returns an accuracy of 96%.
Hope this helps
cheers
ian