'Fixing' the spec is not the right way to approach this, and sub-request concepts are complicated and often flavored with implementation opinions (ie. this works easily in java and python, so why not include it).
You essentially want a custom method, which understands parsing/processing/ordering/processed-response-injection. This did come up in the old news group *ages* ago. There are too many variants/desired-expected-fallback-behaviors to settle on one implementation, and nothing about this fits with "It is designed to be simple!".
At the time, I think the suggestion to implement the composition-structure-processing method, build up your composition/logic structure, make a single call with that structure, have the server method be able to understand it and process it and return a single result. Moving this down into the base spec layer offers little additional advantages.
That said, I'd encourage you to implement it and see how the you find the results/experience, and come back here to share that and promote such a change if you still feel as strongly. I am not in favor it for the spec, but in the right situations the concept is interesting, but saving a network call is usually not the reason to do this.
Real world (ish) examples are appreciated, since sum is not really a compelling example of such a need. :)