Statistics of problem-solution pairs

David Stork

unread,

Mar 1, 2023, 8:57:46 PM3/1/23

to

Are there any large-scale simulation studies of the statistics of symbolic problem-solution pairs?

Let's consider just symbolic integration of some class of functions, for instance the class covered by the Risch algorithm (exponentials, logarithms, trigonometric functions, addition, subtraction, multiplication, and division). Suppose we quantify the "size" of an integration problem by the number of leafs in its tree-based representation, and likewise for the problem's anti-derivative.

Problems of a given size may have solutions spanning a range of sizes, of course. Thus there is some statistical distribution of the sizes, and thus a statistical relation (perhaps correlation) between problem and solution sizes.

So if we double the size of an integration problem will the size of its solution double? or increase faster than linear? or slower than linear? What about the variance in sizes? Or other statistics?

Naturally there is an infinite number of integration problems of a given size, so any simulation study will have inherent uncertainties. Nevertheless, this seems like an interesting, and potentially fruitful, class of simulation problems, for which we must use large-scale computer-algebra systems.

Anyone interested in working on this?

--David G. Stork

Albert Rich

unread,

Mar 1, 2023, 10:12:47 PM3/1/23

to

From my experience implementing symbolic integrators, there is little to no correlation between the size of integrands and their optimal antiderivatives. Derivatives of relatively small expressions can be huge. Conversely antiderivatives of relatively small expressions can be huge.

Albert

David Stork

unread,

Mar 2, 2023, 1:04:19 AM3/2/23

to

Albert,

My experience over decades suggests instead that there IS (or at least MAY BE) interesting structure in the relation between problems and solutions, and a principled exploratory study might give unexpected results.

Here's one:

Integrate[((c + d Tan[e + f x])^{5/2}(a + b Tan[e + f x] + c Tan[e + f x]^2))/(a + b Tan[ e + f x])^{9/2},x]

has a solution that takes 36 Mbytes to write out!

--David Stork

Albert Rich

unread,

Mar 2, 2023, 3:35:31 AM3/2/23

to

Your example validates my point. The antiderivative of this relatively small integrand is huge. But the antiderivative of many other small integrands are small. Thus no correlation.

I recommend you use parentheses rather than curly brackets around fractional exponents, since curly brackets designate lists in Mathematica.

BTW, the leaf size of the valid antiderivative Rubi (the Rule-Based Integrator) returns for your example is only 789 leaves. So if you are planning to use antiderivative leaf size in your research, I suggest using the size of optimal antiderivatives like those Rubi usually delivers. Rubi is freely available for downloading at https://rulebasedintegration.org/

Albert

Richard Fateman

unread,

Apr 9, 2023, 4:30:32 PM4/9/23

to

While there are easy ways of sometimes bounding the size of the derivative of an expression, you rapidly
run out of rules for more general "problems/solutions". For instance, here is a problem.
expand ((x+1)^(2^n)).
characterize the number of terms as a function of the integer n.

Simulation of random expression problems could (in general) provide super-exponential growth in size.

Another study of "statistics" might be -- given a free on-line computer algebra system that
does some task (you can pick indefinite integration), what is the distribution of inputs
from [random?] clients?
I suspect you will get (a) homework problems and (b) people just trying it out to see if
it works. People can ask ChatGPT to do math, and it sometimes gets the right answer.
It sometimes doesn't.
A (much) earlier experiment we ran at Berkeley TILU collected some problems
(maybe a few hundred?) and mostly found that people were unlikely to master the
first stage of the problem: getting the syntax right. Thus we collected stuff like
sin x, sinx, sin(x), Sin(x), Sin[x], SinX.

As for whether this is interesting or not, I would not expect simulation -- where you write
a program P to generate problems -- to reveal much other than the behavior of program P.
RJF