There are some interesting issues that arise in this problem.
1. Taking a step back from the actual query (how to devise a
formula for use in a Mexican context) I wonder why one needs a
specifically Mexican formula for what is essentially a physical
relationship between physical measurements. True the
birthweight measurements themselves might all average out to be smaller
(say) than US averages, but conditional upon country, why should
the physical relationship {ultrasound --> birthweight | Mexico} have a
different formula than {ultrasound --> birthweight | US }
or {ultrasound --> birthweight | US } say, apart from the
intercept? Does the type of ultrasound measurements taken
differ from country? Is there some sort of genetic hypothesis that
Mexicans have (say) longer femurs for the same birthweight than US or UK?
2. Following on from 1, if there are formulas that
others have suggested, and perhaps they devised those by stepwise
regression, then your data provide a great opportunity to test
those formulas. Their formulas provide a genuine prior hypothesis for
you, and the p-values you get from testing the variables included in
their models should be (waving hands) a lot more valid than the p-values
from stepwise regression. It is at least as much a
contribution to science to test someone else's model for validity as it
is to come up with yet another of many suggested regression formulae
based on stepwise. (In fact there would be nothing to stop
you publishing both an evaluation and your own "best
stepwise" formula in the sense that it is a data summary that lets
your data speak for itself in the same way as other datasets have.)
So a.) one can compare others' overall models on your
data to see which does best. Use Mean squared error of
prediction. (perhaps standardise all variable first to avoid
needing an intercept)
b.) For variable selection, I would start by looking through
the ultrasound literature for the most commonly used and most significant
variable, test that in your data, then after including it (if
significant) check whether the next most common significant variable is
needed, etc., in sequence.
(Wiser heads than mine might suggest a reference for a better methodology
for assessing several competing models. )
3. To get a better understanding of your data I would suggest
doing principal components first, save the scores, then regressing those
scores on the birthweight. The first PC will probably be a measure
of overall size of the fetus, and have all numbers in the first column of
the component matrix (eigenvector coefficients of the transformation)
with similar values, indicating that all measurements tend to be
big together or all small together, and this will be
very significantly related to the birthweight. If you get any other
significant scores, then they will tell you whether particular
measurements (or combinations or ratios of measurements) are
related to the response in the sense that they tweak the first overall
size effect of the first PC. To understand any other
significant scores, look at the numbers in the corresponding column of
the component matrix. Numbers that are big (>0.3, say) and of
the same sign indicate important variables in the direction of increasing
or decreasing birthweight; number that are big and of opposite sign
may represent ratios that are of interest (e.g. bigger skull to femur
ratio may be important - total guess here since I know next to
nothing about ultrasound measurements. )
Hope something here is useful,
regards, Barry