Hiya.
The worst thing to do is post something that's uninformative
(technical term :), but I wanted to tack on my own question to this
community.
David's approach, of immigrating to the schools where the authors of
your favorite papers and books are, is the way to go. There are plenty
of such folks that are made famous by
videolectures.net and iTunes U
that I'm aware of: Michael Jordan (lot of nonparametric Bayesian
statistics, Berkeley), Larry Carin (ditto, lots of applied govt work,
Duke), David Blei (ditto, inventor of LDA for document analysis, now
at Princeton), Alan Willsky and John Fisher (statistical signal
processing and statistical inference, MIT), as well as numerous others
who have made major contributions in their own practical fields
(radar, finance, geology, etc.).
But straight grad school is a very risky proposition, and as you say,
a certificate would be much more attractive to a hacker practitioner
with domain knowledge than taking real analysis classes while finding
a chunk of knowledge for your thesis/dissertation. We sometimes forget
that the primary task of professors who take on graduate students is
to create the next generation of professors for the academy, so
seeking a PhD usually increases the odds of getting into any of these
world-renowned programs.
Stanford is making a great entrepreneurial gamble by offering
certifications, and the fact that David is considering it is very
encouraging (I'm getting a PhD in statistical signal processing, and I
don't know how straight data mining works). I'd be curious to know if
anyone else is offering such programs, or if there are grad programs
that focus on practical applications of statistical learning (since
the standard formula for a stats degree in most schools is lots of
classic math and stats, with a couple of applied/programming courses
in R or Matlab---forget Hadoop/Mahout at these).
Which brings me to my question. How does someone from the "traditional
research" section (math & domain expertise), with a good handle on
basic programming of large, complicated algorithms (e.g., in Matlab or
Python, maybe Clojure :P), get to the next level of hacker skill, of
cloud/cluster computing for complicated linear algebra/mathy
algorithms on large datasets with Hadoop, or however? It seems the
barriers are somewhat high (my game plan, from studying Incanter, is:
step 1, learn Java), but I'm perfectly willing to be told they're not,
in which case I'll redouble my efforts to learn how to do so.
Thanks, and best of luck, and apologies for my off-topic-ness.
Ahmed