On Fri, Jun 22, 2012 at 9:18 PM, VincentAB <
vincen...@gmail.com> wrote:
> Although I wouldn't put things as strongly as Brendan, I share some of his
> feelings toward the statsmodels docs. From the perspective of someone who
> might like to contribute improvements to the docs and examples, I was
> wondering what your vision is on this. Do you mainly see the website docs as
> a "rich man page", or are you aiming for something more in line with Quick
> R:
http://www.statmethods.net/
both,
Man pages in the style of numpy and scipy are a minimum requirement
and what I'm used to.
More tutorial or Quick R documentation would be very desired.
>
> My view is that there is a definite need for a Quick statsmodels, but I'm
> wondering if such a thing should live on its own webpage or if you thing the
> statsmodels docs will eventually evolve in that direction. In the former
> case, I'll have to punt (too much work!). In the latter case, however, I can
> see many new adopters begin able to put together useful examples that nicely
> complement the man pages.
I was thinking along the latter after Brendan's comments.
New contributions to improve the documentation to make statsmodels
more accessible to newcomers would be very welcome. Vincent Davis was
making similar comments maybe 2 years ago, and started on a tutorial
(and laid the foundation for the summary method).
The main problem is manpower.
As long as it's mainly Skipper and I doing development, maintenance
and documentation, it's rather limited what we can accomplish in our
time. We have 4 GSOC students which are busy with their own projects.
I'm pretty happy if I can even keep the "man pages" up-to-date and
improving. Those are pretty much designed for an intermediate python
and statistics level. Large parts of my work between 0.4.0 and 0.4.1
was updating or improving this part.
(for example adding plots to docstrings:
http://statsmodels.sourceforge.net/devel/generated/statsmodels.graphics.boxplots.beanplot.html#statsmodels.graphics.boxplots.beanplot
)
There are parts where I tried to provide more of an overview, but I
didn't spend the time adding examples to docstrings or rst files
http://www.statmethods.net/stats/rdiagnostics.html
ours:
http://statsmodels.sourceforge.net/devel/stats.html#residual-diagnostics-and-specification-tests
http://statsmodels.sourceforge.net/devel/diagnostic.html#diagnostics
(my blog is more for work in progress than final results, e.g.
http://jpktd.blogspot.ca/2012/01/influence-and-outlier-measures-in.html
)
open issues (filed by myself):
https://github.com/statsmodels/statsmodels/issues?labels=comp-docs&state=open
In 0.4.0, we added examples that ipython and sphinx renders to html,
e.g.
http://statsmodels.sourceforge.net/devel/examples/generated/example_discrete.html
In this case also: I went through the examples, that have been written
as scripts before, to make sure they run and render without problems,
but I didn't go through the content to check whether the style is
really appropriate for the new format and has enough information for
newcomers.
Submission/contribution of examples by users would be very useful.
I've seen some tutorials for statsmodels on the web.
We have most of the infrastructure for the documentation now. We are
still missing a real "Getting Started" tutorial section, and many of
the module docstrings (big ones regression, discrete, genmod, ...) or
topics pages are very "skinny"
missing overview and recommendation of what to read for newcomers.
for example endog exog is explained/hidden here
http://statsmodels.sourceforge.net/devel/dev/package_overview.html#design
basic structure and import paths
http://statsmodels.sourceforge.net/devel/importpaths.html
missing examples in docstrings all over the place
one lacking part of the documentation is cross-linking
https://github.com/statsmodels/statsmodels/issues/278
to make it easier to find the relevant information.
So, the area where the documentation can be improved is pretty wide,
and we are open to adding more sections, tutorials, examples,
improvements, ...
numpy and scipy managed to get improved documentation through a large
community effort over the last several years.
We don't have an online editor for statsmodels, but, in the times of
git and github:
Pull requests are very welcome.
Josef
>
> Vincent