Speeding up AstroPy?

289 views
Skip to first unread message

Francesco Maggiore

unread,
Nov 16, 2020, 9:51:12 AM11/16/20
to astro...@googlegroups.com
What about an AstroPy version speeded up in calculations through the use of GPUs and tools such as nvc++ and Cython?

Kris A. Stern

unread,
Nov 16, 2020, 10:59:39 AM11/16/20
to astro...@googlegroups.com
Think GPU use could caused some unplanned for costs. Cython is already in use in some way. 

K.S. 

On Mon, 16 Nov 2020 at 22:51, Francesco Maggiore <frama...@gmail.com> wrote:
What about an AstroPy version speeded up in calculations through the use of GPUs and tools such as nvc++ and Cython?


--
You received this message because you are subscribed to the Google Groups "astropy-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to astropy-dev...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/astropy-dev/CABi%2B6AvEC5vzSDrUf6mLasf%2Bj2i48Unao1BZe_n2t%3DrUU%3DMAAg%40mail.gmail.com.

Kris A. Stern

unread,
Nov 16, 2020, 11:20:38 AM11/16/20
to astro...@googlegroups.com

Thanks for bringing this to our attention though. 

K.S.

Francesco Maggiore

unread,
Nov 17, 2020, 8:35:39 AM11/17/20
to astro...@googlegroups.com
Indeed, Kris. It was precisely the reading of articles like that one that recently aroused my interest, even if for years now I have started to mount Nvidia graphics cards on my workstations, and to try  exploring the HPC perspectives through the exploitation of CUDA cores.

Kris A. Stern

unread,
Nov 17, 2020, 8:37:28 AM11/17/20
to astro...@googlegroups.com
That sounds very interesting indeed...

Francesco Maggiore

unread,
Nov 18, 2020, 9:09:28 AM11/18/20
to astropy-dev
I believe that there is no doubt about the possibility of speeding up some routines through the Nvidia cards. The central point of the matter, as far as I'm concerned, is that most of these opportunities are currently open only to those who work under Unix, while they remain precluded to who - like me - prefer to work in a Windows environment.




Kris A. Stern

unread,
Nov 18, 2020, 9:14:49 AM11/18/20
to astro...@googlegroups.com
Maybe you could look into using NVIDIA GPUs for gaming laptops, sine work on Windows... 

On Wed, 18 Nov 2020 at 22:09, Francesco Maggiore <frama...@gmail.com> wrote:
I believe that there is no doubt about the possibility of speeding up some routines through the Nvidia cards. The central point of the matter, as far as I'm concerned, is that most of these opportunities are currently open only to those who work under Unix, while they remain precluded to who - like me - prefer to work in a Windows environment.


<img src="

Kris A. Stern

unread,
Nov 18, 2020, 9:15:48 AM11/18/20
to astro...@googlegroups.com
Typo: sine->these

E. Madison Bray

unread,
Nov 22, 2020, 1:18:43 PM11/22/20
to astropy-dev
On Mon, Nov 16, 2020 at 3:51 PM Francesco Maggiore <frama...@gmail.com> wrote:
>
> What about an AstroPy version speeded up in calculations through the use of GPUs and tools such as nvc++ and Cython?

Hi Francesco,

There are several parts in Astropy that could possibly benefit from
using the GPU. Convolution comes to mind. However, unless you
suggest a specific use case you have in mind this discussion can't go
beyond idle speculation. Is there a use case you have that could
benefit from using the GPU?

Best,
Madison

Kris A. Stern

unread,
Nov 22, 2020, 2:59:47 PM11/22/20
to astro...@googlegroups.com
He said it is for a performance boost already. 

--
You received this message because you are subscribed to the Google Groups "astropy-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to astropy-dev...@googlegroups.com.

Aldcroft, Tom

unread,
Nov 22, 2020, 4:55:15 PM11/22/20
to astropy-dev
On Sun, Nov 22, 2020 at 2:59 PM Kris A. Stern <krisa...@gmail.com> wrote:
He said it is for a performance boost already. 

Hi Kris,

I believe Madison was asking for a specific calculation or a particular function in Astropy that could benefit from the use of GPUs. I will second their point that the discussion needs to have a narrower focus in order to help the developers see how to make Astropy more useful in research that demands such performance.

Best,
Tom
 

Kris A. Stern

unread,
Nov 22, 2020, 5:08:29 PM11/22/20
to astro...@googlegroups.com
Aren’t we (or Madison) in a better position to provide recommendations as to how to improve overall Astropy performance according to the article cited? I thought he/she should have the expertise to make such a judgment call, instead of blindly asking for some prescriptions from the user. I don’t see in Madison’s email any mentioning if either the performance issue or the original article, hence was concerned, since GPUs are in general used to accelerate graphical stuff like ones in computer vision. He/she did say there would be no further discussion is the user cannot provide further info for discussion, however, which does now seem very reasonable for me. 

Kris A. Stern

unread,
Nov 22, 2020, 5:09:24 PM11/22/20
to astro...@googlegroups.com
Typo: *now -> not. 

Kris A. Stern

unread,
Nov 22, 2020, 5:21:17 PM11/22/20
to astro...@googlegroups.com
While we are at it, we might as well also discuss the hardware requirements, since not everyone has access to GPUs like ones offered by NVIDIA graphics card. Honestly it depends on the AI software like the Intel OpenVINO toolkit to bridge the gap. In this case say my IRIS GPU in my Apple MacBook Pro suffice as a GPU... but if I have some other software then my machine is just some plain CPU. So are we or are we not going to get into AI could be a determining factor of whether to incorporate such to be “idle” discussion as well. Why could you just ask for clarification instead of assuming I am not trying to chip in my two cents? I am very concerned indeed. 

Kris A. Stern

unread,
Nov 22, 2020, 5:35:43 PM11/22/20
to astro...@googlegroups.com
I do however feel very strongly that Astropy appears to not value junior member input very highly or have the desire to groom a new generation of computational specialists to allow the sustained and further and future growth of Astropy, judging from the many personal experiences contributing to the ecosystems like participation in the discussion of this thread here. Which is fine to me since what else can I say or do anyway. I see way too many times our GSoC student developers never get promoted to any more visible positions for real codebase development upon successful completion of their projects, which truly is a disappointment on its own merits. I am simply asking can I get the basic respect from a fellow member in this instance. Maybe Tom you do have some feedback to make in this regard? 

K.S. 

Kris A. Stern

unread,
Nov 22, 2020, 11:09:57 PM11/22/20
to astro...@googlegroups.com
Hi Francesco,

Meant to send this to you earlier, but were too afraid. Now what the heck! All my guy friends wanted one of these for GPU-enabled CUDA-capable ML training on a local machine: https://www.pcmag.com/reviews/razer-blade-stealth-13-2020

It's reasonably priced while meets the minimum requirements, especially the one with the 4K screen. 

Cheers,
Kris 

Kris A. Stern

unread,
Nov 23, 2020, 2:31:36 AM11/23/20
to astro...@googlegroups.com
Also Francesco,

There is a package called astroML for offloading some machine learning tasks in astronomy in case you are not familiar with it, which originated from a book. Also there is PyTorch and Tensorflow + Keras if you plan to do ML work in astrophysics like me. (Plus something like scikit learn for more basic modules.)


Cheers,
Kris

Aldcroft, Tom

unread,
Nov 23, 2020, 8:36:12 AM11/23/20
to astropy-dev
Hi Kris,

I'm very sorry you feel this way and feedback from your experience in the Astropy project is very highly valued.

One of the highest priorities in our project has been to expand the number of core developers, helping people go beyond making a few pull requests and become regular contributors to either the core or an affiliated package. This has been the subject of much discussion at the annual developer coordination meeting and is one of our highest priorities. Unfortunately this is a challenging problem that we have not succeeded in addressing. Ideas and input about how to make this happen are most welcome.

At this point Astropy is still substantially a volunteer project and mostly a "do-acracy", which is to say that people who step up and make pull requests and comment on issues and get involved in discussions, those are the ones who end up being formally named as maintainers (what you might call "promotion"). Nevertheless we greatly encourage contributions, both small and large, from all members of the community.

The reality that many GSoC students have not transitioned to being long-term active members of the Astropy community is a tremendous disappointment to us as well. If you or others have any ideas on how we can make this happen then we would love to hear them.

- Tom

Francesco Maggiore

unread,
Nov 23, 2020, 9:20:03 AM11/23/20
to astro...@googlegroups.com
Dear Kris,
I am certainly glad to have stimulated a debate on this topic, even if it was certainly not my intention to provoke it!
However, I feel the need to clarify one thing: although I find very interesting ML researches in astrophysics like the one I'm learning now that you are carrying out, the subject of the question I asked the community about a week ago was completely different. The reason why I personally approached the world of GPUs, and in particular of Nvidia's cuda-cores, is not ML at all, but HPC, as those same cuda cores, as well as as inference engines, can be used as mathematical coprocessors to speed up complex but non-recursive calculations, such as those that in astrophysics - and in particular in positional astronomy, which is my specific field of interest - often have to be done.

Best wishes,
Francesco

Francesco Maggiore

unread,
Nov 23, 2020, 9:20:03 AM11/23/20
to astropy-dev
Dear Kris,
I am certainly glad to have stimulated a debate on this topic, even if it was certainly not my intention to provoke it!
However, I feel the need to clarify one thing: although I find very interesting ML researches in astrophysics like the one I'm learning now that you are carrying out, the subject of the question I asked the community about a week ago was completely different. The reason why I personally approached the world of GPUs, and in particular of Nvidia's cuda-cores, is not ML at all, but HPC, as those same cuda cores, as well as as inference engines, can be used as mathematical coprocessors to speed up complex but non-recursive calculations, such as those that in astrophysics - and in particular in positional astronomy, which is my specific field of interest - often have to be done.

Best wishes,
Francesco

Kris A. Stern

unread,
Nov 23, 2020, 9:39:11 AM11/23/20
to astro...@googlegroups.com
Hi Francesco, 

That’s even a more interesting topic as I don’t think many are currently tackling it. Does it happen to be something in the vein of Manodeep’s Corrfunc package?: 
https://corrfunc.readthedocs.io/en/master/index.html. I think he strives to use clusters of CPUs for carrying out n-way matching, which is pretty cool in its own right. There has been talk of some GSoC 2020 work to be carried out to establish an Astropy package for that, but unfortunately the project was cancelled due to COVID. However, if this counts as positional astronomy that definitely more discussion will surely lead to a better outcome for the plans if any future work is to be initiated. 

Cheers, 
Kris 

E. Madison Bray

unread,
Nov 23, 2020, 11:30:23 AM11/23/20
to astropy-dev
On Sun, Nov 22, 2020, 22:55 Aldcroft, Tom <tald...@gmail.com> wrote:

On Sun, Nov 22, 2020 at 2:59 PM Kris A. Stern <krisa...@gmail.com> wrote:
He said it is for a performance boost already. 

Hi Kris,

I believe Madison was asking for a specific calculation or a particular function in Astropy that could benefit from the use of GPUs. I will second their point that the discussion needs to have a narrower focus in order to help the developers see how to make Astropy more useful in research that demands such performance.

Tom read me as intended here. I'm sorry if anything in my wording suggested a desire to stifle discussion. To the contrary I think it's an interesting discussion to have, and wanted to stimulate it by asking if there was a specific use case in mind, e.g. a specific research task using existing code in Astropy on such a volume that the performance gains of using a GPU (if even feasible for this hypothetical task) would be with it. 

I did cite convolution as one such possibility that stands out as being clearly benefitting of GPU optimization if someone needed to use it for a lot of data. The existing convolution code is also already written in Cython and could be fairly easily adapted to, say, using C++ compiled with nvc++. 

But in general I think many Astropy developers are aware of the benefits of parallel computing (with or without GPU) computing as well as (at least some of) the wide variety of libraries and tools for achieving it, as well as the fact that Astropy could potentially benefit. E.g. I have thought about it before in the context of astropy.modeling. Depending on different areas of the package it could be relatively easy to do, or it could take months. So it's helpful to know if Francesco (or anyone else reading) has a specific case in mind to focus on, from which things like potential implementation details can follow, and if there's strong enough interest for anyone to actually take on the work. 

Best, 
Madison

Marten van Kerkwijk

unread,
Nov 23, 2020, 11:36:04 AM11/23/20
to astro...@googlegroups.com
Hi Francesco,

Speeding up astropy would be great, whether through GPUs or more plainly
by finding the bottlenecks and solving those (still quite a few around,
especially in astropy.coordinates, which might be particularly relevant
for you...).

For GPUs or other tricks, I think the great difficulty is to ensure that
code remains agnostic to the processor or way to switch between
processor -- one would want to avoid to become bound to CUDA or NUMBA.

Over at numpy, there has been quite a bit of effort to play nicer with
packages like dask, which led to the introduction of at least the
possibility to override how things get done (using __array_function__).
For astropy, one might want to similar see if one can get the current
code to allow using different containers for arrays (starting with
Quantity...). Sadly, I don't think this is easy, and am also not sure
astropy isn't too high up, i.e., whether efforts are not better spent on
numpy (which astropy would immediately benefit from), except perhaps in
a few parts where we've rolled our own low-level code (like for
convolution), where also simpler things like making use of CPU
vectorization may still be avenues for improvement.

All the best,

Marten

--
Prof. M. H. van Kerkwijk
Dept. of Astronomy & Astroph., 50 St George St., Toronto, ON, M5S 3H4, Canada
McLennan Labs, room 1203B, tel: +1(416)9467288, fax: +1(416)9467287
mh...@astro.utoronto.ca, http://www.astro.utoronto.ca/~mhvk

Kris A. Stern

unread,
Nov 23, 2020, 12:21:15 PM11/23/20
to astro...@googlegroups.com
Just wanted to chip in to say blog articles like https://developer.nvidia.com/blog/accelerating-python-on-gpus-with-nvc-and-cython/ where this discussion originated is highly specialized as to both the hardware and the software used, to the point where it prescribes a very narrowly focused recipe for one to follow as is. More than not the case, it will not work if one takes liberty from it. Hence, this discussion at the outset already has a very well as well as restrictive (perhaps even prohibitive) scope. This is exactly why I could not comprehend why it has to do with the architecture or design of astropy per se, except maybe on its performance, as prescribed in the article. 

K.S.

--
You received this message because you are subscribed to the Google Groups "astropy-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to astropy-dev...@googlegroups.com.

Francesco Maggiore

unread,
Nov 23, 2020, 12:36:33 PM11/23/20
to astro...@googlegroups.com
Dear friends,

I am very grateful to everyone who has spoken on the subject, and I am very pleased to see that this community is always open to new inputs, which is the only way to grow that I know!
I find the suggestion Kris provided to me in the first message interesting, but unfortunately the link she gave me doesn't seem to work, while I totally agree with what she says in the last one: I was the first to suggest reading the article by Nvidia, but I'm also the first to think that they pretend to impose their way of dealing with problems; I also agree with Madison when he points out that there are actually some mathematical libraries that could benefit most from the use of parallel computing; and of course I also agree with Marten when he says that the pursuit of greater speed must not make us lose sight of the bottlenecks that still remain unsolved.
What to tell you? If I have posed the problem it is obviously because I do not have the solution, and therefore I can only hope that my initiative can be a stimulus for the whole community!
Thanks again everyone,

Francesco


--
You received this message because you are subscribed to the Google Groups "astropy-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to astropy-dev...@googlegroups.com.

Kris A. Stern

unread,
Nov 23, 2020, 12:54:11 PM11/23/20
to astro...@googlegroups.com
No worries, it's been a good discussion. One thing I cannot help but notice though: The link of the article definitely worked and still works for me.  

Cheers,
Kris 

Kris A. Stern

unread,
Nov 26, 2020, 3:59:44 PM11/26/20
to astro...@googlegroups.com
Hi Tom,

If we look at the stats provided by GitHub: https://github.com/astropy/astropy/pulse... 

Screenshot 2020-11-27 at 4.58.10 AM.png

It appears that "excluding merges, 13 authors have pushed 45 commits to master and 97 commits to all branches" over the period spanning one week over November 19, 2020 – November 26, 2020. We have in total approximately 160 members serving the community, yet only about 1 in 12 have significantly contributed, and many of these are the core members. Perhaps this low number might have had to do with the recent release of version 4.2 of astropy/astropy. However, this low number is clearly a red flag to me in terms of the future sustainability of the Astropy Project. It looks to me most repo maintainers within the org are faculty members, who normally are overwhelmed by other duties such as publishable research. Perhaps it is time for Astropy to learn to trust a generation of younger developers and to let them have the opportunity to become co-maintainers as well, such as PhD students and Postdocs who normally would have more free time to spare (at least theoretically) and are equally competent at the tasks required? This appears to me the most obvious solution moving forward. Otherwise we will become another org that is simply "too big to fail". 

Cheers,
Kris 

Aldcroft, Tom

unread,
Dec 1, 2020, 9:24:21 AM12/1/20
to astropy-dev
Hi Kris,

It should be noted for the record the Astropy project has had quite a number of people at the postdoc or student level who have become maintainers, including former GSoC students. This sort of knowledge does require a long view of the project over its almost 10 year history, including people that have come and gone (e.g. due to leaving academia). Looking at a one-week snapshot of a 10 year project is bound to give a distorted view. I would suggest doing analysis on at least a year of activity if you'd like a clearer picture of where we stand at present.

As far as process, In order to be nominated as an astropy core maintainer, it is typical that a person will have been actively involved in the project for a considerable amount of time (at least a year and often longer) and have made numerous substantial code contributions.  They will have shown a sustained commitment to Astropy by participating actively in pull request reviews and discussions in other channels (mailing lists, slack, etc). Finally, they need to express a desire to maintain this involvement going forward and accept the responsibility of being a maintainer (interacting with users, responding to bug reports in a timely manner).  This is how trust is established, and that trust is entirely independent of age, gender identity, race, and employment status.

You are absolutely correct that recruiting maintainers and keeping them in the project is a key to future sustainability and we are keenly aware of this issue. Astropy is very much labor limited and we are thrilled when a person arrives to the project and starts making sustained contributions in any of the defined roles (including the roles that are not primarily about writing code).

Short version: make sustained and positive contributions to the Astropy project and community, and your voice will be heard. If you want the responsibility then you will be recognized with a formal role in the project.

- Tom


Marten van Kerkwijk

unread,
Dec 1, 2020, 9:50:16 AM12/1/20
to astro...@googlegroups.com
Hi Tom, others,

It would probably be good to have this short summary of how to get more
involved as a separate link in the "Developer Documentation" - I've been
asked at times how to proceed and answered essentially what you wrote,
but it might as well be explicit somewhere. The only thing to be clear
about is that it doesn't have to be code - it could be documentation,
infrastructure, tutorials, etc.

All the best,

Marten

Kris A. Stern

unread,
Dec 1, 2020, 9:52:47 AM12/1/20
to astro...@googlegroups.com
Hi Tom,

Thanks for the thoughtful response. To be honest the one week analysis is to give us a sense of the most recent snapshot at what really happened in a week’s time. It is not to be confused with some in-depth cross sectional study. 

From what I have personally experienced for the past two years since becoming active on Astropy, I do observe things that perhaps from a perspective you are not privy to. Overall, I expect that if Astropy is to grow into a bigger organisation which we expect it to be due to demand from the community, we will need more maintainers per package beyond the average two to three (active ones) we have now within the ecosystem. 

Also from what I have observed from a PhD student’s perspective, we really do need to step up and cultivate a new generation of To-be-core developers for Astropy. I have observed none of our recent GSoC alums opted to stay and contribute due to various reasons for the past three years in a row at least. This is not such a good sign. 

In summary, if we simply do not give for lack of a better way of putting it our best effort in motivating new members to stay and to contribute, but instead would rather opt for asking too much from new comers and in the process setting up barriers to their eventually acceptance is simply not an acceptable attitude for an open-source organisation, especially one of our reputation. This is also against the principle of inclusiveness which is one we have adopted. 

Thank you for your attention to this matter! 

Cheers, 
Kris


Peter Erwin

unread,
Dec 1, 2020, 11:49:34 AM12/1/20
to astro...@googlegroups.com
Hi Kris,

> On Dec 1, 2020, at 3:52 PM, Kris A. Stern <krisa...@gmail.com> wrote:
>
> Hi Tom,
>
>
> Also from what I have observed from a PhD student’s perspective, we really do need to step up and cultivate a new generation of To-be-core developers for Astropy. I have observed none of our recent GSoC alums opted to stay and contribute due to various reasons for the past three years in a row at least. This is not such a good sign.

How much should we expect this to happen, though? That is, is the “failure” of GSoC alums to continue contributing to astropy unusual in the context of GSoC projects, or even GSoC projects for scientific software? I have the impression that there is some research in this area that might be worth looking into. For example, this 2017 paper by Silva et al. ("How Long and How Much: What to Expect from Summer of Code Participants?”) notes that while about 43% of “newcomers” kept contributing more than a month past the end of the GSoC, only about 16% did so more than a year after the end. (So is astropy doing better or worse than average on this score?)

http://www.igor.pro.br/publica/papers/ICSME2017.pdf

(Arguably, the primary purpose of GSoC and similar projects is not so much “recruit new long-term contributors to projects” as it is something like “provide useful training and experience for students + contributions to a particular sub-project”. So assuming we *should* be getting new long-term contributors from GSoC might not be the best way to view astropy’s GSoC experience.)


cheers,

Peter

=============================================================
Peter Erwin Max-Planck-Insitute for Extraterrestrial
er...@mpe.mpg.de Physics, Giessenbachstrasse
tel. +49 (0)176 2481 7713 85748 Garching, Germany
fax +49 (0)89 30000 3495 https://www.mpe.mpg.de/~erwin




Kris A. Stern

unread,
Dec 1, 2020, 12:03:13 PM12/1/20
to astro...@googlegroups.com
Hi Peter,

I feel that’s a question for you or the Astropy core community to answer. It wouldn’t be appropriate to ask me since it’s like blaming the victim here. BTW, since being made a core member of SunPy recently after two GSoC projects with them, as well as having the privilege to either converse with or befriend all the Astropy GSoC alums from 2018 after, I do feel the pain for them more than for myself. I have always been lucky that way. Please don’t attack me personally for a fault that you are a part of implicitly by some research or another. We are people too, not some stats. 

Thanks,
Kris 

--
You received this message because you are subscribed to the Google Groups "astropy-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to astropy-dev...@googlegroups.com.

Marten van Kerkwijk

unread,
Dec 1, 2020, 1:02:29 PM12/1/20
to astro...@googlegroups.com
Hi Kris,

Before anything else: I think I speak for all of us if I ensure you that
nothing written here is meant personally or as an "attack". You bring
up good points and we want to improve, but we are, like everyone else,
limited by the number of hours in the day. Furthermore, many of us are
volunteers, and at least for me working on astropy is partially
something that I like because I can do something concrete and
constructive *myself* -- I don't mind supervision in principle, but my
day job has a lot of it already...

For GSoC, the main point Peter made seemed simply that one should not
assume that the goal is to train maintainers. And this is certainly
true for some: one student Tom and I jointly supervised made a great
contribution to astropy, but for her being part of GSoC and working on
astronomy was important in part to make connections and switch from her
background in computer science to astronomy. And it succeeded as it
helped her to get into graduate school here at UofT, where she is doing
very well. I think we can all agree that it is completely fine that she
has not contributed further to astropy: GSoC helped both her and astropy.

But what is worrying is that, from what you write, it seems like there
are other specific students who made good contributions who would have
wanted to continue to work with astropy, but we let that opportunity
slip. Do you have practical suggestions on how to improve that? (Beyond
the simplest possible, to lay out what the process actually is...)

A larger problem may be that astropy is becoming quite well developed,
which means that it requires increasingly higher skill levels to tackle
issues. And we are learning the hard way that every increase in the
code comes with increased maintenance... As a result, it is becoming
less and less clear that GSoC is a suitable programme for astropy (same
reason numpy doesn't really do GSoC) -- a GSoC on *trimming* code is not
very exciting....

But of course this all just makes clearer that more help would be
useful....

Anyway, as noted somewhere in the longwinded e-mail, concrete
suggestions are most welcome!

Erik Tollerud

unread,
Dec 2, 2020, 2:03:29 PM12/2/20
to astro...@googlegroups.com

To echo some of what Marten said above: your feedback is appreciated, Kris, and anything below is not meant as a blame or anything like it, but rather some explanation and exploration of potential solutions.

So with that in mind, I think I agree it's a *goal* but not an *expectation* for GSoC to lead to maintainers. I've mentored several GSoC students over the years, and in practice some have had the interest and  motivation to continue, while some have not. In my mind it's a lot like a "first project" that some PhD programs do - you work with a potential advisor to see if it's a good fit, but if not (and if the department is well-run...), you can switch to another one and try again. It's hard to know if you work well in a given environment until you try it! So in my mind we only expect some fraction to continue at all, and a subset of those to grow into a maintainer role. That's OK and even good to not force people to do things they aren't enjoying, as long as everyone knows that going in (which is a thing we need to fix if you don't think that understanding is known, Kris!). 

Having said that, I do think we have a bit of a GSoC issue that we've been struggling to get enough projects and bandwidth of mentors (certainly I *personally* have been struggling with this the last couple cycles). So it wouldn't be surprising if that leads to other problems, like either not communicating clear enough expectations of post-GSoC roles, or ensuring that the "right fit" process I'm describing above doesn't turn into a *biased* process (which is a very real danger we should be aware of!). I don't want to say too much about concrete fixes given that I haven't been able to mentor the last couple cycles, but I certainly would be interested in ideas of fixes if there's an issue there.


More broadly, as Tom said, it's been a long-term goal with at best uneven efforts to boost our contributor-to-maintainer pipeline. I posted an article about this on slack a few months ago about how this is definitely not unique to Astropy but is a broad problem across open source.  So there probably are not quick fixes, we have to keep working on this all the time. A good example of this is an idea that has come up several times as something pretty much everyone wants to see happen: a mentorship program to shepherd contributors into maintainer roles. But thus far no one has been able to summon up the bandwidth to make it happen since this sort of thing is very real work. I wonder if it would be useful to learn from this conversation and begin that program *specifically* focusing on past or incoming GSoC students?


So while there's probably no quick fixes on this, to do at least *something* on it right now, Tom A and I pushed through an update to the Astropy front page to give a bit more guidance on how to move into some of the more formal roles - see https://www.astropy.org/contribute.html#formal-project-role . Feel free to suggest clarifications or additions to that in the astropy.github.com repo!


And one other thing I wanted to mention: while feedback in public like this is definitely welcome and good for our community, anyone who wants to bring something up that they do *not* feel is suitable for public discussion can always send it to confid...@astropy.org, which goes to the ombudsperson (who may show it to the coordinating committee only if there's not a conflict of interest).

---
Erik T


--
You received this message because you are subscribed to the Google Groups "astropy-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to astropy-dev...@googlegroups.com.

Kris A. Stern

unread,
Dec 7, 2020, 5:13:39 AM12/7/20
to astro...@googlegroups.com
Hi Marten and Erik,

Thanks for the kind words and apologies for the delay in replying since I got sidetracked by some unforeseen circumstances. 

I agree with all of your concerns and find them valid actually. But the reality is that we need to secure financial support for some sustained form of apprenticeship-style training in order to recruit and absorb calibre software developers who are ideally also a member of the academia to improve our talent pool, while hoping against all odds that they would be willing to sacrifice a handsome chunk of their life possibly without pay and on a voluntary basis (much like what I am doing actually) to commit to the Astropy project. Our only options at this stage are likely Google Summer of Code and Outreachy, both paid internships supported by private corporations to facilitate engagement in open-source software development. 

Alternatively, I see the recent proposed funding as administered by our former ombudsperson Steve Crawford now with NASA as a potential funding opportunity for some such efforts of active engagement or recruitment (or whatever it is called  based on the actual implementation) like the ROSES.grant(s) opportunity. 

Anyway I feel it will probably take someone of a more senior position as well as with voting rights to perhaps follow this up. I am always open to dialogue however. 

Cheers,
Kris 

E. Madison Bray

unread,
Dec 10, 2020, 2:09:39 PM12/10/20
to astropy-dev
Hi all,

I've been meaning to follow up to this interesting discussion with a
small point for a while, but have been under water with other things.
I think most of the points I was otherwise going to make have been
addressed by others.

One thing I wanted to mention is that the in-progress draft of APE-0
[https://github.com/astropy/astropy-APEs/pull/61] does lay out some
guidelines for how one becomes a "Voting Member" of the project aka
core maintainer. I was thinking (based on memory) that while APE-0
laid out the rights and responsibilities of Voting Members it didn't
have much to say about how one becomes one. Actually that's not true;
on re-review it does list some minimum requirements.

However, it doesn't necessarily serve as a guide to someone who would
be interested in actually trying to meet the requirements.

To this end I was reminded that the Python developer documentation has
a contributor-oriented guide on how to become not just a contributor
(that's separate) but specifically how best to go about becoming a
"Core Developer". I think a lot of this might be useful to adapt into
a similar guide (separate from APE-0):
https://devguide.python.org/coredev/ In particular, it interestingly
proposes someone serve as a "mentor" for someone newly given commit
rights to the repository. As any GSoC mentor can tell you this can be
a non-trivial amount of time to ask of someone, though in this case
it's typically reserved for someone who's already shown their mettle
in other issues, and probably doesn't need a lot of "hands-on"
mentorship.

Just thought I'd bring it up.

M
> To view this discussion on the web visit https://groups.google.com/d/msgid/astropy-dev/CAMxZ8Sn1z3%3DXZUROhJxihOdZ6mP5en4Yw2mVg745HmFVE4zWQA%40mail.gmail.com.
Reply all
Reply to author
Forward
0 new messages