Hi Anirvinya G,
Adding to what Jonathan said.
Our ideas page lists goals that we think will be important for MDAnalysis; necessarily, these goals are expressed at a high level of abstraction. In your proposal you have the opportunity to make these goals as concrete as you can. In other words, you have considerable freedom in how you want to tackle the problem, and we are interested in reading YOUR ideas. We will be impressed by a proposal that shows research and describes a feasible and specific plan to accomplish the goal. If you find that major points are unclear, then making these points clear should be an objective in your proposal: explain what is unclear, why you need whatever it is, and how you will achieve it.
Even if you haven’t done any benchmarking yet, you can survey the MDAnalysis code (e.g., by module) and note which modules already have some benchmarking coverage. From there you can develop a plan for covering the library. We would expect you to prioritize the benchmarks in your plan so that highly used and performance-sensitive code is covered first.
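To make the idea of a survey a bit more concrete, here is a rough sketch of how one might tabulate coverage by module. It is not an existing MDAnalysis tool: the function name and the simple text search for "MDAnalysis.<module>" in the benchmark files are assumptions made for illustration, and a text match only tells you that a module is mentioned, not how thoroughly it is benchmarked.

```python
import re
from pathlib import Path


def survey_benchmark_coverage(package_dir, benchmarks_dir, package_name="MDAnalysis"):
    """Map each top-level module of a package to the benchmark files that mention it.

    Hypothetical helper: "coverage" here only means that a benchmark file
    contains the text "<package_name>.<module>".
    """
    package_dir = Path(package_dir)
    benchmarks_dir = Path(benchmarks_dir)

    # Top-level modules: subdirectories with an __init__.py, plus plain .py files.
    modules = set()
    for entry in package_dir.iterdir():
        if entry.is_dir() and (entry / "__init__.py").exists():
            modules.add(entry.name)
        elif entry.suffix == ".py" and entry.stem != "__init__":
            modules.add(entry.stem)

    coverage = {name: [] for name in sorted(modules)}
    for bench_file in sorted(benchmarks_dir.rglob("*.py")):
        text = bench_file.read_text(encoding="utf-8", errors="ignore")
        for name in coverage:
            # Crude textual match, e.g. "MDAnalysis.lib" or "MDAnalysis.lib.distances".
            pattern = rf"\b{re.escape(package_name)}\.{re.escape(name)}\b"
            if re.search(pattern, text):
                coverage[name].append(bench_file.name)
    return coverage
```

You would call it with the directory of the package source and the directory that holds the benchmark files in your checkout (please check the actual paths in the repository). Modules that come back with an empty list are candidates for new benchmarks.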
As a starting point, code that has been written in Cython is almost certainly performance relevant (see a lot of the code under MDAnalysis.lib, for instance, or trajectory readers). Features like on-the-fly transformations would also benefit from benchmarking, especially as we have open PRs that are supposed to increase performance.
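If you want a quick overview of where the Cython code lives, something along the following lines would do. Again, this is only a sketch under the assumption that Cython sources can be recognized by their .pyx extension; the function name is made up for this example.

```python
from collections import Counter
from pathlib import Path


def count_cython_sources(package_dir):
    """Count .pyx files per top-level subpackage of package_dir.

    Hypothetical helper; returns (subpackage, count) pairs, largest count first.
    """
    package_dir = Path(package_dir)
    counts = Counter()
    for pyx in package_dir.rglob("*.pyx"):
        relative = pyx.relative_to(package_dir)
        # Files sitting directly in package_dir are grouped under ".".
        top = relative.parts[0] if len(relative.parts) > 1 else "."
        counts[top] += 1
    return counts.most_common()
```

Subpackages with many Cython files are a reasonable first guess for where benchmarks matter most, although the number of files alone says nothing about how heavily the code is used.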
The issue tracker has open and closed issues with the “performance” label, which could give you an initial idea of the code areas relevant for benchmarking. There is also a “benchmarking” label. Finally, searching the mailing list archives might also help.
If you have specific questions, we can answer those as well.
Best,
Oliver