|Gerrit Metrics||Stephen Roberts||12/9/11 3:00 AM|
I am interested in generating some metrics from our gerrit
installation and am wondering if it is possible/trivial. Basically, we
have been trialling gerrit for some time now and I would like to go to
management with some hard number as to why we should continue to use
it. Sort of like, we have done X reviews, which on average go through
Y iterations and catch Z issues. Is this possible? Or even is the
database format documented somewhere to make writing such a tool easy?
|Re: Gerrit Metrics||zivkov||12/9/11 8:23 AM|
I don't think the DB schema is documented but it is quite simple to understand.
Look at the DB tables (changes, patch_sets, etc..) and you will find the
information you need.
|Re: Gerrit Metrics||Edwin Kempin||12/9/11 8:31 AM|
please be aware that there is the idea to remove the database in future Gerrit versions (or at least most of its tables). E.g. all the change data will then be stored directly in the git repositories. Keep that in mind before investing too much efforts in writing a tool that relies on the Gerrit database schema.
2011/12/9 Saša Živkov <ziv...@gmail.com>
|Re: Gerrit Metrics||Deen Sethanandha||12/15/11 8:15 AM|
I am also working on mining Gerrit as part of my Ph.D Dissertation. Currently, I retrieve data directly from the database but the more robust approach would be to create a metric database by retrieving the data from the Gerrit API.
@Stephen, please let me know what kind of metric are you interested in getting. Hopefully we can work together on this. I am working on proposing metrics that can be used to evaluate and improve review process.
The related table for collecting metrics are
1) accounts -> store user information
2) changes -> store Gerrit change requests
3) change_message -> store comments for change requests
4) patch_sets -> store patchset information but it doesn't have content of patch of patch_set. The content of patch is stored in Git.
5) patch_comments -> stores comment for each line in a patch
@Edwin, could you let us know where we can learn more about the possible changes when the data is moved to Git. How Gerrit is going to store review history and how to retrieve them? What is the status of this effort?
|Re: Gerrit Metrics||Edwin Kempin||12/22/11 11:00 AM|
2011/12/15 Deen Sethanandha <deen...@gmail.com>
Shawn follows the idea to remove the Gerrit database already for some time. It was discussed on several occasions (e.g. on the mailing list, but also on GitTogether).
One hint about it can be found in the Gerrit design document 'Documentation/dev-design.html'. There it says
"The metadata is mostly housed in the database (*1) ...
*1 Although an effort is underway to eliminate the use of the database altogether, and to store all the metadata directly in the git repositories themselves. So far, as of Gerrit 2.2.1, of all Gerrit’s metadata, only the project configuration metadata has been migrated out of the database and into the git repositories for each project."
For the upcoming Gerrit 2.2.2 release this statement is still true.
Interesting to check out is one discussion on the mailing list  where some pros and cons about removing the database are discussed.
How exactly the review history will be stored in the end I can't say, but I'm sure that Shawn has already some ideas about this. I expect that we will have some special branches in the git repositories in which the review data is persisted (similar to the refs/meta/config branch that we already have to store the project configuration metadata).
|Re: Gerrit Metrics||Janne Hellsten||12/27/11 10:46 AM|
It should be relatively easy to collect metrics via the Gerrit JSON
See docs for the "query" command:
Not everything is available via this interface but this should be
I've written Haskell code to talk to Gerrit via SSH and parse the JSON
|Re: Gerrit Metrics||Remy Bohmer||12/28/11 12:36 PM|
2011/12/27 Janne Hellsten <jjhe...@gmail.com>:
Here is the ping ;-)
|Re: Gerrit Metrics||Janne Hellsten||12/29/11 8:02 AM|
>> I've written Haskell code to talk to Gerrit via SSH and parse the JSON
You can find the GerritJson module here: https://github.com/nurpax/gerrit-json
I only recently went back to Haskell so there are probably many ways
|Re: Gerrit Metrics||Remy Bohmer||12/29/11 1:57 PM|
|RE: Gerrit Metrics||Gustaf Lundh||10/12/12 4:30 AM|
For some of my Gerrit-metrics I'm using the Gerrit-event module in Gerrit-trigger to collect events through the stream-event SSH interface.
I collect the interesting data and feed it to a Graphite instance. This way I can plot real-time data in terms of:
Changes per hour (or minute) and also the rate of comments/merges/etc.
One of my colleagues has also created python interface for listening and parsing the events. Not sure if it has been open-sourced yet.
|RE: Gerrit Metrics||Pursehouse, David||10/12/12 5:10 AM|
> One of my colleagues has also created python interface for listening and parsing the events.Not yet. It needs a bit more tidying up, and then needs to go through the usual management approvals before it can be open sourced.
|Re: Gerrit Metrics||David Pursehouse||8/8/13 11:57 PM|
It's taken a bit longer than I expected, but the python interface that Gustaf was referring to above has been open sourced. Actually the initial commits were pushed to Github some time ago, but it's only in this last week that I've managed to find time to make it into a proper Python package.
Basically what it does is gives you a python class that runs the `stream-events` command over SSH and packages the received data up into python objects in a queue that the client can fetch from. We use it in some backend management scripts for Django apps that need to get real-time information about commits being approved/merged etc.
The source (see link below) includes an example script that demonstrates the usage.
It's been built with quite specific uses in mind so I'm not really sure how useful it's going to be for anyone else. Pull requests are welcome though.
Source on Github: https://github.com/sonyxperiadev/pygerrit
Distribution package: https://pypi.python.org/pypi/pygerrit/0.1.0
API documentation: http://pythonhosted.org/pygerrit/