I believe we currently set the ARSession update mode to blocking mode, meaning we get camera frames approximately at 30fps. That means that (1) and (3) should be roughly 30fps, and should be the same. This is what I would measure as the frame rate.
Outside of what you render through webgl, there are some steps before it becomes visible on screen - that is (2) and (4). There may be optimizations so Chrome's compositor doesn't copy the same pixels multiple frames. My understanding is Chrome's compositor will usually run at ~60fps.
If you are modifying Chromium to use the latest camera image multiple frames, (1) and (3) still represent how many RAF callbacks are invoked, and therefore how many times you render new frames. However, you'll have to be careful because now nothing will be throttling your frame rate, so you can render faster than the screen refresh rate. This means that (1) and (3) may be faster than 60fps, even if (2) is 60fps.