--
--
Chromium Developers mailing list: chromi...@chromium.org
View archives, change email options, or unsubscribe:
http://groups.google.com/a/chromium.org/group/chromium-dev
A lot of the causes of tree breakage now is when the CQ is skipped. So we should try to drastically reduce the number of times this is used.Please don't use NOTRY=true. Please don't LGTM changes with it.
Skipping the trybots should only be done for immediate reverts.
NOPRESUBMIT=true should skip presubmit checks.
Which warnings?
Cheers,Mark
+infra-dev,jrobbins,rmistryI don't know the answer to the question about "Revert Patchset" behavior. Adding Jason and Ravi that should know it.
On Friday, January 30, 2015 at 7:31:26 AM UTC-5, Paweł Hajdan, Jr. wrote:+infra-dev,jrobbins,rmistryI don't know the answer to the question about "Revert Patchset" behavior. Adding Jason and Ravi that should know it.The one-click revert button used to leave off "TBR=" for CLs older than 14 days, but this functionality was removed in crrev/124470043 because it was found to be confusing (chromium:401513). I agreed with the change and personally did not like the idea of the button automatically behaving differently based on how old the CL is.
How about instead putting in a checkbox below the "Automatically send Revert CL to CQ" that says "Commit immediately". If the CL is X days old then you get an extra prompt "This CL is X days old, are you sure you want to commit immediately?" or maybe the "Commit immediately" can be unchecked by default for old CLs instead.
On Fri, Jan 30, 2015 at 5:04 AM, Ravi <rmi...@chromium.org> wrote:
On Friday, January 30, 2015 at 7:31:26 AM UTC-5, Paweł Hajdan, Jr. wrote:+infra-dev,jrobbins,rmistryI don't know the answer to the question about "Revert Patchset" behavior. Adding Jason and Ravi that should know it.The one-click revert button used to leave off "TBR=" for CLs older than 14 days, but this functionality was removed in crrev/124470043 because it was found to be confusing (chromium:401513). I agreed with the change and personally did not like the idea of the button automatically behaving differently based on how old the CL is.How about instead putting in a checkbox below the "Automatically send Revert CL to CQ" that says "Commit immediately". If the CL is X days old then you get an extra prompt "This CL is X days old, are you sure you want to commit immediately?" or maybe the "Commit immediately" can be unchecked by default for old CLs instead.I think we should use sane defaults, and the user can (always) override. It seems that this old reverted behavior was slightly different though, in that it left out TBR so the change wouldn't let.Proposal: for commits < 6 hours, current behavior stays the same. The reason is that something in the tree is broken and it might have taken a while to track it down during sheriff-switchover. Since it's less than 6 hours, the chance that it'll collide with another landed change is low.For commits longer than 6 hours, then we leave TBR as is, but we don't add NOTRY=true. The older a change is, the more chance that its revert will collide with other commits. Also, if a change is old then there's no urgent rush to revert it and it can wait an hour.
On Friday, January 30, 2015 at 10:42:17 AM UTC-5, John Abd-El-Malek wrote:On Fri, Jan 30, 2015 at 5:04 AM, Ravi <rmi...@chromium.org> wrote:
On Friday, January 30, 2015 at 7:31:26 AM UTC-5, Paweł Hajdan, Jr. wrote:+infra-dev,jrobbins,rmistryI don't know the answer to the question about "Revert Patchset" behavior. Adding Jason and Ravi that should know it.The one-click revert button used to leave off "TBR=" for CLs older than 14 days, but this functionality was removed in crrev/124470043 because it was found to be confusing (chromium:401513). I agreed with the change and personally did not like the idea of the button automatically behaving differently based on how old the CL is.How about instead putting in a checkbox below the "Automatically send Revert CL to CQ" that says "Commit immediately". If the CL is X days old then you get an extra prompt "This CL is X days old, are you sure you want to commit immediately?" or maybe the "Commit immediately" can be unchecked by default for old CLs instead.I think we should use sane defaults, and the user can (always) override. It seems that this old reverted behavior was slightly different though, in that it left out TBR so the change wouldn't let.Proposal: for commits < 6 hours, current behavior stays the same. The reason is that something in the tree is broken and it might have taken a while to track it down during sheriff-switchover. Since it's less than 6 hours, the chance that it'll collide with another landed change is low.For commits longer than 6 hours, then we leave TBR as is, but we don't add NOTRY=true. The older a change is, the more chance that its revert will collide with other commits. Also, if a change is old then there's no urgent rush to revert it and it can wait an hour.My 2c:* 6 hours is IMHO too short a timeframe, some projects correctness/perf tests may reliably identify a CL that needs to be reverted after that. We could make the timeframe project specific with a way to specify whether a project would like to opt out of this.* The problem with the previous removing TBR= implementation was that it was never clear to the revert'er what was going to happen. If we do implement this NOTRY proposal, then lets make it crystal clear what is going to happen either with a new dialog or a descriptive message in the revert popup.
--
You received this message because you are subscribed to the Google Groups "infra-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to infra-dev+...@chromium.org.
To post to this group, send email to infr...@chromium.org.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/infra-dev/d1bd3786-5096-4036-bfd6-825eb7f4a88a%40chromium.org.
--
The memory sheriffs have a tradition of landing suppression changes with NOTRY=true: http://www.chromium.org/developers/tree-sheriffs/sheriff-details-chromium/memory-sheriff#TOC-Submitting-a-patch
Are the bots now smart enough to avoid compiling the world when the changed file isn't compiled?
On Fri, Jan 30, 2015 at 8:29 AM, Ravi <rmi...@chromium.org> wrote:
On Friday, January 30, 2015 at 10:42:17 AM UTC-5, John Abd-El-Malek wrote:On Fri, Jan 30, 2015 at 5:04 AM, Ravi <rmi...@chromium.org> wrote:
On Friday, January 30, 2015 at 7:31:26 AM UTC-5, Paweł Hajdan, Jr. wrote:+infra-dev,jrobbins,rmistryI don't know the answer to the question about "Revert Patchset" behavior. Adding Jason and Ravi that should know it.The one-click revert button used to leave off "TBR=" for CLs older than 14 days, but this functionality was removed in crrev/124470043 because it was found to be confusing (chromium:401513). I agreed with the change and personally did not like the idea of the button automatically behaving differently based on how old the CL is.How about instead putting in a checkbox below the "Automatically send Revert CL to CQ" that says "Commit immediately". If the CL is X days old then you get an extra prompt "This CL is X days old, are you sure you want to commit immediately?" or maybe the "Commit immediately" can be unchecked by default for old CLs instead.I think we should use sane defaults, and the user can (always) override. It seems that this old reverted behavior was slightly different though, in that it left out TBR so the change wouldn't let.Proposal: for commits < 6 hours, current behavior stays the same. The reason is that something in the tree is broken and it might have taken a while to track it down during sheriff-switchover. Since it's less than 6 hours, the chance that it'll collide with another landed change is low.For commits longer than 6 hours, then we leave TBR as is, but we don't add NOTRY=true. The older a change is, the more chance that its revert will collide with other commits. Also, if a change is old then there's no urgent rush to revert it and it can wait an hour.My 2c:* 6 hours is IMHO too short a timeframe, some projects correctness/perf tests may reliably identify a CL that needs to be reverted after that. We could make the timeframe project specific with a way to specify whether a project would like to opt out of this.* The problem with the previous removing TBR= implementation was that it was never clear to the revert'er what was going to happen. If we do implement this NOTRY proposal, then lets make it crystal clear what is going to happen either with a new dialog or a descriptive message in the revert popup.In general I prefer to avoid extra dialogs or new checkboxes to avoid cluttering up the UI.With the previous removal of TBR=, it was confusing because people hit "revert" and might close the page and the revert never lands. With the proposal above, I don't think there's much confusion. The reverter has the same flow (just hit one button). It's just sometimes it takes an hour while other times it takes a few minutes. Devs are already trained that CQ takes variable amount of time, depending on how queued it is, if the tree is closed, how many files their change touches etc..
To unsubscribe from this group and stop receiving emails from it, send an email to infra-dev+unsubscribe@chromium.org.
The confusion would result from the revert CL not landing immediately. I personally would rather click through an extra dialog or read an info message telling me what is going to happen, than having my expected keywords be automatically missing and spend time trying to figure out where they went. Leaving the decision to Jason.
To unsubscribe from this group and stop receiving emails from it, send an email to infra-dev+...@chromium.org.
To post to this group, send email to infr...@chromium.org.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/infra-dev/9747b96a-210f-4ac2-998f-c4caddc09913%40chromium.org.
Out of curiosity, I didn't put NOTRY=true on my recent change
https://codereview.chromium.org/886073003and it took 35 minutes to get through CQ.It only affects one FYI configuration (ASan for Windows), yet CQ has made a build on Win8 builder and ran all tests on Linux-ASan.
On Mon, Feb 2, 2015 at 6:03 AM, Timur Iskhodzhanov <timu...@chromium.org> wrote:Out of curiosity, I didn't put NOTRY=true on my recent change
https://codereview.chromium.org/886073003and it took 35 minutes to get through CQ.It only affects one FYI configuration (ASan for Windows), yet CQ has made a build on Win8 builder and ran all tests on Linux-ASan.Hey Timur, thanks for checking. I have filed bugs on all these issues (some were already known).
Is 35 minutes really slowing down the memory sheriffs? Compared with 6 months ago, median CQ time was 4 hours. Now Linux ASAN runs on CQ, and the other ASAN bots on the main waterfall cycle in < 30 minutes compared to up 4 hours before.
On Mon Feb 02 2015 at 6:44:34 PM John Abd-El-Malek <j...@chromium.org> wrote:On Mon, Feb 2, 2015 at 6:03 AM, Timur Iskhodzhanov <timu...@chromium.org> wrote:Out of curiosity, I didn't put NOTRY=true on my recent change
https://codereview.chromium.org/886073003and it took 35 minutes to get through CQ.It only affects one FYI configuration (ASan for Windows), yet CQ has made a build on Win8 builder and ran all tests on Linux-ASan.Hey Timur, thanks for checking. I have filed bugs on all these issues (some were already known).Thanks!Is 35 minutes really slowing down the memory sheriffs? Compared with 6 months ago, median CQ time was 4 hours. Now Linux ASAN runs on CQ, and the other ASAN bots on the main waterfall cycle in < 30 minutes compared to up 4 hours before.Sorry for the naming confusion [I was against this naming], but memory sheriffs are not looking at the Memory waterfall [ASan,LSan].Instead, they look at Valgrind, MSan and TSan bots. They are much slower than ASan and there's no point
in waiting even 30 minutes CQ when you can locally try new suppressions with a cycle time of 1 second [search for waterfall.sh here:
On Mon Feb 02 2015 at 6:44:34 PM John Abd-El-Malek <j...@chromium.org> wrote:On Mon, Feb 2, 2015 at 6:03 AM, Timur Iskhodzhanov <timu...@chromium.org> wrote:Out of curiosity, I didn't put NOTRY=true on my recent change
https://codereview.chromium.org/886073003and it took 35 minutes to get through CQ.It only affects one FYI configuration (ASan for Windows), yet CQ has made a build on Win8 builder and ran all tests on Linux-ASan.Hey Timur, thanks for checking. I have filed bugs on all these issues (some were already known).Thanks!Is 35 minutes really slowing down the memory sheriffs? Compared with 6 months ago, median CQ time was 4 hours. Now Linux ASAN runs on CQ, and the other ASAN bots on the main waterfall cycle in < 30 minutes compared to up 4 hours before.Sorry for the naming confusion [I was against this naming], but memory sheriffs are not looking at the Memory waterfall [ASan,LSan].Instead, they look at Valgrind, MSan and TSan bots. They are much slower than ASan and there's no point in waiting even 30 minutes CQ when you can locally try new suppressions with a cycle time of 1 second [search for waterfall.sh here:
--
Also per my previous email that may have crossed with this, given how slow the memory bots are, the difference between 0 and 30 minutes won't change things.
On Mon, Feb 2, 2015 at 7:49 AM, Timur Iskhodzhanov wrote:On Mon Feb 02 2015 at 6:44:34 PM John Abd-El-Malek wrote:
On Mon, Feb 2, 2015 at 6:03 AM, Timur Iskhodzhanov wrote:Out of curiosity, I didn't put NOTRY=true on my recent change
https://codereview.chromium.org/886073003and it took 35 minutes to get through CQ.It only affects one FYI configuration (ASan for Windows), yet CQ has made a build on Win8 builder and ran all tests on Linux-ASan.Hey Timur, thanks for checking. I have filed bugs on all these issues (some were already known).
Thanks!Is 35 minutes really slowing down the memory sheriffs? Compared with 6 months ago, median CQ time was 4 hours. Now Linux ASAN runs on CQ, and the other ASAN bots on the main waterfall cycle in < 30 minutes compared to up 4 hours before.Sorry for the naming confusion [I was against this naming], but memory sheriffs are not looking at the Memory waterfall [ASan,LSan].Instead, they look at Valgrind, MSan and TSan bots. They are much slower than ASan and there's no point in waiting even 30 minutes CQ when you can locally try new suppressions with a cycle time of 1 second [search for waterfall.sh here:
I want to second this. Memory suppression changes only touch text files that can't affect any main waterfall bots, and it's work you do to get bots back to green. Adding 30 minutes of overhead for something that adds 0 value doesn't sound like a good trade.
It used to be 5 minutes for no-op changes, but there have been some regressions and they'll be fixed.
It used to be 5 minutes for no-op changes, but there have been some regressions and they'll be fixed.What's the plan for regular sheriffs? If someone checks in something that causes a failure on a main waterfall builder not covered by the cq that has an obvious fix, will fixing them (instead of reverting) no longer be allowed?
On Fri Jan 30 2015 at 6:38:36 PM Ravi <rmi...@chromium.org> wrote:The confusion would result from the revert CL not landing immediately. I personally would rather click through an extra dialog or read an info message telling me what is going to happen, than having my expected keywords be automatically missing and spend time trying to figure out where they went. Leaving the decision to Jason.+1
On Sun, 01 Feb 2015 07:48:05 +0100, Sergiy Byelozyorov <ser...@chromium.org> wrote:On Fri Jan 30 2015 at 6:38:36 PM Ravi <rmi...@chromium.org> wrote:The confusion would result from the revert CL not landing immediately. I personally would rather click through an extra dialog or read an info message telling me what is going to happen, than having my expected keywords be automatically missing and spend time trying to figure out where they went. Leaving the decision to Jason.+1It sounds to me like the problem you (rmistry and sergeiyb and most likely many others) are seeing is that you don't trust the CQ and don't know how to check what it's working on and prioritizing at the moment?
My first thought about it is that I agree that users should not be uncertain about the affect of a revert, and that the dialog for doing a revert has some room to add information that could clarify what will happen.I am thinking that we could just make all the revert options more explicit. E.g., in ASCII-art:Reason for revert:[text entry box]I'm thinking that the rule for NOTRY on reverts should just be that it defaults to false because (ideally) a revert should still pass tests. The user doing the revert can get in the habit of checking it by clicking on it if they need it (e.g., to quickly roll back a series of changes). If we later put in a more tricky rule for the NOTRY default, the user will still be able to see it and override it before pressing the submit button.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/infra-dev/CADPkFo1Y-cz%3DgcmJjx1uMOjejjG0Xa1z8ndAMXqW_txsWSMwKA%40mail.gmail.com.
I have a question on how the analyze stuff works. Someone on this thread might know the answer to that. https://codereview.chromium.org/898183005/ (an OWNERS file change) got committed almost immediately after I hit "cq" – apparently from the presubmit bot, not by the cq (?).
Very fast, cool. However, hitting cq still kicked off builds on 25 bots, and they all think that they should actually compile things (which seems wasteful).
On Tue, Feb 10, 2015 at 7:42 AM, Nico Weber <tha...@chromium.org> wrote:I have a question on how the analyze stuff works. Someone on this thread might know the answer to that. https://codereview.chromium.org/898183005/ (an OWNERS file change) got committed almost immediately after I hit "cq" – apparently from the presubmit bot, not by the cq (?).Sheyang/Sergey/Pawel: this is new to me; I would expect that CQ would need the same bots to pass (the full list) independent of how the try was triggered?I guess I have seen this before where a "git cl try", on Android only file, might only trigger the android boy because of the presubmit filters. However it seems that the CQ verifier should trigger any other trybot in the full list that's not there, instead of just looking that the ones in Rietveld are green.
On Tue, Feb 10, 2015 at 7:42 AM, Nico Weber <tha...@chromium.org> wrote:I have a question on how the analyze stuff works. Someone on this thread might know the answer to that. https://codereview.chromium.org/898183005/ (an OWNERS file change) got committed almost immediately after I hit "cq" – apparently from the presubmit bot, not by the cq (?).Sheyang/Sergey/Pawel: this is new to me; I would expect that CQ would need the same bots to pass (the full list) independent of how the try was triggered?
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/infra-dev/CAPTJ0XFWJ9MuN2y9MvstdeN0D1XNvfXBWZ-iihdYznfyo52Lug%40mail.gmail.com.