Proposed OOM improvements.

112 views

Skip to first unread message

Greg Spencer

unread,

Aug 4, 2010, 6:02:25 PM8/4/10

to Chromium OS dev, Chromium-dev

Hi Folks,

Here's my proposal for improving OOM situations on ChromeOS. In a nutshell, the idea is that we'll tune the OOM killer's algorithm to match what we want, and make the UI more explicit about what happened when a tab is killed by the OOM killer.

Please let me know if you have any suggestions/comments.

-Greg.

Document link: https://docs.google.com/a/google.com/document/edit?id=1ddPY1-v7ZFr0jmhuxw04ehNLMQrPzxHhL6vUoBzELBo&hl=en

(but that probably won't work outside google.com: see below for full text).

-------------------

Out of Memory Management for ChromeOS

Greg Spencer (gspencer), ChromeOS UI team.

Intro

Like all computers, ChromeOS devices have limited memory, and bad things happen when we run out of physical memory. We’d like to make ChromeOS be more elegant than most OSs when it runs into this situation. To that end, we’re looking to improve the user experience around out of memory (OOM) conditions.

Current State

Currently, when a ChromeOS device runs out of memory processes are killed by the OOM killer (a part of the kernel) until enough memory is available. Because we have no swap configured, but do allow overcommit (i.e. malloc pretends it has nearly unlimited memory when handing out addresses), eventually a process tries to use memory assigned to it in the virtual address space that isn’t actually available, and the kernel asks the OOM killer to kill processes on the system until enough memory is available. The processes killed don’t necessarily include the one that started to use the unavailable memory, but rather are based on the OOM killer’s “badness” algorithm.

We don’t have a swap partition configured because we’re afraid that it will start killing blocks on the SSD after an unreasonably short time. [I haven’t verified that this is indeed a problem, but I’m assuming that the original decision wasn’t made in a vacuum. It does seem to me that write levelling in the SSD hardware should mitigate this somewhat, however].

Currently, the renderers, plugins, and browser processes are (not surprisingly) the largest users of memory on Chrome OS. The renderers and plugins can be killed without crashing the system, but killing the browser process (which can grow quite large) causes the entire session to restart, so we want to kill that as a last resort (or at least after all the renderers and plugins have been killed).

Linux Chrome already uses the /proc/<PID>/oom_adj method for proiritizing renderers and plugins over the browser process (or system processes) for being killed by the OOM so they’ll be killed in the right order. This works fairly well, but the default OOM killer algorithm prefers to kill recent processes instead of older processes, so this is not quite optimal for us, as we would prefer to kill older tabs over newer ones, and non-pinned tabs before pinned ones. [The file /proc/<PID>/oom_adj contains a bit-shift value from -17 to 15 that adjust the badness value of the process.]

Also, when things are killed, the “Sad Tab” page is displayed, which doesn’t communicate the nature of the failure.

Possible Methods of Controlling Memory Usage on ChromeOS

[This is the brainstorming part of the doc. Not all of these will be implemented.]

Kernel Level

Change overcommit behavior (change to "overcommit_ratio”), to encourage more NULLs being returned from malloc instead of the OOM getting happy and killing stuff randomly. This might not actually help things -- it'll mean that the process that is trying to allocate always gets killed via segfault instead of another less important process.
Use mem_notify kernel module to send notification when thresholds are reached if we aren't already. This is useful for clearning caches, garbage collecting, etc, but isn’t a solution to the overall problem. This may be useful for marking which tabs are killed from the OOM and which are killed for other reasons.
Severely re-nice or stop processes that abuse memory in order to have resources to let user pick what to do. (but it is not be possible for it to happen fast enough in all cases).
Setup some small swap space (e.g. 50M) so that any very static data in memory gets swapped out. We currently have at least 25M of data that never gets accessed again once the app is loaded.

Chrome Level Changes

Respond to mem_notify events (in order of how draconian they are) with actions that don’t require user notification. This is by its nature a bandaid, as any memory sponge will quickly eat up the freed memory:

Flushing memory HTML caches
Garbage collecting all V8, Crore, Flash instances.
Sharing renderers among more tabs, killing some renderers. [Darin says this probably won’t gain us much -- only means we can share a few more font tables, etc, and will slow things down considerably due to swapping out DOMs, etc.]
Empty Flash and HTML5 audio/video buffers (and maybe notify user because they’ll all start rebuffering if they are playing).

Other measures that may require user notification:

Closing no-content tabs (new tab pages, about: pages).
Closing windows that only have non-content tabs in them (e.g. an empty window running with just the new tab page).

Reduce memory usage in the first place by:

mmap’ing large images (which would get swapped out on low memory by the kernel). [We may already be doing this]

Implementation Plan

Given that some of the suggestions above require more work than others, I’m planning to pick the low hanging items first, and then see how much bang that gives us, and then move on to more time consuming mitigation if that’s not sufficient.

Phase 1 -- Tune OOM killer algorithm

I'm going to collect the following information:

Whether or not a tab is pinned
When was the last time the user clicked on or entered something into the tab
When was the last time the user clicked on the tab to make it current
How much memory the tab is using

And then I'm going to come up with an algorithm (TBD) that ranks tabs based on these criterion. The algorithm will probably prefer to kill tabs that aren’t pinned, have been idle for the longest, and use the most memory. It’ll probably kill plugins before killing renderers.

I'm going to write a manager into the browser process that every so often (every five seconds or so) adjusts the oom_adj value of all the renderers and plugins to sort them based on the algorithm above. I will probably only need to adjust them a little -- the current renderer and plugin processes get an adjustment of five (which shifts the badness up by five bits). I'll probably just have three to five different levels of badness to assign, starting at five (where larger is more likely to be killed).

I'm going to change the UI so that the when a tab is killed by the OOM, it displays a page different from the “Sad Tab” page that tells the user what happened and why, and gives them the option to reload the page. This may be a little tricky to determine, as there really isn’t a lot of warning when the OOM kills your process.

It has been suggested that we just let the OOM killer kill a tab, mark it, and just reload it the next time the user visits it. We can test this, but my feeling is that the user will occasionally be very surprised to find that this happens, that some web apps will handle this poorly, and that losing user data on reload is something we need to explicitly notify the user about. It seems to me that if we can’t guarantee full reload (save DOM state, javascript variable state, plugin state, etc.), that this is a shabby thing to do: it’s cleaner to tell them why we killed it and let them decide if they want to chance reloading it.

As I’m implementing this, I’ll write a test that will exercise the OOM killer algorithm. Hopefully that’s not too tricky to get into our testing framework without being flaky.

Phase 1.1 -- Add in Networking info to OOM killer tuning.

Collecting the last time a tab accessed the network is complicated to implement (e.g. sandboxed network access happens in another process and so has to be tracked back to a renderer), so I’ll implement that only if we think it’ll help with tuning. The main idea here is that music streaming apps might be likely to be killed based on the other criterion, so this helps recognize tabs that are streaming in the background. The fallback is to have the user pin streaming tabs.

Phase 2 -- Notify user when memory is getting low

In this phase we post some kind of notification when we get a mem_notify event that we’re low on memory. At that point, we can ask the user to kill off memory intensive applications. This will require a UI similar to the task manager (it might even be the task manager) so that the user can make informed choices about what to kill. In order to be able to display this UI when the memory is low, we’ll have to pre-allocate it and keep it around until needed.

This feels like a pretty heavy UI, and I’m not sure all users will feel qualified to decide what to kill. Maybe just give them a choice of the top five candidates for killing?

Phase 3 -- Flush all caches on mem_notify events

In this phase we try and flush all available caches in the OS -- plugins, browsers, etc. when we get our first mem_notify event that we’re out of memory. This step seems like a bandaid -- it’s only going to help the first time it happens, and thereafter there will bealmost nothing freed until the caches have time to refill. This would, however, be good in combination with user notification that there is too much memory being used, since it may buy the user some time to manage their tabs.

Greg Spencer

unread,

Aug 4, 2010, 6:06:06 PM8/4/10

to Chromium OS dev, Chromium-dev

Luigi Semenzato

unread,

Aug 4, 2010, 9:11:53 PM8/4/10

to Chromium OS dev, Chromium-dev

I suspect there is one issue you may want to consider even before you
get to the ones you mention. We've had reports of "extreme slowness",
and I was able to reproduce such situation in the past. The slowness
(and pegged disk activity) is consistent with thrashing due to code
paging. Even though we don't use swap, the kernel will still reclaim
read-only executable pages since they have a backing store (the
executable file). I suspect this may make the system unusable before
you get into an actual OOM situation.

Other than that, this seems like a good plan. The "low-on-memory" UI
is something that is sorely missing from existing systems.

> --
> Chromium OS Developers mailing list: chromiu...@chromium.org
> View archives, change email options, or unsubscribe:
> http://groups.google.com/a/chromium.org/group/chromium-os-dev?hl=en
>

Will Drewry

unread,

Aug 5, 2010, 12:14:16 PM8/5/10

to Luigi Semenzato, Chromium OS dev, Chromium-dev

On Wed, Aug 4, 2010 at 8:11 PM, Luigi Semenzato <seme...@chromium.org> wrote:
> I suspect there is one issue you may want to consider even before you
> get to the ones you mention. We've had reports of "extreme slowness",
> and I was able to reproduce such situation in the past. The slowness
> (and pegged disk activity) is consistent with thrashing due to code
> paging. Even though we don't use swap, the kernel will still reclaim
> read-only executable pages since they have a backing store (the
> executable file). I suspect this may make the system unusable before
> you get into an actual OOM situation.

Out of curiousity, would this still be the case if Chrome was running
with rlimits? Will it still attempt to swap out read-only executable
pages to keep memory use under that bar or will it just start
returning malloc failures?

When I did some very informal testing a few months back, running
chrome with 90% of the system memory and opening many, many tabs
resulted in sad faces, but no thrashing. But that was unscientific
and I never ended up exploring the rlimit v OOM code in the kernel
(thus the question).

> Other than that, this seems like a good plan. The "low-on-memory" UI
> is something that is sorely missing from existing systems.

I agree. The UI would be phenomenal to have, but I'm curious if we
would get less magical behavior if we were using something like
cgroups-memory or rlimits. (E.g., no surprise swap-free thrashing).
But I haven't taken the time to investigate those avenues to know :/

Greg Spencer

unread,

Aug 5, 2010, 2:09:18 PM8/5/10

to Will Drewry, Luigi Semenzato, Chromium OS dev, Chromium-dev

On Thu, Aug 5, 2010 at 9:14 AM, Will Drewry <w...@chromium.org> wrote:

On Wed, Aug 4, 2010 at 8:11 PM, Luigi Semenzato <seme...@chromium.org> wrote:
> I suspect there is one issue you may want to consider even before you
> get to the ones you mention. We've had reports of "extreme slowness",
> and I was able to reproduce such situation in the past. The slowness
> (and pegged disk activity) is consistent with thrashing due to code
> paging. Even though we don't use swap, the kernel will still reclaim
> read-only executable pages since they have a backing store (the
> executable file). I suspect this may make the system unusable before
> you get into an actual OOM situation.

Out of curiousity, would this still be the case if Chrome was running
with rlimits? Will it still attempt to swap out read-only executable
pages to keep memory use under that bar or will it just start
returning malloc failures?

Good question. I'll have to look into that some more. I know that cgroups will reclaim from the cgroup LRU list when it approaches the limit however, so maybe that's the way to go.

One issue with setting resource limits is figuring out what to set them to. We'd have to be constantly tweaking the values whenever the system code changes (larger/smaller system memory use can come from anywhere).

Seems like what you'd want to do is measure how much memory the browser and system (I'm including X and the window manager in the "system") are using, and set the limits for the renderer and plugin processes to give the browser and systemsome headroom. But it would have to be dynamic, since the browser grows with more tabs, etc.

My gut feeling (although I definitely could be wrong here) is that we'd end up with behavior similar to the OOM killer strategy in the long run -- renderers would die before the browser and system processes, it would just be a different cause of death.

When I did some very informal testing a few months back, running
chrome with 90% of the system memory and opening many, many tabs
resulted in sad faces, but no thrashing. But that was unscientific
and I never ended up exploring the rlimit v OOM code in the kernel
(thus the question).

Yes, this is my experience with limited testing as well.

-Greg.

Luigi Semenzato

unread,

Aug 5, 2010, 2:27:10 PM8/5/10

to Greg Spencer, Will Drewry, Chromium OS dev, Chromium-dev

That's interesting. I never got sad faces during my tests, only slow
response and pegged disk activity. This is from last November. I was
running on a white eeepc. My testing strategy was to go to Google
News and control-click links as fast as I could. Then I'd go to some
of the new tabs and control-click more random links. A lot of the
pages had Flash-based ads.

Mandeep Singh Baines

unread,

Aug 6, 2010, 2:33:08 PM8/6/10

to Luigi Semenzato, Chromium OS dev, Chromium-dev

From the correct address.

Luigi Semenzato (seme...@chromium.org) wrote:
> I suspect there is one issue you may want to consider even before you

> get to the ones you mention. ?We've had reports of "extreme slowness",
> and I was able to reproduce such situation in the past. ?The slowness

> (and pegged disk activity) is consistent with thrashing due to code

> paging. ?Even though we don't use swap, the kernel will still reclaim

> read-only executable pages since they have a backing store (the

> executable file). ?I suspect this may make the system unusable before

> you get into an actual OOM situation.
>

I believe when you did you're testing, we were still on a 2.6.30 kernel.
Some work was done in 2.6.31 to improve this. From the
kernelnewbies 2.6.31 summary:

1.3. Improve desktop interactivity under memory pressure

PROT_EXEC pages are pages that normally belong to some currently running executables and their linked libraries, they shall really be cached aggressively to provide good user experiences because if they aren't, the desktop applications will experience very long and noticeable pauses when the application's code path jumps to a part of the code which is not cached in memory and needs to be read from the disk, which is very slow. Due to some memory management scalability work in recent kernel versions, there're some (commonly used) workloads which can send these PROT_EXEC pages to the list of filesystem-backed pages (the ones used to map files) which are unactive and can get flushed out of the working set. The result is a desktop environment with poor interactivity: the applications become unresponsive too easily.

In this version, some heuristics have been used to make much harder to get the mapped executable pages out of the list of active pages. The result is an improved desktop experience: Benchmarks on memory tight desktops show clock time and major faults reduced by 50%, and pswpin numbers are reduced to ~1/3, that means X desktop responsiveness is doubled under high memory/swap pressure. Memory flushing benchmarks in a file server shows the number of major faults going from 50 to 3 during 10% cache hot reads. See the commit link for more details and benchmarks.

http://kernelnewbies.org/Linux_2_6_31#head-799157cd8729eba8ee5bc1ff0290d7414f366ef2

> Other than that, this seems like a good plan. ?The "low-on-memory" UI

> is something that is sorely missing from existing systems.
>
>
> On Wed, Aug 4, 2010 at 3:06 PM, Greg Spencer <gspe...@chromium.org> wrote:
> > Hi Folks,
> >
> > Here's my proposal for improving OOM situations on ChromeOS. In a nutshell,
> > the idea is that we'll tune the OOM killer's algorithm to match what we
> > want, and make the UI more explicit about what happened when a tab is killed
> > by the OOM killer.
> > Please let me know if you have any suggestions/comments.
> > -Greg.
> >
> > Document link:
> > https://docs.google.com/a/google.com/document/edit?id=1ddPY1-v7ZFr0jmhuxw04ehNLMQrPzxHhL6vUoBzELBo&hl=en
> >
> > (but that probably won't work outside google.com: see below for full text).
> >
> > -------------------
> >
> > Out of Memory Management for ChromeOS
> >
> > Greg Spencer (gspencer), ChromeOS UI team.
> >
> > Intro
> >
> > Like all computers, ChromeOS devices have limited memory, and bad things

> > happen when we run out of physical memory. ?We?d like to make ChromeOS be
> > more elegant than most OSs when it runs into this situation. ?To that end,
> > we?re looking to improve the user experience around out of memory (OOM)

> > conditions.
> >
> > Current State
> >
> > Currently, when a ChromeOS device runs out of memory processes are killed by
> > the OOM killer (a part of the kernel) until enough memory is available.

> > ?Because we have no swap configured, but do allow overcommit (i.e. malloc

> > pretends it has nearly unlimited memory when handing out addresses),
> > eventually a process tries to use memory assigned to it in the virtual

> > address space that isn?t actually available, and the kernel asks the OOM

> > killer to kill processes on the system until enough memory is available.

> > ?The processes killed don?t necessarily include the one that started to use
> > the unavailable memory, but rather are based on the OOM killer?s ?badness?
> > algorithm.
> >
> > We don?t have a swap partition configured because we?re afraid that it will
> > start killing blocks on the SSD after an unreasonably short time. ?[I
> > haven?t verified that this is indeed a problem, but I?m assuming that the
> > original decision wasn?t made in a vacuum. ?It does seem to me that write

> > levelling in the SSD hardware should mitigate this somewhat, however].
> >
> > Currently, the renderers, plugins, and browser processes are (not

> > surprisingly) the largest users of memory on Chrome OS. ?The renderers and

> > plugins can be killed without crashing the system, but killing the browser
> > process (which can grow quite large) causes the entire session to restart,
> > so we want to kill that as a last resort (or at least after all the
> > renderers and plugins have been killed).
> >
> > Linux Chrome already uses the /proc/<PID>/oom_adj method for proiritizing
> > renderers and plugins over the browser process (or system processes) for

> > being killed by the OOM so they?ll be killed in the right order. ?This works

> > fairly well, but the default OOM killer algorithm prefers to kill recent
> > processes instead of older processes, so this is not quite optimal for us,
> > as we would prefer to kill older tabs over newer ones, and non-pinned tabs

> > before pinned ones. ?[The file /proc/<PID>/oom_adj contains a bit-shift

> > value from -17 to 15 that adjust the badness value of the process.]
> >

> > Also, when things are killed, the ?Sad Tab? page is displayed, which doesn?t

> > communicate the nature of the failure.
> >
> > Possible Methods of Controlling Memory Usage on ChromeOS
> >

> > [This is the brainstorming part of the doc. ?Not all of these will be
> > implemented.]
> >
> > Kernel Level
> >
> > Change overcommit behavior (change to "overcommit_ratio?), to encourage more

> > NULLs being returned from malloc instead of the OOM getting happy and

> > killing stuff randomly. ??This might not actually help things -- it'll mean

> > that the process that is trying to allocate always gets killed via segfault
> > instead of another less important process.
> > Use mem_notify kernel module to send notification when thresholds are

> > reached if we aren't already. ??This is useful for clearning caches, garbage
> > collecting, etc, but isn?t a solution to the overall problem. This may be

> > useful for marking which tabs are killed from the OOM and which are killed
> > for other reasons.
> > Severely re-nice or stop processes that abuse memory in order to have
> > resources to let user pick what to do. (but it is not be possible for it to
> > happen fast enough in all cases).
> > Setup some small swap space (e.g. 50M) so that any very static data in

> > memory gets swapped out. ?We currently have at least 25M of data that never

> > gets accessed again once the app is loaded.
> >
> > Chrome Level Changes
> >
> > Respond to mem_notify events (in order of how draconian they are) with

> > actions that don?t require user notification. ?This is by its nature a

> > bandaid, as any memory sponge will quickly eat up the freed memory:
> >
> > Flushing memory HTML caches
> > Garbage collecting all V8, Crore, Flash instances.

> > Sharing renderers among more tabs, killing some renderers. ?[Darin says this
> > probably won?t gain us much -- only means we can share a few more font

> > tables, etc, and will slow things down considerably due to swapping out
> > DOMs, etc.]
> > Empty Flash and HTML5 audio/video buffers (and maybe notify user because

> > they?ll all start rebuffering if they are playing).

> >
> > Other measures that may require user notification:
> >
> > Closing no-content tabs (new tab pages, about: pages).
> > Closing windows that only have non-content tabs in them (e.g. an empty
> > window running with just the new tab page).
> >
> > Reduce memory usage in the first place by:
> >

> > mmap?ing large images (which would get swapped out on low memory by the
> > kernel). ?[We may already be doing this]
> >
> > Implementation Plan
> >
> > Given that some of the suggestions above require more work than others, I?m

> > planning to pick the low hanging items first, and then see how much bang

> > that gives us, and then move on to more time consuming mitigation if that?s

> > not sufficient.
> >
> > Phase 1 -- Tune OOM killer algorithm
> >
> > I'm going to collect the following information:
> >
> > Whether or not a tab is pinned
> > When was the last time the user clicked on or entered something into the tab
> > When was the last time the user clicked on the tab to make it current
> > How much memory the tab is using
> >
> > And then I'm going to come up with an algorithm (TBD) that ranks tabs based

> > on these criterion. ?The algorithm will probably prefer to kill tabs that
> > aren?t pinned, have been idle for the longest, and use the most memory.
> > ?It?ll probably kill plugins before killing renderers.

> >
> > I'm going to write a manager into the browser process that every so often
> > (every five seconds or so) adjusts the oom_adj value of all the renderers

> > and plugins to sort them based on the algorithm above. ?I will probably only

> > need to adjust them a little -- the current renderer and plugin processes

> > get an adjustment of five (which shifts the badness up by five bits). ?I'll

> > probably just have three to five different levels of badness to assign,
> > starting at five (where larger is more likely to be killed).
> >
> > I'm going to change the UI so that the when a tab is killed by the OOM, it

> > displays a page different from the ?Sad Tab? page that tells the user what
> > happened and why, and gives them the option to reload the page. ?This may be
> > a little tricky to determine, as there really isn?t a lot of warning when

> > the OOM kills your process.
> >
> > It has been suggested that we just let the OOM killer kill a tab, mark it,

> > and just reload it the next time the user visits it. ?We can test this, but

> > my feeling is that the user will occasionally be very surprised to find that
> > this happens, that some web apps will handle this poorly, and that losing
> > user data on reload is something we need to explicitly notify the user

> > about. ?It seems to me that if we can?t guarantee full reload (save DOM

> > state, javascript variable state, plugin state, etc.), that this is a shabby

> > thing to do: it?s cleaner to tell them why we killed it and let them decide

> > if they want to chance reloading it.
> >

> > As I?m implementing this, I?ll write a test that will exercise the OOM
> > killer algorithm. ?Hopefully that?s not too tricky to get into our testing

> > framework without being flaky.
> >
> > Phase 1.1 -- Add in Networking info to OOM killer tuning.
> >
> > Collecting the last time a tab accessed the network is complicated to
> > implement (e.g. sandboxed network access happens in another process and so

> > has to be tracked back to a renderer), so I?ll implement that only if we
> > think it?ll help with tuning. ?The main idea here is that music streaming

> > apps might be likely to be killed based on the other criterion, so this

> > helps recognize tabs that are streaming in the background. ?The fallback is

> > to have the user pin streaming tabs.
> >
> > Phase 2 -- Notify user when memory is getting low
> >
> > In this phase we post some kind of notification when we get a mem_notify

> > event that we?re low on memory. ?At that point, we can ask the user to kill
> > off memory intensive applications. ?This will require a UI similar to the

> > task manager (it might even be the task manager) so that the user can make

> > informed choices about what to kill. ?In order to be able to display this UI
> > when the memory is low, we?ll have to pre-allocate it and keep it around
> > until needed.
> >
> > This feels like a pretty heavy UI, and I?m not sure all users will feel
> > qualified to decide what to kill. ?Maybe just give them a choice of the top

> > five candidates for killing?
> >
> > Phase 3 -- Flush all caches on mem_notify events
> >
> > In this phase we try and flush all available caches in the OS -- plugins,

> > browsers, etc. when we get our first mem_notify event that we?re out of
> > memory. ?This step seems like a bandaid -- it?s only going to help the first

> > time it happens, and thereafter there will bealmost nothing freed until the

> > caches have time to refill. ?This would, however, be good in combination

Reply all

Reply to author

Forward

This conversation is locked

You cannot reply and perform actions on locked conversations.

0 new messages