Librarization/Modularization

30 views

Skip to first unread message

Waldek Kozaczuk

unread,

May 26, 2020, 1:29:51 AM5/26/20

to OSv Development

Hi,

I am going to be sending proper "Next Release Proposal" email later this week (or next) and "Librarization/Modularization" will be a key part of it. Currently, OSv kernel provides quite a significant subset of the functionality of some standard Linux libraries listed here - https://github.com/cloudius-systems/osv#kernel-size. In reality, many applications do not need all of this functionality, but they "get it" whether they need it or not. Even Java, which used to need lots of symbols from standard libraries, has become way more modular, and with the advent of GraalVM and other AOT-type technologies, OSv kernel does not need to provide all this functionality universally to every app. Worse, if you run an app on Firecracker which needs console, non-PCI virtio-blk and virtio-net drivers only, one gets all other drivers including ones for VirtualBox, Xen, VMware, etc. This actually makes OSv barely a unikernel or at best a "fat" one. This has some real negative consequences - higher memory utilization (kernel needs to be loaded in memory), larger kernel file (makes decompression longer), and poorer security because of the fairly vast number of exported symbols (at this moment everything non-static gets exported) and finally possibly less optimized code. On the other hand, because of this "universality", it is quite easy, comparing to other unikernels, to run an arbitrary Linux app on OSv. And no matter what we do to make OSv more modular, we should preserve that "ease" and not make it harder, at least by default, to run an app on OSv.

So in general, what I am advocating for, is an ability (and a mechanism) to create more "stripped-down" versions of kernels tailored to the need of specific app and/or specific hypervisor OSv will run on while preserving the default universal kernel. And also shrinking the universal kernel by extracting optional functionality from it, where it makes sense and is relatively easy to do so, as a shared library to be loaded during the boot process. The latter should also ideally involve the build process (compile/link) optimizations I have already proposed in my other email I sent a week ago to the group - https://groups.google.com/d/msg/osv-dev/hCYGRzytaJ4/D23S_ibNAgAJ

In the end, what I am proposing could be organized in the following three categories:

Tailor kernel (and really drivers) to a specific hypervisor - this could be as simple as defining more granular sets of targets in the main makefile and adding #ifdef in all relevant places and possibly using existing ./conf/*.mk - based mechanism; for starters we could define a build configuration for Firacracker and QEMU microvm machine that I believe requires the same small subset of drivers.
Extract optional functionality into shared libraries - this is more difficult than the above. One example of such functionality is ZFS and there is already an open issue - https://github.com/cloudius-systems/osv/issues/1009. Some drivers could be extracted as libraries as well but it might be more difficult to do so. The main difficulty here is that there needs to be a filesystem mounted early enough in the boot process to load such a library from - bootfs (less attractive as it is part of loader.elf/kernel.elf) or RoFS.
Create a mechanism to build a smaller kernel "tailored" to a specific app. This would require some sort of ELF analyzer tool that would identify all symbols needed by the given app and its dependencies and create a version script file defining specific set of symbols to be exported from kernel. To achieve that we could start with addressing the issue - https://github.com/cloudius-systems/osv/issues/97 - "Be more selective on symbols exported from the kernel" - that could deliver such a generic solution.

Addressing 3) could help us with another issue - https://github.com/cloudius-systems/osv/issues/821 - "Combining pre-compiled OSv kernel with pre-compiled executable". To that end, we could also consider creating a mechanism that would let us build a stripped-down version of the kernel with functionality exposed through SYSCALL instruction only and no built-in musl (except for dynamic linker function (dlopen, etc)) and libc and let one mix in original pre-built musl library which would interact with kernel through those SYSCALL calls. This would require probably exposing more functions as SYSCALL than we have now in linux.cc - at least brk and clone. I am not sure if that is even feasible but I think I think at least one of the unikernels does just this - Hermitux.

I am also leaning more and more toward hiding C++ library - this should help us with 821 and there is at least one case - dotnet apps - that require an incompatible version of libstdc++.so. This would impact existing internal C++ apps like cpiod and httpserver as we would have to add libstdc++.so to the manifest for any of these apps. So there are some space and memory trade-offs here.

What do you think?

Waldek

Nadav Har'El

unread,

May 26, 2020, 4:05:35 AM5/26/20

to Waldek Kozaczuk, OSv Development

On Tue, May 26, 2020 at 8:29 AM Waldek Kozaczuk <jwkoz...@gmail.com> wrote:

Hi,

I am going to be sending proper "Next Release Proposal" email later this week

Thanks for doing this!

(or next) and "Librarization/Modularization" will be a key part of it. Currently, OSv kernel provides quite a significant subset of the functionality of some standard Linux libraries listed here - https://github.com/cloudius-systems/osv#kernel-size. In reality, many applications do not need all of this functionality, but they "get it" whether they need it or not. Even Java, which used to need lots of symbols from standard libraries, has become way more modular, and with the advent of GraalVM and other AOT-type technologies, OSv kernel does not need to provide all this functionality universally to every app. Worse, if you run an app on Firecracker which needs console, non-PCI virtio-blk and virtio-net drivers only, one gets all other drivers including ones for VirtualBox, Xen, VMware, etc. This actually makes OSv barely a unikernel or at best a "fat" one. This has some real negative consequences - higher memory utilization (kernel needs to be loaded in memory), larger kernel file (makes decompression longer), and poorer security because of the fairly vast number of exported symbols (at this moment everything non-static gets exported) and finally possibly less optimized code

I agree with all of this, here are some additional thoughts and why we kept with the "fat unikernel" approach in the past.

The unused drivers (e.g., Xen when running on KVM) are very small (it would be interesting to calculate exactly how small, though). The benefit, however, is that you can create an image that will work on any VM, whether it uses Qemu or Xen or VMware. This is a nice benefit that most other unikernels don't have, and if it costs a few kilobytes to get this benefit, it's probably worth it (if the cost is megabytes, that's different, of course. we should calculate it exactly).
The unused *features* are a bigger problem. E.g., some application might not need ZFS (https://github.com/cloudius-systems/osv/issues/1009, https://github.com/cloudius-systems/osv/issues/195), some other application might need UDP but not TCP, and so on. There is a problem of diminishing returns here, though - is it worth to compile out the sin() function because your application will never use it, to save another 1K? I guess it won't hurt to have such a configuration capability - especially for the bigger features (like ZFS).
We discovered that you never really know which Linux features an application might need. If you compile out sin() because your application doesn't need it, you may discover a month later that in some obscure code path, it actually does need it. Or maybe a month later an upgrade application comes out and it does need sin(). So again, the safest and easiest approach is just to include everything. But of course this doesn't preclude having an option to decide what to include.

. On the other hand, because of this "universality", it is quite easy, comparing to other unikernels, to run an arbitrary Linux app on OSv. And no matter what we do to make OSv more modular, we should preserve that "ease" and not make it harder, at least by default, to run an app on OSv.

I agree.

So in general, what I am advocating for, is an ability (and a mechanism) to create more "stripped-down" versions of kernels tailored to the need of specific app and/or specific hypervisor OSv will run on while preserving the default universal kernel.

Please keep the two dimensions - functionality needed by the app, and hypervisor - separate, and measure their contribution separately. If we discover that supporting Xen just adds 10KB of image size and memory use, there may not be a need to disable it. But we should measure.

And also shrinking the universal kernel by extracting optional functionality from it, where it makes sense and is relatively easy to do so, as a shared library to be loaded during the boot process.

Right, we just need to watch out whether doing this will cause a performance penalty (some things, especially TLS, are slower in shared object), but in general this is a good idea.

The latter should also ideally involve the build process (compile/link) optimizations I have already proposed in my other email I sent a week ago to the group - https://groups.google.com/d/msg/osv-dev/hCYGRzytaJ4/D23S_ibNAgAJ

Unfortunately I didn't read that yet :-(

In the end, what I am proposing could be organized in the following three categories:
Tailor kernel (and really drivers) to a specific hypervisor - this could be as simple as defining more granular sets of targets in the main makefile and adding #ifdef in all relevant places and possibly using existing ./conf/*.mk - based mechanism; for starters we could define a build configuration for Firacracker and QEMU microvm machine that I believe requires the same small subset of drivers.

Again, I think it's a good idea to do something like this, but then measure what is the actual benefit (in image size and memory use) of disabling the specific hypervisor support, and if it's very small, we could keep this conditional-compilation option but not recommend it to anyone.

Extract optional functionality into shared libraries - this is more difficult than the above. One example of such functionality is ZFS and there is already an open issue - https://github.com/cloudius-systems/osv/issues/1009. Some drivers could be extracted as libraries as well but it might be more difficult to do so. The main difficulty here is that there needs to be a filesystem mounted early enough in the boot process to load such a library from - bootfs (less attractive as it is part of loader.elf/kernel.elf) or RoFS.
Create a mechanism to build a smaller kernel "tailored" to a specific app. This would require some sort of ELF analyzer tool that would identify all symbols needed by the given app and its dependencies and create a version script file defining specific set of symbols to be exported from kernel. To achieve that we could start with addressing the issue - https://github.com/cloudius-systems/osv/issues/97 - "Be more selective on symbols exported from the kernel" - that could deliver such a generic solution.
Addressing 3) could help us with another issue - https://github.com/cloudius-systems/osv/issues/821 - "Combining pre-compiled OSv kernel with pre-compiled executable". To that end, we could also consider creating a mechanism that would let us build a stripped-down version of the kernel with functionality exposed through SYSCALL instruction only and no built-in musl (except for dynamic linker function (dlopen, etc)) and libc and let one mix in original pre-built musl library which would interact with kernel through those SYSCALL calls.

You wouldn't *really* want to not expose any functions - I'm pretty sure you will want to expose malloc(), for example, otherwise the application will need to include a full C library which uses brk() and implement its own malloc(). (I see you mentioned this below). This is possible, but I think this is a step in the wrong direction. It's related to https://github.com/cloudius-systems/osv/issues/212 - the ability to run a statically-linked executable, that was already linked with the C library, so it indeed only uses syscalls.

Having the application call SYSCALL instead of functions will also ruin one of OSv's only performance advantages.

This would require probably exposing more functions as SYSCALL than we have now in linux.cc - at least brk and clone. I am not sure if that is even feasible but I think I think at least one of the unikernels does just this - Hermitux.

I am also leaning more and more toward hiding C++ library - this should help us with 821 and there is at least one case - dotnet apps - that require an incompatible version of libstdc++.so. This would impact existing internal C++ apps like cpiod and httpserver as we would have to add libstdc++.so to the manifest for any of these apps. So there are some space and memory trade-offs here.

I think hiding the C++ library should definitely be a compile-time option. As you said, there are situations where you really need to use a different C++ library, but there are also situations where you can save space by not having the library twice (inside the kernel and inside the application). By the way, when the application doesn't use C++, we can also probably save space by including in the kernel only the C++ stuff it needs for itself, and nothing more (i.e., avoid the --whole-archive option).

What do you think?

Waldek

--
You received this message because you are subscribed to the Google Groups "OSv Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to osv-dev+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/osv-dev/f587322c-97af-450c-831e-99122b93cbda%40googlegroups.com.

Reply all

Reply to author

Forward

0 new messages