What is the impact of Modules on buildsystems?

Stephen Kelly

Jan 22, 2017, 6:49:39 AM
to mod...@isocpp.org, mod...@microsoft.com

Hello,

I am interested in Modules both as a C++ developer and as a developer of a buildsystem (CMake). I realize that the standard has nothing to say about buildsystems or even compiler drivers, but the impact that the standard has on those (particularly in the case of Modules) will affect adoption of the feature, so I think it makes sense to talk about it.

It seems to me that large amounts of C++ library code would have to be rewritten to take advantage of them.

In the Microsoft implementation of Modules at least, it seems that users will have to create new ixx files, and the buildsystem will have to gain additional steps to generate ifc files:

 https://blogs.msdn.microsoft.com/vcblog/2015/12/03/c-modules-in-vs-2015-update-1/

Other implementations might make other choices, and tools like cmake will try to abstract away the differences to whatever extent is possible. For now in the rest of this email, I'll assume the current Microsoft implementation is 'the way'.

One of the problems I have with all examples of Modules that I've seen is that they use simple inline functions to demonstrate the feature. What does 'using Modules' look like for a shared library?

Imagine I have 40 classes in a library, each of which today has a .cpp and a .h file, and I want to support modules. I am required to write one ixx file for the entire library, so I copy the contents of the 40 .h files into MyLib.ixx and add a few export keywords.

 Question: Does having a single file for all declarations like this give me 'modularity'?

Now I'm unsatisfied with the duplication from the .h files, so I delete them all and I go to each of the 40 .cpp files and replace the

 #include "foo1.h"

with

 import MyLib;

So, the duplication is gone. Additionally, I now have the definitions of the classes Foo{2..40} available while I'm implementing Foo1. I might also (it is not clear to me) have the definitions from any dependent libraries available.

 Question: Is that a deliberate outcome of the choices made in designing Modules? What impact does it have on maintainability of the code?

If it is deliberate, it is a very large change. Such a large change should surely be the result of a decision made about the language, not an unmentioned/undiscussed side effect. It has a huge impact on adoption of the feature.

So, now I've updated my C++ code and it is time to consider the impact of this change on my buildsystem. Previously, my buildsystem would compile each of the 40 cpp files into object files and then link them together. Now, my buildsystem has to perform the step of generating the .ifc file from the .ixx file. Buildsystems need to know the outputs of the commands which make up the build, so I have a custom command which uses

 cl /c MyLib.ixx /module:output C:\Output\path\MyLib.ifc

which apparently produces an object file (what does it contain? Is that what I want?) and the MyLib.ifc file.


My buildsystem knows to invoke that command before attempting to compile any of my cpp files. If I have multiple libraries in the same build and there are dependencies between them, my buildsystem knows to generate the .ifc files in the correct order.
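The "correct order" requirement here amounts to a topological sort of the inter-module dependency graph. A minimal sketch (the module names are hypothetical, and this is not how CMake or any other tool actually implements it):

```python
from graphlib import TopologicalSorter  # Python 3.9+

# Hypothetical dependencies: each module's .ifc must be generated
# before anything that imports it is compiled.
deps = {
    "MyApp": {"MyLib", "MyOtherLib"},
    "MyLib": {"MyOtherLib"},
    "MyOtherLib": set(),
}

# A buildsystem would schedule the .ifc-producing commands in this order.
print(list(TopologicalSorter(deps).static_order()))
# → ['MyOtherLib', 'MyLib', 'MyApp']
```

This is exactly the ordering problem buildsystems already solve for static library link dependencies; the new part is that it now applies to compilation as well.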

 Question: What is the impact of naming the module in the .ixx file? What synchronization is expected between the 'module M;' name and the name of the .ifc file generated? What if there was no 'module M;' syntax? Is there an alternate design without name redundancy?

I'm interested in any answers to the questions I've asked here, but, if you prefer, I am also very interested in whether anyone has done their own thinking about this and has an answer to the open question:

 What is the impact of Modules on buildsystems?

Thanks,

Steve.

Klaim - Joël Lamotte

Jan 23, 2017, 6:09:22 AM
to mod...@isocpp.org, C++ Modules

On 22 January 2017 at 12:49, Stephen Kelly <stev...@gmail.com> wrote:
One of the problems I have with all examples of Modules that I've seen is that they use simple inline functions to demonstrate the feature. What does 'using Modules' look like for a shared library?

On this point in particular, my current understanding is that implementation-specific annotations like dllexport
are totally orthogonal to the module proposal's export.
This basically means that, at least for the Visual Studio compiler, if you don't do this:

module A;

export class SOME_EXPORTING_MACRO MyClass
{
}; 

your class isn't exposed by a shared library.

I might be wrong but that's what I assumed in my experimentations and it seemed to work as expected.

I assume that the default with gcc would be to export symbols when the type is exported, since even non-exported types
in non-module translation units are exported by default if not already static (unless you use the appropriate visibility flags, of course).


Joël Lamotte



Gabriel Dos Reis

Jan 30, 2017, 12:11:56 PM
to Stephen Kelly, mod...@isocpp.org, C++ Modules

Steve:

>> I realize that the standard has nothing to say about buildsystems or even compiler drivers, but the impact that the standard has on those (particularly in the case of Modules) will affect adoption of the feature, so I think it makes sense to talk about it.

 

Agreed.

 

Steve:

>> It seems to me that large amounts of C++ library code would have to be rewritten to take advantage of them.

 

That hasn’t been my experience, nor is there any factual or logical reason for that to be so.

If your existing library/program is already architecturally modular, there is no reason to have to rewrite them.  You may have

  • to add a module declaration to state in code the symbolic name of your module (which was in the background of your architecture, but you had no way to express directly)
  • to stick the keyword ‘export’ in front of the declarations that you intended to be part of the interface of your library

but that is a far cry from “have to be rewritten.”

 

Steve:

>> In the Microsoft implementation of Modules at least, it seems that users will have to create new ixx files

 

No, there is no requirement that you have to create new ixx files.  You can invoke the compiler with -module:interface command line switch.  Having an “ixx” is a convenience, not a requirement.

 

Steve

>> One of the problems I have with all examples of Modules that I've seen is that they use simple inline functions to demonstrate the feature.

 

I don’t know that to be true.  Certainly the design document (P0142R0) contains examples of non-inline functions.

If your larger suggestion is that we need more tutorial materials, I fully agree with that.

 

Steve:

>> What does 'using Modules' look like for a shared library?

 

I know you know the distinction, but I want to take the opportunity here to clarify a common confusion. C++ modules are distinct from shared libraries or DLLs.  There is no requirement that a C++ module be compiled to a single shared library or that a shared library correspond to a single module.  Furthermore, C++ export declarations are not necessarily linker-level export symbols (e.g. dllexport) for dynamic-linking purposes.  It is most likely that a linker-level symbol that is dllexported ends up being declared ‘export’ at the C++ source level; but the converse does not necessarily hold.

 

Now, if you do export a declaration both in the C++ sense (using the keyword ‘export’) and in the linker sense (e.g. __declspec(dllexport)), VC++ will take the step of automatically marking the symbol as ‘dllimport’ on the ‘import side’.  This relieves you from having to perform the confusing macro dance regarding when to put __declspec(dllexport) or __declspec(dllimport).  All of that evaporates.

 

Steve:

>> Imagine I have 40 classes in a library, which today each has a .cpp and a .h file and I want to support modules. I am required to write one ixx file for the entire library, so I copy the contents of the 40 .h files into the MyLib.ixx and I add a few export keywords.

 

I am a bit puzzled.  If your existing library has 40 classes in 40 modular headers, and that corresponds to the architecture you had in mind for your library, why would you want to smash them together into a single MyLib.ixx?  Apparently you have determined that you wanted a single module instead of 40 modules (which would mirror your existing 40 headers).  If that is the case, then why didn’t you have a super-header that included all of the 40 headers, which would have been the official interface of your library?  The question isn’t rhetorical; I’m trying to understand what has changed in your architecture to push you to do that.

 

By the way, you can also have this:

 

    // file: MyLib.ixx

    module MyLib;

    #include "foo1.h"

    // …

    #include "foo40.h"

 

If you like the precept of one class per file.  You can even go further using module aggregation if you go by the precept of one class per module.

 

Steve:

>> Question: Is that a deliberate outcome of the choices made in designing Modules? What impact does it have on maintainability of the code?

 

Yes.  It helps maintenance by centralizing the definitions of helpers (shared by module units of the same module) in one place.

 

Steve:

>> If it is deliberate, it is a very large change.

 

No, not really.  The actual issue here is the decision made to lump the contents of 40 headers together.  That isn’t required if it does not match the architecture you had in mind for your library.

 

Steve:

>> which apparently produces an object file (what does it contain? Is that what I want?) and the MyLib.ifc file.

 

First of all, a module interface unit can contain definitions, so their compiled code generally needs to go somewhere.  The traditional place is an object file.  So, yes, that is what you want. :-)

What this means is that compiling a module interface file produces at least two outputs.

 

Steve:

>> My buildsystem knows to invoke that command before attempting to compile any of my cpp files. If I have multiple libraries in the same build and there are dependencies between them, my buildsystem knows to generate the .ifc files in the correct order.

 

That is awesome!  Which version of CMake has this capability?

 

Steve:

>> Question: What is the impact of naming the module in the .ixx file? What synchronization is expected between the 'module M;' name and the name of the .ifc file generated? What if there was no 'module M;' syntax? Is there an alternate design without name redundancy?

 

There is no formal relationship between the pathname of the file containing a module interface unit and the module name itself.

“module M;” is what indicates to the compiler (as a matter of C++ semantics) that any declaration that follows is owned by the module M.  If that module declaration is missing, then there are no module semantics.

 

In the VC++ case, the name of the IFC file is settable by the user via the command-line option “-module:output”.  You can choose any name you want.  If you import a module M in your source, you need to specify an IFC file for the compiler to find the metadata for the interface of M.  You can do that via “-module:reference <pathname>”, where you specify a specific file, or via a directory search path with the option “-module:search <directory>”, in which case the compiler will try to associate the module M with a file M.ifc in that directory.  I recommend being explicit, e.g. using the “-module:reference” option.  See the sections “Consuming Modules” and “Module Search Path” at https://blogs.msdn.microsoft.com/vcblog/2015/12/03/c-modules-in-vs-2015-update-1/
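The two lookup modes can be sketched as a small resolver (an illustrative sketch of the resolution logic only, not the compiler's actual implementation; the function name is made up):

```python
from pathlib import Path

def resolve_ifc(module_name, references, search_dirs):
    """Resolve a module name to an IFC file, mimicking an explicit
    -module:reference map with a -module:search directory fallback."""
    if module_name in references:          # explicit mapping wins
        return Path(references[module_name])
    for d in search_dirs:                  # search-path fallback: M -> M.ifc
        candidate = Path(d) / f"{module_name}.ifc"
        if candidate.exists():
            return candidate
    raise FileNotFoundError(f"no IFC found for module {module_name!r}")

# An explicit reference takes priority over any search directory.
print(resolve_ifc("MyLib", {"MyLib": "out/MyLib.ifc"}, ["/some/dir"]))
```

The design trade-off is the usual one: explicit references make the build reproducible and self-documenting, while search paths are more convenient but can silently pick up a stale or wrong IFC.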

 

Steve:

>> What is the impact of Modules on buildsystems?

 

This is a good question.

The Windows engineering team upgraded its existing build system to include infrastructure for using modules.  From what I understand, it was a smooth upgrade. 

 

Thanks,

 

-- Gaby

Johan Boulé

Feb 4, 2017, 7:07:06 PM
to SG2 - Modules
Thanks for your interesting questions and answers.

It's good news that msvc's modules allow one to get rid of the dll-im/ex-port macro dance. Still no standard keyword, though.

I wish it would go beyond that though.

For example, a compiler flag to automatically dll-export module-exports, or some auto-import feature à la MinGW. I don't know if MinGW is doing an ugly hack internally, but linking directly to a DLL file without macros or these strange import libs sure is more pleasant.

On the other hand, msvc has a pragma to do autolinking, which saves headaches when you want to link e.g. to a Boost lib, which likes to encode the phase of the moon in its lib name ;)

I can't help but think that since modules do impact the boilerplate source code, and also the build systems, people who are willing to do the change might as well go on and do a few extra fixes to de-uglify the way libraries are declared and used.

I don't mean it has anything to do with modules, but it's probably a very good timeframe to start addressing a standardisation of a minimal set of pragmas/attributes to help with library declarations.

As a buildsystem writer, I also wouldn't mind if compiler flags for modules were standardized. The de facto standard C compiler flags have been standardized into POSIX, for example (an ISO standard too, IIRC).

Are libraries ever discussed inside of the ISO committee?

Daniel Krügler

Feb 5, 2017, 7:47:06 AM
to mod...@isocpp.org
The committee discusses proposals and issues, but it doesn't discuss
themes "as such" (at least not officially), because there would be no
base to vote for or against something. Either look at the paper
history (http://www.open-std.org/jtc1/sc22/wg21/docs/papers/) and ask
what the outcome of previously written concrete papers was or write a
paper that proposes something to be discussed addressing library
standardization, see

https://isocpp.org/std/submit-a-proposal

for the general way of doing so. For any further details about writing
proposals, please contact the LWG chair via the email address published
here:

http://www.open-std.org/jtc1/sc22/wg21/docs/lwg-active.html

Thanks,

- Daniel

Stephen Kelly

Feb 5, 2017, 6:55:27 PM
to Gabriel Dos Reis, mod...@isocpp.org, C++ Modules

Hi Gaby,

Thanks for the response.


On 01/30/2017 05:11 PM, Gabriel Dos Reis wrote:
Steve:

>> It seems to me that large amounts of C++ library code would have to be rewritten to take advantage of them.

 

That hasn’t been my experience, nor is there any factual or logical reason for that to be so.


That's good.


If your existing library/program is already architecturally modular, there is no reason to have to rewrite them.  You may have

  • to add a module declaration to state in code the symbolic name of your module (which was in the background of your architecture, but you had no way to express directly)

I don't know what you are referring to here.


  • to stick the keyword ‘export’ in front of the declarations that you intended to be part of the interface of your library

but that is a far cry from “have to be rewritten.”


Agreed.


 

Steve:

>> In the Microsoft implementation of Modules at least, it seems that users will have to create new ixx files

 

No, there is no requirement that you have to create new ixx files.  You can invoke the compiler with -module:interface command line switch.  Having an “ixx” is a convenience, not a requirement.


Ok. I didn't see anything about this on the blog post I linked before, and an internet search didn't show anything up either. Can you point me to more information?


Steve

>> One of the problems I have with all examples of Modules that I've seen is that they use simple inline functions to demonstrate the feature.

 

I don’t know that to be true.  Certainly the design document (P0142R0) contains examples of non-inline functions.

If your larger suggestion is that we need more tutorial materials, I fully agree with that.


Well, I would like to see some larger-scale examples. The above document does contain some more.

I've just now put a small set of libraries online at

 https://github.com/steveire/ModulesExperiments

It would help my understanding hugely if you could help port it to VS modules. I just tried for a while and couldn't figure out how to make it work with the libraries. If you port the C++ code and tell me how the existing compile commands need to change and what additional commands need to be run, I'm sure we can make it work as a more-complete example, and figure out how to change CMake to hide the details.

 

Steve:

>> What does 'using Modules' look like for a shared library?

 

Now, if you do export a declaration both in the C++ sense (using the keyword ‘export’) and in the linker sense (e.g. __declspec(dllexport)), VC++ will take the step of automatically marking the symbol as ‘dllimport’ on the ‘import side’.  This relieves you from having to perform the confusing macro dance regarding when to put __declspec(dllexport) or __declspec(dllimport).  All of that evaporates.


Right. If we try to imagine a future where there is no preprocessor at all during C++ builds, something like that is needed.


 

Steve:

>> Imagine I have 40 classes in a library, which today each has a .cpp and a .h file and I want to support modules. I am required to write one ixx file for the entire library, so I copy the contents of the 40 .h files into the MyLib.ixx and I add a few export keywords.

 

I am a bit puzzled.  If your existing library has 40 classes in 40 modular headers, and that corresponds to the architecture you had in mind for your library, why would you want to smash them together into a single MyLib.ixx?


Given that a module corresponds to one ixx file, and given that I thought it could be reasonable to make the ixx file correspond to the complete source interface of a library, it seems that smashing together would be the outcome.

However, you're saying that in my above hypothetical library, one header must be modularized into one module. You seem to be saying it would be unreasonable and against the design of modules to do otherwise. Do I understand correctly?


 Apparently you have determined that you wanted a single module instead of 40 modules (which would mirror your existing 40 headers).  If that is the case, then why didn’t you have a super-header that included all of the 40 headers, which would have been the official interface of your library?  The question isn’t rhetorical; I’m trying to understand what has changed in your architecture to push you to do that.


I view this design through the prism of some Python experience. There it is not uncommon to have multiple classes in a single module:

import datetime

d = datetime.date(2016, 1, 12)
dt = datetime.datetime(2016, 1, 12)

datetime is a module
datetime.date is a type in that module
datetime.datetime is a type in that module


Some C++ libraries today *do* have a super-header which #includes the other headers in the module. For example, Qt provides module headers such as QtCore and QtGui, but they are largely discouraged. See for example

 http://stackoverflow.com/questions/4437598/doing-qt-includes-the-right-way


 

By the way, you can also have this:

 

    // file: MyLib.ixx

    module MyLib;

    #include "foo1.h"

    // …

    #include "foo40.h"


That means continuing to rely on the preprocessor. It would be nice to have a design which is independent of the preprocessor. However, the above would seem to be equivalent to super-headers today, yes, and your suggestion of one C++ module per class would mean not having to do that.



If you like the precept of one class per file.  You can even go further using module aggregation if you go by the precept of one class per module.


If I understand you correctly, this is the way modules are designed/intended to be used, right? That is, instead of creating the super-module above, Qt would create a C++ module for each class (which currently resides in its own header file), and users of the libraries would use something like

 import QtCore.QString
 import QtCore.QObject
 import QtGui.QWindow
 import QtWidgets.QPushButton

correct? And we would additionally use the 'module aggregation' feature you mention?

In his talk, Manuel Klimek describes scalability problems encountered by having so many small modules:

 https://www.youtube.com/watch?v=dHFNpBfemDI

In the end, they kept the design of having so many modules and optimized other areas to compensate, but there may be other reasonable approaches based on the idea of reducing the absolute number of modules.

However, you recommend that the latter is not an approach which modules are really designed for, right?


 

Steve:

>> Question: Is that a deliberate outcome of the choices made in designing Modules? What impact does it have on maintainability of the code?

 

Yes.  It helps maintenance by centralizing the definitions of helpers (shared by module units of the same module) in one place.


I'm not sure what you are referring to here.


 

Steve:

>> If it is deliberate, it is a very large change.

 

No, not really.  The actual issue here is the decision made to lump the contents of 40 headers together.  That isn’t required if it does not match the architecture you had in mind for your library.


Right. That was one of my misunderstandings I suppose.


 

Steve:

>> My buildsystem knows to invoke that command before attempting to compile any of my cpp files. If I have multiple libraries in the same build and there are dependencies between them, my buildsystem knows to generate the .ifc files in the correct order.

 

That is awesome!  Which version of CMake has this capability?


I think you misunderstood me here. I'll try to clarify:

Speaking generally, CMake/Makefiles/Ninja need to know the output files which will be created by any command that is executed, in order to know which command to execute to create a particular file, and to determine the correct order to generate them. This has been part of those systems for years. I could create 'custom target' rules today with CMake to generate appropriate Makefiles/Ninja files to invoke cl.exe to generate ifc files.

Note though that this requires knowing/hardcoding the output files which will be created by the command.


 

Steve:

>> Question: What is the impact of naming the module in the .ixx file? What synchronization is expected between the 'module M;' name and the name of the .ifc file generated? What if there was no 'module M;' syntax? Is there an alternate design without name redundancy?

 

There is no formal relationship between the pathname of the file containing a module interface unit and the module name itself.

“module M;” is what indicates to the compiler (as a matter of C++ semantics) that any declaration that follows is owned by the module M.  If that module declaration is missing, then there are no module semantics.

 

In the VC++ case, the name of the IFC file is settable by the user via the command-line option “-module:output”.  You can choose any name you want.  If you import a module M in your source, you need to specify an IFC file for the compiler to find the metadata for the interface of M.  You can do that via “-module:reference <pathname>”, where you specify a specific file, or via a directory search path with the option “-module:search <directory>”, in which case the compiler will try to associate the module M with a file M.ifc in that directory.  I recommend being explicit, e.g. using the “-module:reference” option. 


This is probably made clear elsewhere, but I've already forgotten. Do I need to transitively specify all modules? For example, if I want to use QtWidgets.QPushButton, do I need to specify module files in my buildsystem for QtWidgets.QWidget and QtCore.QObject (which are dependencies by inheritance)?

If I have a library or executable and I add a new user of QtWidgets.QLabel to one of the classes, do I have to change my buildsystem to add a module reference for that?

Or would I (as an upstream Qt maintainer), in a case like that, design the Qt modules system such that users specify directories instead, because it is more convenient and requires fewer changes to the buildsystem?

That ends up kind of similar to what we have with header files today. However, the buildsystem still needs to know what files get included, so that when the input header/module files change, the translation units depending on those headers/modules get recompiled. At least today GCC can output the used header files in a Makefile-compatible format (and Ninja can read those files too). I suppose compilers would need the same feature to output a list of used modules. Just something to think about as part of the impact of this.
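A buildsystem consuming that Makefile-format dependency output parses it roughly like this (a simplified sketch: it handles a single rule with backslash continuations, and ignores escaped spaces and multiple rules, which real depfiles can contain):

```python
def parse_depfile(text):
    """Parse a Makefile-style dependency file, as emitted by
    'gcc -MD', into (target, [prerequisites])."""
    # Join backslash-continued lines, then split target from prerequisites.
    joined = text.replace("\\\n", " ")
    target, _, deps = joined.partition(":")
    return target.strip(), deps.split()

example = "foo1.o: foo1.cpp foo1.h \\\n common.h\n"
print(parse_depfile(example))
# → ('foo1.o', ['foo1.cpp', 'foo1.h', 'common.h'])
```

If compilers emitted consumed IFC files in the same format, tools like Ninja could reuse their existing depfile machinery for module-level rebuilds.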

Also, one of the things Manuel Klimek mentioned in his talk is that they hit command-line length limits in the buildsystem due to specifying all of the module files.


Steve:

>> What is the impact of Modules on buildsystems?

 

This is a good question.

The Windows engineering team upgraded its existing build system to include infrastructure for using modules.  From what I understand, it was a smooth upgrade. 


What I would find interesting is whether they need to parse the 'module modulename;' content from the source files, or whether they always need to specify the ifc output file name in the buildsystem when processing the ixx file (and, if that is the case, what the value of the 'module modulename;' content is).

[Disclosure: I'm a Microsoft employee, but I'm not working on anything related to this. This is a side-interest in my capacity as a CMake maintainer and Qt maintainer and as a C++ developer generally]

Thanks,

Steve.

Manuel Klimek

Feb 6, 2017, 10:07:23 AM
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
I thought my message was that the problems with scalability are modules that are too large :)

Note also that my talk focused on distributed build systems, which in the non-modules world take advantage of the full denormalization of C++ builds via textual includes.

I've talked with folks from Apple, who have experience with C++-ish builds and modules (via Obj-C modules), and IIRC the problem was that they have to live with very large modules (due to Swift/Obj-C interoperability), which led to a large number of rebuilds that previously were not needed.

Also note that since that talk we've brought build times down significantly by automatically pruning unneeded modules (as exposed by the .d file of the modules build). Something like that would need to be implemented by CMake modules support for extra speed-ups for rebuilds.
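The pruning idea can be sketched as a filter over the explicit module references (hypothetical names; just an illustration of keeping only the references a previous compilation's .d file actually recorded):

```python
def prune_module_refs(candidate_refs, depfile_entries):
    """Keep only the module references whose files the compiler
    actually read, as recorded in a depfile from a previous build."""
    used = set(depfile_entries)
    return {name: path for name, path in candidate_refs.items() if path in used}

candidates = {"A": "out/A.ifc", "B": "out/B.ifc", "C": "out/C.ifc"}
recorded = ["main.cpp", "out/A.ifc", "out/C.ifc"]  # from the .d file
print(prune_module_refs(candidates, recorded))
# → {'A': 'out/A.ifc', 'C': 'out/C.ifc'}
```

The payoff is that editing module B no longer triggers a rebuild of a translation unit that was handed B's IFC on the command line but never actually imported it.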
 

Steve:

>> Question: Is that a deliberate outcome of the choices made in designing Modules? What impact does it have on maintainability of the code?

 

Yes.  It helps maintenance by centralizing the definitions of helpers (shared by module units of the same module) in one place.


I'm not sure what you are referring to here.


 

Steve:

>> If it is deliberate, it is a very large change.

 

No, not really.  The actual issue here is the decision made to lump the contents of 40 headers together.  That isn’t required if it does not match the architecture you had in mind for your library.


Right. That was one of my misunderstandings I suppose.


 

Steve:

>> My buildsystem knows to invoke that command before attempting to compile any of my cpp files. If I have multiple libraries in the same build and there are dependencies between them, my buildsystem knows to generate the .ifc files in the correct order.

 

That is awesome!  Which version of CMake has this capability?


I think you misunderstood me here. I'll try to clarify:

Speaking generally, CMake/Makefiles/Ninja need to know the output files which will be created by any command that is executed, in order to know which command to execute to create a particular file, and to determine the correct order to generate them. This has been part of those systems for years. I could create 'custom target' rules today with CMake to generate appropriate Makefiles/Ninja files to invoke cl.exe to generate ifc files.

Note though that this requires knowing/hardcoding the output files which will be created by the command.

Don't we do the exact same thing for object files today?
Similarly to "C++ source file generates object file", I'd expect we'll have "C++ module file generates module output file".
  

Steve:

>> Question: What is the impact of naming the module in the .ixx file? What synchronization is expected between the 'module M;' name and the name of the .ifc file generated? What if there was no 'module M;' syntax? Is there an alternate design without name redundancy?

 

There is no formal relationship between the pathname of the file containing a module interface unit and the module name itself.

“module M;” is what indicates to the compiler (as a matter of C++ semantics) that any declaration that follows is owned by the module M.  If that module declaration is missing, then there are no module semantics.

 

In the VC++ case, the name of the IFC file is settable by the user via the command-line option “-module:output”.  You can choose any name you want.  If you import a module M in your source, you need to specify an IFC file for the compiler to find the metadata for the interface of M.  You can do that via “-module:reference <pathname>”, where you specify a specific file, or via a directory search path with the option “-module:search <directory>”, in which case the compiler will try to associate the module M with a file M.ifc in that directory.  I recommend being explicit, e.g. using the “-module:reference” option. 


This is probably made clear elsewhere, but I've already forgotten. Do I need to transitively specify all modules? For example, if I want to use QtWidgets.QPushButton, do I need to specify module files in my buildsystem for QtWidgets.QWidget and QtCore.QObject (which are dependencies by inheritance)?
 

If I have a library or executable and I add a new user of QtWidgets.QLabel to one of the classes, do I have to change my buildsystem to add a module reference for that?

Or would I (as an upstream Qt maintainer), in a case like that, design the Qt modules system such that users specify directories instead, because it is more convenient and requires fewer changes to the buildsystem?

That ends up kind of similar to what we have with header files today. However, the buildsystem still needs to know what files get included, so that when the input header/module files change, the translation units depending on those headers/modules get recompiled. At least today GCC can output the used header files in a Makefile-compatible format (and Ninja can read those files too). I suppose compilers would need the same feature to output a list of used modules. Just something to think about as part of the impact of this.

clang for example already does that.
 
Also, one of the things Manuel Klimek mentioned in his talk is that they hit command-line length limits in the buildsystem due to specifying all of the module files.

That is a very clang specific issue, though, and really unrelated to standardization issues :)
 

Steve:

>> What is the impact of Modules on buildsystems?

 

This is a good question.

The Windows engineering team upgraded its existing build system to include infrastructure for using modules.  From what I understand, it was a smooth upgrade. 


What I would find interesting is whether they need to parse the 'module modulename;' content from the source files, or whether they always need to specify the ifc output file name in the buildsystem when processing the ixx file (and, if that is the case, what the value of the 'module modulename;' content is).

That is all very MS-specific terminology. Generally, I'd expect that build systems could, in a modules world, parse the C++ code that specifies the module and its dependencies (for example by using the compiler with a switch that makes it run in a reduced mode, similar to preprocessing, but without requiring transitive inputs) and build the dependency graph from that.
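As a rough illustration of what such a reduced-mode scan would have to extract (a sketch only: a real implementation must cope with comments, string literals, and preprocessor conditionals, which a bare regex cannot):

```python
import re

# Extract the module name and its imports from a module source file:
# the minimum a buildsystem needs to construct the dependency graph.
MODULE_RE = re.compile(r"^\s*(?:export\s+)?module\s+([\w.]+)\s*;", re.M)
IMPORT_RE = re.compile(r"^\s*import\s+([\w.]+)\s*;", re.M)

def scan(source):
    m = MODULE_RE.search(source)
    return (m.group(1) if m else None), IMPORT_RE.findall(source)

src = "module MyLib;\nimport MyOtherLib;\nimport Std.Core;\n"
print(scan(src))
# → ('MyLib', ['MyOtherLib', 'Std.Core'])
```

Running such a scan per source file, then feeding the resulting edges into the ordinary build graph, is one plausible way a tool could discover module dependencies without hardcoding them.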

Alternatively, at least for the foreseeable future, I'd expect us to still specify library level dependencies, and have the build system use those to trigger the module builds; in that case, your source code would need to match what's written in your code, but that's already the case today.

Finally, I'm not buying into the "modules only" world-view yet - while I think it has value to think about what will be in 10-15 years, at least until then we'll need build systems supporting a mixture of modules and textual inclusion, mainly due to a large OS ecosystem of libraries that have backwards-compatibility issues.
 
--
You received this message because you are subscribed to the Google Groups "SG2 - Modules" group.
To unsubscribe from this group and stop receiving emails from it, send an email to modules+u...@isocpp.org.
To post to this group, send email to mod...@isocpp.org.
Visit this group at https://groups.google.com/a/isocpp.org/group/modules/.

Johan Boulé

Feb 6, 2017, 2:43:33 PM
to SG2 - Modules
About the ability to add a __declspec(dllexport) attribute to an export clause, on second thought, this still requires replacing the attribute with a macro, unless the compiler accepts and ignores the attribute when building in non-pic mode (i.e with the intent of making a static lib). Generally, we can't make any assumption whether the user will build the code as a static or a shared lib. Both have to be supported through buildsystem switches.
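In code, that is the usual macro dance, driven from the buildsystem with something like -DMYLIB_BUILD or -DMYLIB_STATIC (the names here are my invention; every library spells them differently):

```cpp
// mylib_export.h -- sketch of the conventional export macro.
// MYLIB_STATIC: defined by the buildsystem for static builds (macro is empty).
// MYLIB_BUILD:  defined only while compiling the shared library itself.
#if defined(MYLIB_STATIC)
#  define MYLIB_API
#elif defined(_WIN32)
#  if defined(MYLIB_BUILD)
#    define MYLIB_API __declspec(dllexport)
#  else
#    define MYLIB_API __declspec(dllimport)
#  endif
#else
#  define MYLIB_API __attribute__((visibility("default")))
#endif

class MYLIB_API Widget {
public:
    int value() const { return 42; }
};
```

An export clause that could carry the attribute directly would remove the import side of this, but the static-vs-shared decision would still have to come from outside the source, as noted above.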

Regards

Gabriel Dos Reis

Feb 6, 2017, 2:49:56 PM
to mod...@isocpp.org
We should assume that compilers attempt to make sense of the switches they are passed. Some of these macros are attempts to fix compiler deficiencies; the complexity should be pushed back to where it belongs.

Stephen Kelly

Feb 6, 2017, 4:31:38 PM
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On 02/06/2017 03:07 PM, 'Manuel Klimek' via SG2 - Modules wrote:
On Mon, Feb 6, 2017 at 12:55 AM Stephen Kelly <stev...@gmail.com> wrote:

In his talk, Manuel Klimek describes scalability problems encountered by having so many small modules:

 https://www.youtube.com/watch?v=dHFNpBfemDI

In the end, they kept the design of having so many modules and optimized other areas to compensate, but there may be other reasonable approaches based on the idea of reducing the absolute number of modules.

However, you recommend that the latter is not an approach which modules are really designed for, right?

I thought my message was that the problems with scalability are modules that are too large :)

I suppose I misremembered, sorry. I recall you mentioned several problems with different approaches, but it is not possible to search/skim the information in the video. I thought there was some problem of having to repeatedly cycle through all module files many times. Thanks for clarifying. Could you provide some more-raw information?

Nevertheless, I'm sure when people get their hands on this they will want to take many different approaches to solve the problems that occur, including along the spectrum from few large modules to many small ones.


Also note that since that talk we've brought build times down significantly by automatically pruning unneeded modules (as exposed by the .d file of the modules build). Something like that would need to be implemented by CMake modules support for extra speed-ups for rebuilds.

I think you did mention that in the talk already actually.


Note though that this requires knowing/hardcoding the output files which will be created by the command.

Don't we do the exact same thing for object files today?

I don't know. Is it? Do our source files contain the name of the object file to create? Or perhaps: does the source file contain information about how the linker should refer to the object file? I really don't know here. I'm no linker expert.

It seems odd/redundant/potentially problematic to me that the module name is specified in the source. I'm trying to convince someone to provide some rationale for that so that I can understand (and preferably help add modules to the git repo I posted so that we can experiment).


Similarly to "C++ source file generates object file", I'd expect we'll have "C++ module file generates module output file".

... "which must be referred to by tokens specified in the C++ module file". That is the part that I'm not aware of the precedent or rationale for, and I'm not sure if it's relevant to buildsystems. Maybe it's not relevant for them. I don't have the information.

 
At least today GCC can output the used header files in a Makefile compatible format (and Ninja can read those files too). I suppose they would need the same feature to output a list of used modules. Just something to think about as part of the impact of this.

clang for example already does that.

Interesting. As far as I was aware, clang doesn't support the 'import' syntax at all yet. Is the feature of generating the list of imported modules documented on

 https://clang.llvm.org/docs/Modules.html

or elsewhere yet?


 
Also, one of the things Manuel Klimek mentioned in his talk is that they hit command line length limits in the buildsystem due to specifying all of the module files.

That is a very clang specific issue, though, and really unrelated to standardization issues :)

We're not discussing standardization anyway (there's nothing standardized about buildsystems, compiler switches, or file name conventions).

If, as Gaby recommends, all used modules should be specified on the compile command line with MSVC, why do you call it a clang issue? Am I missing something here?

 

Steve:

>> What is the impact of Modules on buildsystems?

 

This is a good question.

The Windows engineering team upgraded its existing build system to include infrastructure for using modules.  From what I understand, it was a smooth upgrade. 


What I would find interesting is whether they need to parse the 'module modulename;' content from the source files, or whether they always need to specify the ifc output file name in the buildsystem when processing the ixx file (and if that is the case - what the value of the 'module modulename;' content is).

That is all very MS specific terminology.

What is? I don't know what part of the quote you are referring to. Parts of it are from the Modules TS, and I don't think file extensions are so relevant? (I'm making a guess about what your remark means)


Generally, I'd expect that build systems would, in a modules world, be able to parse the C++ code that specifies the module and its dependencies (for example, by using the compiler with a switch that runs it in a reduced mode, similar to preprocessing, but without requiring transitive inputs) and build the dependency graph from that.

This is the kind of input I was looking for with this thread. This is why I tried to ask a very open question, so that people like yourself can share the experience you have with modules, and your conclusions about how the buildsystem and compiler drivers are affected.


Alternatively, at least for the foreseeable future, I'd expect us to still specify library level dependencies, and have the build system use those to trigger the module builds;

Can you give an example?


in that case, your source code would need to match what's written in your code, but that's already the case today.

I'm having trouble parsing "your source code would need to match what's written in your code". What do you mean?

Thanks,

Steve.

Richard Smith

Feb 6, 2017, 4:45:31 PM
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On 6 February 2017 at 13:31, Stephen Kelly <stev...@gmail.com> wrote:
On 02/06/2017 03:07 PM, 'Manuel Klimek' via SG2 - Modules wrote:
On Mon, Feb 6, 2017 at 12:55 AM Stephen Kelly <stev...@gmail.com> wrote:

In his talk, Manuel Klimek describes scalability problems encountered by having so many small modules:

 https://www.youtube.com/watch?v=dHFNpBfemDI

In the end, they kept the design of having so many modules and optimized other areas to compensate, but there may be other reasonable approaches based on the idea of reducing the absolute number of modules.

However, you recommend that the latter is not an approach which modules are really designed for, right?

I thought my message was that the problems with scalability are modules that are too large :)

I suppose I misremembered, sorry. I recall you mentioned several problems with different approaches, but it is not possible to search/skim the information in the video. I thought there was some problem of having to repeatedly cycle through all module files many times. Thanks for clarifying. Could you provide some more-raw information?

Nevertheless, I'm sure when people get their hands on this they will want to take many different approaches to solve the problems that occur, including along the spectrum from few large modules to many small ones.

This doesn't seem fundamentally different from people wanting anything on the spectrum from a large number of small header files to a small number of huge header files today. And as with that choice, one of the things they may want to consider is the effect on their build performance when different parts of their codebase change (larger headers or modules will typically mean that changes to that interface cause more downstream targets to recompile).
Also note that since that talk we've brought build times down significantly by automatically pruning unneeded modules (as exposed by the .d file of the modules build). Something like that would need to be implemented by CMake modules support for extra speed-ups for rebuilds.

I think you did mention that in the talk already actually.

Note though that this requires knowing/hardcoding the output files which will be created by the command.

Don't we do the exact same thing for object files today?

I don't know. Is it? Do our source files contain the name of the object file to create? Or perhaps: does the source file contain information about how the linker should refer to the object file? I really don't know here. I'm no linker expert.

It seems odd/redundant/potentially problematic to me that the module name is specified in the source. I'm trying to convince someone to provide some rationale for that so that I can understand (and preferably help add modules to the git repo I posted so that we can experiment).

Are you assuming that the name of the module output file would need to be in some way related to the name of the module in the source? I don't see any reason to assume that. If you want to build module Foo in bar.cppm to baz.pcm, I would expect that to work, just as if you wanted to build a definition of class Foo in bar.cpp to baz.o (although I would question the wisdom of some of those choices).
Similarly to "C++ source file generates object file", I'd expect we'll have "C++ module file generates module output file".

... "which must be referred to by tokens specified in the C++ module file". That is the part that I'm not aware of the precedent or rationale for, and I'm not sure if it's relevant to buildsystems. Maybe it's not relevant for them. I don't have the information.
 
At least today GCC can output the used header files in a Makefile compatible format (and Ninja can read those files too). I suppose they would need the same feature to output a list of used modules. Just something to think about as part of the impact of this.

clang for example already does that.

Interesting. As far as I was aware, clang doesn't support the 'import' syntax at all yet.

It does, under -fmodules-ts, but using a #include to import a module via a module map also writes the module dependency to the .d file.
 
Is the feature of generating the list of imported modules documented on

 https://clang.llvm.org/docs/Modules.html

or elsewhere yet?

It's not exactly a new feature, it's just that the dependency file generator is modules-aware.
 

Also, one of the things Manuel Klimek mentioned in his talk is that they hit command line length limits in the buildsystem due to specifying all of the module files.

That is a very clang specific issue, though, and really unrelated to standardization issues :)

We're not discussing standardization anyway (there's nothing standardized about buildsystems, compiler switches, or file name conventions).

If, as Gaby recommends, all used modules should be specified on the compile command line with MSVC, why do you call it a clang issue? Am I missing something here?

Some implementations (clang included) support the ability to pass command-line arguments via a "response file" instead of as actual command-line arguments. Command-line argument length limits vary between operating systems. An implementation could support a way for a module file to suggest the location of another module file if it's not explicitly specified (and clang supports such a mechanism).

Steve:

>> What is the impact of Modules on buildsystems?

 

This is a good question.

The Windows engineering team upgraded its existing build system to include infrastructure for using modules.  From what I understand, it was a smooth upgrade. 


What I would find interesting is whether they need to parse the 'module modulename;' content from the source files, or whether they always need to specify the ifc output file name in the buildsystem when processing the ixx file (and if that is the case - what the value of the 'module modulename;' content is).

That is all very MS specific terminology.

What is? I don't know what part of the quote you are referring to. Parts of it are from the Modules TS, and I don't think file extensions are so relevant? (I'm making a guess about what your remark means)

Generally, I'd expect that build systems would, in a modules world, be able to parse the C++ code that specifies the module and its dependencies (for example, by using the compiler with a switch that runs it in a reduced mode, similar to preprocessing, but without requiring transitive inputs) and build the dependency graph from that.

This is the kind of input I was looking for with this thread. This is why I tried to ask a very open question, so that people like yourself can share the experience you have with modules, and your conclusions about how the buildsystem and compiler drivers are affected.

Alternatively, at least for the foreseeable future, I'd expect us to still specify library level dependencies, and have the build system use those to trigger the module builds;

Can you give an example?

in that case, your source code would need to match what's written in your code, but that's already the case today.

I'm having trouble parsing "your source code would need to match what's written in your code". What do you mean?

I would guess that's meant to say "your build files would need to match what's written in your code".

Stephen Kelly

Feb 6, 2017, 6:04:19 PM
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On 02/06/2017 09:45 PM, 'Richard Smith' via SG2 - Modules wrote:
On 6 February 2017 at 13:31, Stephen Kelly <stev...@gmail.com> wrote:
On 02/06/2017 03:07 PM, 'Manuel Klimek' via SG2 - Modules wrote:
On Mon, Feb 6, 2017 at 12:55 AM Stephen Kelly <stev...@gmail.com> wrote:

In his talk, Manuel Klimek describes scalability problems encountered by having so many small modules:

 https://www.youtube.com/watch?v=dHFNpBfemDI

In the end, they kept the design of having so many modules and optimized other areas to compensate, but there may be other reasonable approaches based on the idea of reducing the absolute number of modules.

However, you recommend that the latter is not an approach which modules are really designed for, right?

I thought my message was that the problems with scalability are modules that are too large :)

I suppose I misremembered, sorry. I recall you mentioned several problems with different approaches, but it is not possible to search/skim the information in the video. I thought there was some problem of having to repeatedly cycle through all module files many times. Thanks for clarifying. Could you provide some more-raw information?

Nevertheless, I'm sure when people get their hands on this they will want to take many different approaches to solve the problems that occur, including along the spectrum from few large modules to many small ones.

This doesn't seem fundamentally different from people wanting anything on the spectrum from a large number of small header files to a small number of huge header files today

I suppose. This is a topic because it is not clear to me what approach Qt should take to modules, and what impact that has on the buildsystems of Qt users, for example. I was more specific about that in a previous email, including the requirement to make a change in the buildsystem every time I add 'import QPushButton' to my application's cpp file. See my previous email for more.

Manuel mentioned in his talk that Google requires specifying all headers in use anyway, so I guess it's no big change for you, but many projects don't do that. If that is needed, I wonder if it would harm adoption. Manuel mentioned that perhaps compilers could learn to parse the import statements and present that information to the buildsystem, but that also requires implementation by buildsystem implementors, which becomes the next adoption bottleneck.

Really, this whole thread for me is about trying to find out what impediments there are to C++ modules being successful, and what assumptions are being made about how tooling will make them successful. I know modules have been made to work for some large codebases, but I don't know how that will generalize to the rest of the world.

The more I think about it, the more I think one module per class wouldn't work well for Qt. To prevent requiring the user to specify the path to all module files that they use in their buildsystem, we would probably make Qt simply put all module files from, say, all classes in QtWidgets on your compile line if you use QtWidgets. Then you could 'import QPushButton' without changing the buildsystem, but there would be no advantage to having one C++ module per class. One C++ module per Qt library would be the only thing to make sense instead I think.

Perhaps build performance is the thing that would cause it to swing the other way, I don't know.

Then the question remains how to specify such a C++ module for a Qt library in a maintainable way.

A separate issue is that, as compiled modules will probably not be distributable, the buildsystem will have to compile all the module files it depends on. I don't have any sense of how long it would take to compile module files for all classes in QtCore QtGui and QtWidgets (or QtCore QtGui QtQml and QtQuick) just to build your first hello world Qt program.


And as with that choice, one of the things they may want to consider is the effect on their build performance when different parts of their codebase change (larger headers or modules will typically mean that changes to that interface cause more downstream targets to recompile).
Also note that since that talk we've brought build times down significantly by automatically pruning unneeded modules (as exposed by the .d file of the modules build). Something like that would need to be implemented by CMake modules support for extra speed-ups for rebuilds.

I think you did mention that in the talk already actually.

Note though that this requires knowing/hardcoding the output files which will be created by the command.

Don't we do the exact same thing for object files today?

I don't know. Is it? Do our source files contain the name of the object file to create? Or perhaps: does the source file contain information about how the linker should refer to the object file? I really don't know here. I'm no linker expert.

It seems odd/redundant/potentially problematic to me that the module name is specified in the source. I'm trying to convince someone to provide some rationale for that so that I can understand (and preferably help add modules to the git repo I posted so that we can experiment).

Are you assuming that the name of the module output file would need to be in some way related to the name of the module in the source? I don't see any reason to assume that. If you want to build module Foo in bar.cppm to baz.pcm, I would expect that to work, just as if you wanted to build a definition of class Foo in bar.cpp to baz.o (although I would question the wisdom of some of those choices).

Ah, ok - that's the analogy.

That's why all the compiled module files need to be specified on the compile line of any translation unit using them. I'm assuming all files in the transitive closure need to be specified. I suppose that transitive closure would be computed by the buildsystem and mostly hidden from the user (as the particular names of object files generally are today). At least in the blog at

 https://blogs.msdn.microsoft.com/vcblog/2015/12/03/c-modules-in-vs-2015-update-1/

there is an example of using a search directory and not specifying the module file name on the compile line. The 'import M' in bar.cpp still works. That can only work if either some correlation from modulename to filename is assumed by the compiler, or the compiler simply pre-loads all modules in that directory regardless of filename and makes the modulenames available.

If that could be expected to work, CMake could compile module files into some internal directory within the build directory and simply pass the path to it to all compiles (CMake would still need to know somehow which module files to compile in that way though).
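A trivial model of the name-derived lookup that would make the search-directory approach work (purely illustrative; the blog post doesn't document MSVC's actual probing rules):

```shell
# The compiler sees 'import M;', derives a candidate file name, and probes
# each search directory in order until it finds the compiled module.
cd "$(mktemp -d)"
mkdir -p other build/ifc
: > build/ifc/M.ifc          # produced earlier by compiling M's interface
name=M
for dir in other build/ifc; do
  if [ -f "$dir/$name.ifc" ]; then
    echo "resolved import $name -> $dir/$name.ifc"
    break
  fi
done
# prints: resolved import M -> build/ifc/M.ifc
```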

Also, one of the things Manuel Klimek mentioned in his talk is that they hit command line length limits in the buildsystem due to specifying all of the module files.

That is a very clang specific issue, though, and really unrelated to standardization issues :)
If, as Gaby recommends, all used modules should be specified on the compile command line with MSVC, why do you call it a clang issue? Am I missing something here?

Some implementations (clang included) support the ability to pass command-line arguments via a "response file" instead of as actual command-line arguments. Command-line argument length limits vary between operating systems. An implementation could support a way for a module file to suggest the location of another module file if it's not explicitly specified (and clang supports such a mechanism).

I still don't see anything 'very clang-specific' about this, so I guess I'm still missing something :). Doesn't seem important though.

Thanks,

Steve.

Manuel Klimek

Feb 7, 2017, 4:33:14 AM
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On Tue, Feb 7, 2017 at 12:04 AM Stephen Kelly <stev...@gmail.com> wrote:
On 02/06/2017 09:45 PM, 'Richard Smith' via SG2 - Modules wrote:
On 6 February 2017 at 13:31, Stephen Kelly <stev...@gmail.com> wrote:
On 02/06/2017 03:07 PM, 'Manuel Klimek' via SG2 - Modules wrote:
On Mon, Feb 6, 2017 at 12:55 AM Stephen Kelly <stev...@gmail.com> wrote:

In his talk, Manuel Klimek describes scalability problems encountered by having so many small modules:

 https://www.youtube.com/watch?v=dHFNpBfemDI

In the end, they kept the design of having so many modules and optimized other areas to compensate, but there may be other reasonable approaches based on the idea of reducing the absolute number of modules.

However, you recommend that the latter is not an approach which modules are really designed for, right?

I thought my message was that the problems with scalability are modules that are too large :)

I suppose I misremembered, sorry. I recall you mentioned several problems with different approaches, but it is not possible to search/skim the information in the video. I thought there was some problem of having to repeatedly cycle through all module files many times. Thanks for clarifying. Could you provide some more-raw information?

Nevertheless, I'm sure when people get their hands on this they will want to take many different approaches to solve the problems that occur, including along the spectrum from few large modules to many small ones.

This doesn't seem fundamentally different from people wanting anything on the spectrum from a large number of small header files to a small number of huge header files today

I suppose. This is a topic because it is not clear to me what approach Qt should take to modules, and what impact that has on the buildsystems of Qt users, for example. I was more specific about that in a previous email, including the requirement to make a change in the buildsystem every time I add 'import QPushButton' to my application's cpp file. See my previous email for more.

Manuel mentioned in his talk that Google requires specifying all headers in use anyway, so I guess it's no big change for you, but many projects don't do that. If that is needed, I wonder if it would harm adoption. Manuel mentioned that perhaps compilers could learn to parse the import statements and present that information to the buildsystem, but that also requires implementation by buildsystem implementors, which becomes the next adoption bottleneck.

Really, this whole thread for me is about trying to find out what impediments there are to C++ modules being successful, and what assumptions are being made about how tooling will make them successful. I know modules have been made to work for some large codebases, but I don't know how that will generalize to the rest of the world.

The more I think about it, the more I think one module per class wouldn't work well for Qt. To prevent requiring the user to specify the path to all module files that they use in their buildsystem, we would probably make Qt simply put all module files from, say, all classes in QtWidgets on your compile line if you use QtWidgets. Then you could 'import QPushButton' without changing the buildsystem, but there would be no advantage to having one C++ module per class. One C++ module per Qt library would be the only thing to make sense instead I think.

Perhaps build performance is the thing that would cause it to swing the other way, I don't know.

Then the question remains how to specify such a C++ module for a Qt library in a maintainable way.

A separate issue is that, as compiled modules will probably not be distributable, the buildsystem will have to compile all the module files it depends on. I don't have any sense of how long it would take to compile module files for all classes in QtCore QtGui and QtWidgets (or QtCore QtGui QtQml and QtQuick) just to build your first hello world Qt program.

I think it's too early to really say how Qt should be laid out. It'll depend a lot on how specific compilers will need modules to be presented.

For example, we went back-and-forth multiple times between specifying all transitive modules on the command line (less non-parallelizable work for the build system) vs only handing in top-level modules (that is, not specifying modules on the command line that are in the transitive dependencies of a different module in the set of transitively used modules).
Currently, we are back to only specifying top-level modules, as that means clang can figure out which modules are actually used and writes only the used modules to the .d file, and we can use that information to prune the dependency graph of builds depending on that module (yea, it's somewhat complex, and we actually do an include-scanning step where we try to figure out which headers, and thus modules, are reachable at all from a source file).

A different consideration is that, for large continuously integrated code bases, the rebuilds triggered by a header change in a core library are much more of a problem than for a user of Qt, for example, who will probably stick with a single version for a longer while.

 
And as with that choice, one of the things they may want to consider is the effect on their build performance when different parts of their codebase change (larger headers or modules will typically mean that changes to that interface cause more downstream targets to recompile).
Also note that since that talk we've brought build times down significantly by automatically pruning unneeded modules (as exposed by the .d file of the modules build). Something like that would need to be implemented by CMake modules support for extra speed-ups for rebuilds.

I think you did mention that in the talk already actually.

Note though that this requires knowing/hardcoding the output files which will be created by the command.

Don't we do the exact same thing for object files today?

I don't know. Is it? Do our source files contain the name of the object file to create? Or perhaps: does the source file contain information about how the linker should refer to the object file? I really don't know here. I'm no linker expert.

It seems odd/redundant/potentially problematic to me that the module name is specified in the source. I'm trying to convince someone to provide some rationale for that so that I can understand (and preferably help add modules to the git repo I posted so that we can experiment).

Are you assuming that the name of the module output file would need to be in some way related to the name of the module in the source? I don't see any reason to assume that. If you want to build module Foo in bar.cppm to baz.pcm, I would expect that to work, just as if you wanted to build a definition of class Foo in bar.cpp to baz.o (although I would question the wisdom of some of those choices).

Ah, ok - that's the analogy.

That's why all the compiled module files need to be specified on the compile line of any translation unit using them. I'm assuming all files in the transitive closure need to be specified. I suppose that transitive closure would be computed by the buildsystem and mostly hidden from the user (as the particular names of object files generally are today). At least in the blog at

 https://blogs.msdn.microsoft.com/vcblog/2015/12/03/c-modules-in-vs-2015-update-1/

there is an example of using a search directory and not specifying the module file name on the compile line. The 'import M' in bar.cpp still works. That can only work if either some correlation from modulename to filename is assumed by the compiler, or the compiler simply pre-loads all modules in that directory regardless of filename and makes the modulenames available.

If that could be expected to work, CMake could compile module files into some internal directory within the build directory and simply pass the path to it to all compiles (CMake would still need to know somehow which module files to compile in that way though).

I think simple modules support in CMake could look like this:
If you add a C++ library, you can specify interface h1.hm h2.hm h3.hm (no idea what module files will be named :), which would build a module out of these headers / C++ module definitions.
The module would be passed to all libraries in the reverse transitive dependency closure.
For clang, for example, we could also currently pass (for backwards compatibility) h.cppmap in the interface, which would only be needed for the transitional period, and compile a module out of plain C++ headers.

That way, you could get incremental compilation benefits early, and we could get real world experience on modules builds for projects that are happy to be early testers.
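Spelled out as CMake code, that might look roughly like this (entirely hypothetical syntax; no such signature existed in CMake at the time of writing, and MODULE_INTERFACE is an invented keyword):

```cmake
# Invented syntax: declare which sources form the module interface of a target.
add_library(MyLib foo1.cpp foo2.cpp)
target_sources(MyLib PUBLIC MODULE_INTERFACE h1.hm h2.hm h3.hm)

# CMake would compile the interface files to the compiler's module format and
# pass the results to everything in the reverse transitive dependency closure:
add_executable(App main.cpp)
target_link_libraries(App PRIVATE MyLib)   # App's compiles see MyLib's modules
```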
 

Also, one of the things Manuel Klimek mentioned in his talk is that they hit command line length limits in the buildsystem due to specifying all of the module files.

That is a very clang specific issue, though, and really unrelated to standardization issues :)
If, as Gaby recommends, all used modules should be specified on the compile command line with MSVC, why do you call it a clang issue? Am I missing something here?

Some implementations (clang included) support the ability to pass command-line arguments via a "response file" instead of as actual command-line arguments. Command-line argument length limits vary between operating systems. An implementation could support a way for a module file to suggest the location of another module file if it's not explicitly specified (and clang supports such a mechanism).

I still don't see anything 'very clang-specific' about this, so I guess I'm still missing something :). Doesn't seem important though.

It's clang-specific whether you'll need to pass all transitive modules.
It's very system specific what the max command line length is :)

Stephen Kelly

unread,
Feb 7, 2017, 7:16:00 PM2/7/17
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On 02/07/2017 09:32 AM, 'Manuel Klimek' via SG2 - Modules wrote:
On Tue, Feb 7, 2017 at 12:04 AM Stephen Kelly <stev...@gmail.com> wrote:
A separate issue is that, as compiled modules will probably not be distributable, the buildsystem will have to compile all the module files it depends on. I don't have any sense of how long it would take to compile module files for all classes in QtCore QtGui and QtWidgets (or QtCore QtGui QtQml and QtQuick) just to build your first hello world Qt program.

I think it's too early to really say how Qt should be laid out. It'll depend a lot on how specific compilers will need modules to be presented.

What the interface to modules is, and how libraries such as (but not limited to) Qt, their users, and buildsystems will make use of them, seem quite interdependent to me.

It is also interdependent with buildsystems. The Ninja buildsystem cannot currently build Fortran code because the way Fortran modules are specified is not compatible with how Ninja currently works:

 https://groups.google.com/forum/#!topic/ninja-build/tPOcu5EWXio

I don't know how fixable that is, but I think that's an important conversation, and one which is inseparable from the design of modules.
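To make the 'scanning mode' idea concrete, here is a deliberately naive Python sketch of extracting the provided and imported module names from a source file, so a build system can discover dependency edges before full compilation (the capability Ninja lacked for Fortran). A real scanner must run the preprocessor and skip comments and string literals; this sketch does neither, and the exact module syntax is that of the current proposal, which may still change.

```python
import re

# 'export module M;' (or 'module M;') names the module a file provides;
# 'import X;' names a module it consumes.
_PROVIDES = re.compile(r'^\s*(?:export\s+)?module\s+([\w.:]+)\s*;', re.M)
_IMPORTS = re.compile(r'^\s*import\s+([\w.:]+)\s*;', re.M)

def scan(source):
    """Return (provided module or None, list of imported modules)."""
    provides = _PROVIDES.findall(source)
    return (provides[0] if provides else None, _IMPORTS.findall(source))
```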

I think, rather than the question in the email title here, a better open question is:


 What needs to happen in tooling and popular libraries for C++ modules to become a success?


That can include questions such as: the impact of naming a module in the source; the features Manuel mentioned would need to be added to all compilers, such as an import/module/export scanning mode; buildsystems being ported to use such a mode; the extra steps buildsystems may have to take explicitly where today 'compile and link' gets you most of the way, and how those will be exposed; what people who maintain the buildsystem for their project have to do; what the transition looks like; etc.

Currently, from what I read on reddit at least, people seem to think that C++ modules will be added to the standard, magic will happen, and then they will achieve nirvana. What needs to be done to make that true?

The modules proposal seems more interdependent with tooling and how libraries maintain/define their interface than any other C++ feature I'm aware of since C++11.


For example, we went back-and-forth multiple times between specifying all transitive modules on the command line (less non-parallelizable work for the build system) vs only handing in top-level modules (that is, not specifying modules on the command line that are in the transitive dependencies of a different module in the set of transitively used modules).

Not specifying all transitive (public) dependencies on the command line will only work if all module files are in well-known locations or the same location, right?

IOW, if Lib1 depends (publicly) on Lib2 and Exe depends on Lib1, you specify only the full path to the module file for Lib1 when compiling Exe.cpp. How is the module file for Lib2 found if it is in some other random location? Is the path hardcoded in the Lib1 module file?


Currently, we are back to only specifying top-level modules, as that means clang can figure out which modules are actually used, writes only the used modules to the .d file, and we can use that information to prune the dependency graph of builds depending on that module (yea, it's somewhat complex, and we actually do an include-scanning step where we try to figure out which headers, and thus modules, are reachable at all from a source file).

Whatever complexity you hit in your buildsystem with modules, every other buildsystem will eventually hit too.
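The difference between the two strategies described above can be sketched in a few lines of Python, using a hypothetical Lib1/Lib2 graph: with the 'all transitive modules' approach the build system computes and passes the full closure, while with 'top-level only' it passes just Lib1 and the compiler must locate Lib2 itself (via a search path, an embedded path, or similar).

```python
def transitive_closure(roots, deps):
    """All modules reachable from 'roots' through the dependency
    graph 'deps' -- what a build system must compute and pass if the
    compiler wants every module file on the command line."""
    seen, stack = set(), list(roots)
    while stack:
        m = stack.pop()
        if m not in seen:
            seen.add(m)
            stack.extend(deps.get(m, []))
    return seen

# Hypothetical graph: Exe imports Lib1, and Lib1 publicly depends on Lib2.
deps = {"Lib1": ["Lib2"], "Lib2": []}

# Strategy 1: pass the full transitive closure when compiling Exe.cpp.
full = transitive_closure(["Lib1"], deps)

# Strategy 2: pass only the top-level import; the compiler must then
# find Lib2's module file on its own.
top_level_only = {"Lib1"}
```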


A different consideration is that for large continuously integrated code bases, the rebuilds triggered by a header change in a core library are much more of a problem than for a user of Qt, for example, who will probably stick with a single version for a longer while.

That depends on whether you are a developer working on Qt :).

Anyway, that's just an example. Most large codebases I've seen have the code separated into multiple libraries, not unlike the way Qt is structured. They will want to use modules and will hit the same issues we hit with Qt and in the same way.

 
If that could be expected to work, CMake could compile module files into some internal directory within the build directory and simply pass the path to it to all compiles (CMake would still need to know somehow which module files to compile in that way though).

I think simple modules support in CMake could look like this:
If you add a C++ library, you can specify interface h1.hm h2.hm h3.hm (no idea what module files will be called :), which would build a module out of these headers / C++ module definitions.
The module will be passed to all libraries in the reverse transitive dependency closure.
For clang, for example, we could also currently pass h.cppmap in the interface (for backwards compatibility), which will only be needed for the transitional period, and compile a module out of plain C++ headers.

You use the singular 'a module' for a library, whereas elsewhere in the discussion the recommendation was to maintain one module per class in the library. I'm having trouble juggling all of the different options and the impact they have on libraries and buildsystems.

What you describe above is similar to what I had in mind at the beginning which is something akin to one module per library.


That way, you could get incremental compilation benefits early, and we could get real world experience on modules builds for projects that are happy to be early testers.

I'm sure there are many willing. I'm sure that will bring up even more questions too.
 
Thanks,

Steve.

Richard Smith

unread,
Feb 7, 2017, 7:32:49 PM2/7/17
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On 7 February 2017 at 16:15, Stephen Kelly <stev...@gmail.com> wrote:
On 02/07/2017 09:32 AM, 'Manuel Klimek' via SG2 - Modules wrote:
On Tue, Feb 7, 2017 at 12:04 AM Stephen Kelly <stev...@gmail.com> wrote:
A separate issue is that, as compiled modules will probably not be distributable, the buildsystem will have to compile all the module files it depends on. I don't have any sense of how long it would take to compile module files for all classes in QtCore QtGui and QtWidgets (or QtCore QtGui QtQml and QtQuick) just to build your first hello world Qt program.

I think it's too early to really say how Qt should be laid out. It'll depend a lot on how specific compilers will need modules to be presented.

What the interface to modules is, and how libraries such as (but not limited to) Qt, their users, and buildsystems will make use of them, seem quite interdependent to me.

It is also interdependent with buildsystems. The Ninja buildsystem cannot currently build Fortran code because the way Fortran modules are specified is not compatible with how Ninja currently works:

 https://groups.google.com/forum/#!topic/ninja-build/tPOcu5EWXio

I don't know how fixable that is, but I think that's an important conversation, and one which is inseparable from the design of modules.

I think, rather than the question in the email title here, a better open question is:


 What needs to happen in tooling and popular libraries for C++ modules to become a success?


That can include questions such as: the impact of naming a module in the source; the features Manuel mentioned would need to be added to all compilers, such as an import/module/export scanning mode; buildsystems being ported to use such a mode; the extra steps buildsystems may have to take explicitly where today 'compile and link' gets you most of the way, and how those will be exposed; what people who maintain the buildsystem for their project have to do; what the transition looks like; etc.

Currently, from what I read on reddit at least, people seem to think that C++ modules will be added to the standard, magic will happen, and then they will achieve nirvana. What needs to be done to make that true?

The modules proposal seems more interdependent with tooling and how libraries maintain/define their interface than any other C++ feature I'm aware of since C++11.

For example, we went back-and-forth multiple times between specifying all transitive modules on the command line (less non-parallelizable work for the build system) vs only handing in top-level modules (that is, not specifying modules on the command line that are in the transitive dependencies of a different module in the set of transitively used modules).

Not specifying all transitive (public) dependencies on the command line will only work if all module files are in well-known locations or the same location, right?

Yes; Clang's primary approach here is to store the relative path from the output module to its direct dependencies (to allow the entire module subtree to be relocated).
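A rough Python sketch of that relocation scheme (the file names and paths are made up, and clang's actual module format is of course binary): the build embeds paths to direct dependencies relative to the module itself, so moving the whole tree keeps them resolvable.

```python
import os

def embed_dependency_paths(module_path, dep_paths):
    """Store, inside a compiled module, the paths of its direct
    dependencies *relative* to the module itself, so the whole
    output tree can later be moved as a unit."""
    base = os.path.dirname(module_path)
    return [os.path.relpath(p, base) for p in dep_paths]

def resolve_dependencies(module_path, stored_relative):
    """At import time, turn the stored relative paths back into
    absolute ones, anchored at wherever the module now lives."""
    base = os.path.dirname(module_path)
    return [os.path.normpath(os.path.join(base, r)) for r in stored_relative]

# The build writes Lib1's module, recording Lib2's location relative to
# it; the whole tree is then relocated from /build to /install.
rel = embed_dependency_paths("/build/lib1/Lib1.pcm", ["/build/lib2/Lib2.pcm"])
moved = resolve_dependencies("/install/lib1/Lib1.pcm", rel)
```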
 
IOW, if Lib1 depends (publicly) on Lib2 and Exe depends on Lib1, you specify only the full path to the module file for Lib1 when compiling Exe.cpp. How is the module file for Lib2 found if it is in some other random location? Is the path hardcoded in the Lib1 module file?

Currently, we are back to only specifying top-level modules, as that means clang can figure out which modules are actually used, writes only the used modules to the .d file, and we can use that information to prune the dependency graph of builds depending on that module (yea, it's somewhat complex, and we actually do an include-scanning step where we try to figure out which headers, and thus modules, are reachable at all from a source file).

Whatever complexity you hit in your buildsystem with modules, every other buildsystem will eventually hit too.

A different consideration is that for large continuously integrated code bases, the rebuilds triggered by a header change in a core library are much more of a problem than for a user of Qt, for example, who will probably stick with a single version for a longer while.

That depends on whether you are a developer working on Qt :).

Anyway, that's just an example. Most large codebases I've seen have the code separated into multiple libraries, not unlike the way Qt is structured. They will want to use modules and will hit the same issues we hit with Qt and in the same way.
 
If that could be expected to work, CMake could compile module files into some internal directory within the build directory and simply pass the path to it to all compiles (CMake would still need to know somehow which module files to compile in that way though).

I think simple modules support in CMake could look like this:
If you add a C++ library, you can specify interface h1.hm h2.hm h3.hm (no idea what module files will be called :), which would build a module out of these headers / C++ module definitions.
The module will be passed to all libraries in the reverse transitive dependency closure.
For clang, for example, we could also currently pass h.cppmap in the interface (for backwards compatibility), which will only be needed for the transitional period, and compile a module out of plain C++ headers.

You use the singular 'a module' for a library, whereas elsewhere in the discussion the recommendation was to maintain one module per class in the library. I'm having trouble juggling all of the different options and the impact they have on libraries and buildsystems.

What you describe above is similar to what I had in mind at the beginning which is something akin to one module per library.

That way, you could get incremental compilation benefits early, and we could get real world experience on modules builds for projects that are happy to be early testers.

I'm sure there are many willing. I'm sure that will bring up even more questions too.
 
Thanks,

Steve.

Manuel Klimek

unread,
Feb 8, 2017, 6:47:25 AM2/8/17
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On Wed, Feb 8, 2017 at 1:16 AM Stephen Kelly <stev...@gmail.com> wrote:
On 02/07/2017 09:32 AM, 'Manuel Klimek' via SG2 - Modules wrote:
On Tue, Feb 7, 2017 at 12:04 AM Stephen Kelly <stev...@gmail.com> wrote:
A separate issue is that, as compiled modules will probably not be distributable, the buildsystem will have to compile all the module files it depends on. I don't have any sense of how long it would take to compile module files for all classes in QtCore QtGui and QtWidgets (or QtCore QtGui QtQml and QtQuick) just to build your first hello world Qt program.

I think it's too early to really say how Qt should be laid out. It'll depend a lot on how specific compilers will need modules to be presented.

What the interface to modules is, and how libraries such as (but not limited to) Qt, their users, and buildsystems will make use of them, seem quite interdependent to me.

It is also interdependent with buildsystems. The Ninja buildsystem cannot currently build Fortran code because the way Fortran modules are specified is not compatible with how Ninja currently works:

 https://groups.google.com/forum/#!topic/ninja-build/tPOcu5EWXio

I don't know how fixable that is, but I think that's an important conversation, and one which is inseparable from the design of modules.

I think, rather than the question in the email title here, a better open question is:


 What needs to happen in tooling and popular libraries for C++ modules to become a success?


That can include questions such as: the impact of naming a module in the source; the features Manuel mentioned would need to be added to all compilers, such as an import/module/export scanning mode; buildsystems being ported to use such a mode; the extra steps buildsystems may have to take explicitly where today 'compile and link' gets you most of the way, and how those will be exposed; what people who maintain the buildsystem for their project have to do; what the transition looks like; etc.

Currently, from what I read on reddit at least, people seem to think that C++ modules will be added to the standard, magic will happen, and then they will achieve nirvana. What needs to be done to make that true?

The modules proposal seems more interdependent with tooling and how libraries maintain/define their interface than any other C++ feature I'm aware of since C++11.

If I'm not mistaken, the modules TS currently still has open discussion points like "will modules support macros", "will modules support legacy #includes", etc. I expect that we'll only be able to answer your questions fully once that is settled. If you want to have influence on this, you'll probably need to get involved in the standardization process :)

For example, we went back-and-forth multiple times between specifying all transitive modules on the command line (less non-parallelizable work for the build system) vs only handing in top-level modules (that is, not specifying modules on the command line that are in the transitive dependencies of a different module in the set of transitively used modules).

Not specifying all transitive (public) dependencies on the command line will only work if all module files are in well-known locations or the same location, right?

IOW, if Lib1 depends (publicly) on Lib2 and Exe depends on Lib1, you specify only the full path to the module file for Lib1 when compiling Exe.cpp. How is the module file for Lib2 found if it is in some other random location? Is the path hardcoded in the Lib1 module file?


Currently, we are back to only specifying top-level modules, as that means clang can figure out which modules are actually used, writes only the used modules to the .d file, and we can use that information to prune the dependency graph of builds depending on that module (yea, it's somewhat complex, and we actually do an include-scanning step where we try to figure out which headers, and thus modules, are reachable at all from a source file).

Whatever complexity you hit in your buildsystem with modules, every other buildsystem will eventually hit too.

I don't think that's necessarily true, because C++ modules might still end up being somewhat different from how Clang modules work today.

For example, if the only way to get modules is to rewrite your code bottom-up, keeping a #include tree on the side for legacy reasons, most of the problems we hit by declaring current code as modular might not arise, as folks would naturally avoid them.
 
A different consideration is that for large continuously integrated code bases, the rebuilds triggered by a header change in a core library are much more of a problem than for a user of Qt, for example, who will probably stick with a single version for a longer while.

That depends on whether you are a developer working on Qt :).

Anyway, that's just an example. Most large codebases I've seen have the code separated into multiple libraries, not unlike the way Qt is structured. They will want to use modules and will hit the same issues we hit with Qt and in the same way.

Correct. Fundamentally, how you split up your library into modules will be the decision of the library owner, and I think module systems of other languages have shown us that all modules implementations will basically face the same problems - when you put large interfaces into a single unit, and require that to be available for builds in reverse dependencies, you'll get more rebuilds, for the trade-off of more convenient imports.
The old "maximize cohesion, minimize coupling" rule applies.
 
If that could be expected to work, CMake could compile module files into some internal directory within the build directory and simply pass the path to it to all compiles (CMake would still need to know somehow which module files to compile in that way though).

I think simple modules support in CMake could look like this:
If you add a C++ library, you can specify interface h1.hm h2.hm h3.hm (no idea what module files will be called :), which would build a module out of these headers / C++ module definitions.
The module will be passed to all libraries in the reverse transitive dependency closure.
For clang, for example, we could also currently pass h.cppmap in the interface (for backwards compatibility), which will only be needed for the transitional period, and compile a module out of plain C++ headers.

You use the singular 'a module' for a library, whereas elsewhere in the discussion the recommendation was to maintain one module per class in the library. I'm having trouble juggling all of the different options and the impact they have on libraries and buildsystems.

What you describe above is similar to what I had in mind at the beginning which is something akin to one module per library.

Yea, but I'll also argue that you should just have a library per .cc file on larger projects, and that for smaller projects it doesn't matter :)
 
That way, you could get incremental compilation benefits early, and we could get real world experience on modules builds for projects that are happy to be early testers.

I'm sure there are many willing. I'm sure that will bring up even more questions too.
 
Thanks,

Steve.

Stephen Kelly

unread,
Feb 8, 2017, 7:16:47 PM2/8/17
to mod...@isocpp.org, Gabriel Dos Reis, C++ Modules, Daniel Jasper
On 02/08/2017 11:47 AM, 'Manuel Klimek' via SG2 - Modules wrote:
On Wed, Feb 8, 2017 at 1:16 AM Stephen Kelly <stev...@gmail.com> wrote:
The modules proposal seems more interdependent with tooling and how libraries maintain/define their interface than any other C++ feature I'm aware of since C++11.

If I'm not mistaken, the modules TS currently still has open discussion points like "will modules support macros", "will modules support legacy #includes", etc. I expect that we'll only be able to answer your questions fully once that is settled. If you want to have influence on this, you'll probably need to get involved in the standardization process :)

AFAIK, that's what I'm doing by engaging in this thread :).

Thanks,

Steve.
