Hi,
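Clang 12 can be configured on Windows with MinGW (either GNU or
LLVM) with the following CMake parameters:

- LLVM_BUILD_LLVM_DYLIB=ON
- LLVM_LINK_LLVM_DYLIB=ON
- CLANG_LINK_CLANG_DYLIB=ON

These noticeably reduce the binary size of the build.

I configured the llvm-project with the following parameters:

- CMAKE_BUILD_TYPE=Release
- LLVM_TARGETS_TO_BUILD=X86
- LLVM_ENABLE_PROJECTS=clang;clang-tools-extra

The installed (stripped) build of Clang 12 with llvm-mingw release 12.0.0 (https://github.com/mstorsjo/llvm-mingw/releases/tag/20210423) resulted in:

- Normal build: 1,76 GB
- shlib build: 481 MB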
Because MSVC hides symbols by default (whereas MinGW makes them visible by default), one needs to generate a .def file listing the symbols to be exported.
This is already done in two cases: for LLVM_BUILD_LLVM_C_DYLIB (llvm/tools/llvm-shlib/gen-msvc-exports.py) and for LLVM_EXPORT_SYMBOLS_FOR_PLUGINS (llvm/utils/extract_symbols.py).
I've put together a patch that enables LLVM_DYLIB and CLANG_DYLIB for MSVC.
I tested with clang-cl from the official Clang 12 x64 Windows binary release:

- Normal build: 1,42 GB
- shlib build: 536 MB

The shlib release build compiled and linked fine with LLVM.dll and clang-cpp.dll, but unfortunately it crashes at runtime. For example, llvm-nm:
$ llvm-nm
PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace.
Stack dump:
0. Program arguments: llvm-nm
#0 0x00007ffd32807d43 llvm::StringMap<llvm::cl::Option *,llvm::MallocAllocator>::begin C:\Projects\llvm-project\repo\llvm\include\llvm\ADT\StringMap.h:204:0
#1 0x00007ffd32807d43 llvm::cl::HideUnrelatedOptions(class llvm::cl::OptionCategory &, class llvm::cl::SubCommand &) C:\Projects\llvm-project\repo\llvm\lib\Support\CommandLine.cpp:2589:0
#2 0x00007ff689df2b13 llvm::StringRef::StringRef C:\Projects\llvm-project\repo\llvm\include\llvm\ADT\StringRef.h:107:0
#3 0x00007ff689df2b13 main C:\Projects\llvm-project\repo\llvm\tools\llvm-nm\llvm-nm.cpp:2232:0
#4 0x00007ff689e26d04 invoke_main D:\agent\_work\10\s\src\vctools\crt\vcstartup\src\startup\exe_common.inl:78:0
#5 0x00007ff689e26d04 __scrt_common_main_seh D:\agent\_work\10\s\src\vctools\crt\vcstartup\src\startup\exe_common.inl:288:0
#6 0x00007ffd9a7f7034 (C:\Windows\System32\KERNEL32.DLL+0x17034)
#7 0x00007ffd9b742651 (C:\Windows\SYSTEM32\ntdll.dll+0x52651)
The crash occurs in llvm::cl::HideUnrelatedOptions, which accesses
TopLevelSubCommand, which is declared as:
extern ManagedStatic<SubCommand> TopLevelSubCommand;
The MSVC 2019 build behaves in the same way as the clang-cl build.
I have tried building without LLVM_ENABLE_THREADS and linking the CRT statically (LLVM_USE_CRT_RELEASE=MT); neither helped.

The MSVC 2019 build sizes were:

- Normal build: 1,74 GB
- shlib build: 949 MB

I would appreciate any help in getting the shlib build running.
It works fine with llvm-mingw, so I think it should also work with
clang-cl / cl.
Cheers,
Cristian.
_______________________________________________
LLVM Developers mailing list
llvm...@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
> Due to the nature of MSVC regarding default visibility of symbols (hidden by
> default, whereas MinGW has visible by default), one needs to generate a .def
> file with the symbols needed to be exported.
>
> This is done already in two cases for LLVM_BUILD_LLVM_C_DYLIB
> (llvm/tools/llvm-shlib/gen-msvc-exports.py) and for
> LLVM_EXPORT_SYMBOLS_FOR_PLUGINS (llvm/utils/extract_symbols.py).
>
> I've put together a patch that enables LLVM_DYLIB and CLANG_DYLIB for MSVC.
>
> I tested with clang-cl from the official Clang 12 x64 Windows binary
> release:
>
> * Normal build: 1,42 GB
> * shlib build: 536 MB
>
> The shlib release build compiled and linked fine with LLVM.dll and
> clang-cpp.dll, unfortunately it crashes at runtime.
Without digging into the scripts, I have one hunch:
Does the def generator script differentiate between code and data symbols?
When accessing a data symbol from another DLL, the caller must have
seen a declaration with the dllimport attribute. For functions, it
doesn't matter (the call just takes an extra hop via the import thunk),
but for data variables it matters. If the def file had proper DATA
annotations for such symbols, you would end up with linker errors (an
undefined reference to dataSymbol, where the import library only
provides __imp_dataSymbol).
This is fixed up by the autoimport feature when linking in mingw mode
(which, in general, requires you to link against the mingw runtime too).
For cases where the caller references dataSymbol but only
__imp_dataSymbol is available, the linker adds an entry to a list of
pseudo relocations. The mingw runtime processes that list at load time:
it maps the affected sections as writable and patches the addresses to
point at the data's location in the other DLL.
So to avoid this, we would either need to actually provide proper
dllimport declarations at least for all data symbols, or avoid cross-DLL
data accesses (by using e.g. accessor functions instead).
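As a rough sketch of the second option (accessor functions), assuming a hypothetical Registry type standing in for a library-owned global: instead of exporting a variable that callers in another DLL would need to see as dllimport, the library exports a function that returns a reference to the object. All names below are illustrative and are not taken from LLVM.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a library-owned global, e.g. a registry of names.
struct Registry {
  std::vector<std::string> Names;
};

// Library side: the object lives in a function-local static, so no data
// symbol has to be exported. Only this function crosses the DLL boundary.
Registry &getRegistry() {
  static Registry Instance;
  return Instance;
}

// Caller side (would live in another module): goes through the accessor
// instead of naming a global variable directly.
std::size_t registerName(const std::string &Name) {
  assert(!Name.empty() && "illustrative precondition");
  Registry &R = getRegistry();
  R.Names.push_back(Name);
  return R.Names.size();
}
```

Since a function call across a DLL boundary works without dllimport, the caller in this arrangement never references the data symbol itself.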
// Martin
Yes, for Clang, the benefits are mostly captured with just -Bsymbolic-functions.
For GCC, -fno-semantic-interposition is needed to enable interprocedural
optimizations for -fPIC compiles (https://reviews.llvm.org/D102453).
>I put out this alternative proposal mainly to see if there is any
>enthusiasm for it. If not, you are welcome to continue forward with our
>existing solutions. But if there is broad interest in adding API
>annotations to LLVM, I think that would be a better way to go long term.
I think explicit annotations make sense. I think most large-scale
projects considering ELF/Windows portability are doing this and
llvm-project is an unfortunate outlier.
If we add annotations (I personally favor it), there will be churn in the
llvm/include/llvm/**/*.h header files.
Moreover, as is, almost every function defined in llvm/lib/A/*.cpp is
exposed in llvm/include/llvm/A/*.h if it is used by another library
llvm/lib/B. Every cross-lib API is public. We don't do a good job of
making clear which APIs are internal and which are public (and which are
more stable and which are less so).
>On Sun, May 30, 2021 at 7:17 AM Cristian Adam via llvm-dev <
>llvm...@lists.llvm.org> wrote:
>
>> Hi,
>>
>> Clang 12 can be configured on Windows with MinGW (either GNU or LLVM) with
>> the following CMake parameters:
>>
>> - LLVM_BUILD_LLVM_DYLIB=ON
>> - LLVM_LINK_LLVM_DYLIB=ON
>> - CLANG_LINK_CLANG_DYLIB=ON
>>
>> which has some effect on the binary size of the build.
>>
>> I configured the llvm-project with the following parameters:
>>
>> - CMAKE_BUILD_TYPE=Release
>> - LLVM_TARGETS_TO_BUILD=X86
>> - LLVM_ENABLE_PROJECTS=clang;clang-tools-extra
>>
>> The installed (stripped) build of Clang 12 with llvm-mingw release 12.0.0
>> <https://github.com/mstorsjo/llvm-mingw/releases/tag/20210423> resulted
>> in:
>>
>> - Normal build: 1,76 GB
>> - shlib build: 481 MB
>>
>> Due to the nature of MSVC regarding default visibility of symbols (hidden
>> by default, whereas MinGW has visible by default), one needs to generate a
>> .def file with the symbols needed to be exported.
>>
>> This is done already in two cases for LLVM_BUILD_LLVM_C_DYLIB (llvm/tools/llvm-shlib/gen-msvc-exports.py)
>> and for LLVM_EXPORT_SYMBOLS_FOR_PLUGINS (llvm/utils/extract_symbols.py).
>>
>> I've put together a patch
>> <https://github.com/cristianadam/llvm-project/commit/3a3b8a7df17a49ba7c0153b0c9a7ee25705ede46>
>> that enables LLVM_DYLIB and CLANG_DYLIB for MSVC.
>>
>> I tested with clang-cl from the official Clang 12 x64 Windows binary
>> release:
>>
>> - Normal build: 1,42 GB
>> - shlib build: 536 MB
>>
>> The MSVC 2019 build sizes were:
>>
>> - Normal build: 1,74 GB
>> - shlib build: 949 MB
On 03/06/2021 23:09, Martin Storsjö wrote:
> On Sun, 30 May 2021, Cristian Adam via llvm-dev wrote:
>> Due to the nature of MSVC regarding default visibility of symbols (hidden by
>> default, whereas MinGW has visible by default), one needs to generate a .def
>> file with the symbols needed to be exported.
>>
>> This is done already in two cases for LLVM_BUILD_LLVM_C_DYLIB
>> (llvm/tools/llvm-shlib/gen-msvc-exports.py) and for
>> LLVM_EXPORT_SYMBOLS_FOR_PLUGINS (llvm/utils/extract_symbols.py).
>>
>> I've put together a patch that enables LLVM_DYLIB and CLANG_DYLIB for MSVC.
>>
>> I tested with clang-cl from the official Clang 12 x64 Windows binary
>> release:
>>
>> * Normal build: 1,42 GB
>> * shlib build: 536 MB
>>
>> The shlib release build compiled and linked fine with LLVM.dll and
>> clang-cpp.dll, unfortunately it crashes at runtime.
>
> Without digging into the scripts, I have one hunch:
>
> Does the def generator script differentiate between code and data symbols?
No. The CMake script just calls llvm-nm, exports the symbols, and filters some out.
llvm-nm needs to exist beforehand.
I tried using dumpbin, but it's slower than llvm-nm. CMake's __create_def
also works, but it doesn't export all the symbols needed to link properly.
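For illustration only, here is a sketch of how a generator could tell code from data when writing the .def file. It is not what the patch does. It assumes nm-style input lines of the form "<address> <type> <name>" with mangled names (no spaces), and it assumes the type letters T for code and D, B, R for data. The function name makeDefFile is hypothetical.

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Builds the text of a module-definition (.def) file from nm-style lines.
// Assumption: "T" marks code, "D"/"B"/"R" mark data; every other line
// (for example "U name" for undefined symbols) is skipped.
std::string makeDefFile(const std::string &NmOutput) {
  std::istringstream In(NmOutput);
  std::string Out = "EXPORTS\n";
  std::string Line;
  while (std::getline(In, Line)) {
    std::istringstream Fields(Line);
    std::string Address, Type, Name;
    if (!(Fields >> Address >> Type >> Name))
      continue; // fewer than three fields: not a defined symbol
    assert(!Name.empty()); // operator>> never yields an empty token
    if (Type.size() != 1)
      continue;
    switch (Type[0]) {
    case 'T': // code: a plain export entry is enough
      Out += "  " + Name + "\n";
      break;
    case 'D':
    case 'B':
    case 'R': // data: annotate so the import library only offers __imp_<name>
      Out += "  " + Name + " DATA\n";
      break;
    default:
      break;
    }
  }
  return Out;
}
```

With DATA annotations in place, a caller that lacks a dllimport declaration for a variable would fail at link time rather than crash at runtime.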
> When accessing a data symbol from another DLL, the caller must have
> seen a declaration with the dllimport attribute. For functions, it
> doesn't matter (the call just takes an extra hop via the import thunk),
> but for data variables it matters. If the def file had proper DATA
> annotations for such symbols, you would end up with linker errors (an
> undefined reference to dataSymbol, where the import library only
> provides __imp_dataSymbol).
>
> This is fixed up by the autoimport feature when linking in mingw mode
> (which, in general, requires you to link against the mingw runtime too).
> For cases where the caller references dataSymbol but only
> __imp_dataSymbol is available, the linker adds an entry to a list of
> pseudo relocations. The mingw runtime processes that list at load time:
> it maps the affected sections as writable and patches the addresses to
> point at the data's location in the other DLL.
>
> So to avoid this, we would either need to actually provide proper
> dllimport declarations at least for all data symbols, or avoid cross-DLL
> data accesses (by using e.g. accessor functions instead).
According to https://docs.microsoft.com/en-us/cpp/build/reference/exports?view=msvc-160:
"When you export a variable from a DLL by using a .DEF file, you do not have to specify __declspec(dllexport) on the variable. However, in any file that uses the DLL, you must still use __declspec(dllimport) on the declaration of data."
I tested this in a small project (https://github.com/cristianadam/test-dll-def/) and then added dllimport declarations for llvm::cl::TopLevelSubCommand and llvm::cl::AllSubCommands, as seen in the
updated patch:
https://github.com/cristianadam/llvm-project/commit/56ecad41992bd9345702fccaf3805ab186dca25c
Now llvm-nm doesn't crash anymore!
Adding the dllimport declaration for the exported data symbols
should be less work than doing proper dllimport declaration for
everything that uses LLVM.dll and clang-cpp.dll. Now I just need
to find out which data variables I need to update.
Thank you!
Cheers,
Cristian.
This would be a huge improvement; if you want to work on it, go for it.
> The proposal from last week's Windows/COFF call was to start by adding a
> header defining `LLVM_EXPORT` macros for explicit marking of exported symbols,
> which can be used to mark data symbols as `__declspec(dllimport)` as well.
> With this in the codebase, the symbols required when using shared libraries,
> can be marked successively to enable more API boundaries to rely on the
> explicit exports.
>
There is already the LLVM_EXTERNAL_VISIBILITY macro defined in
llvm/Support/Compiler.h, which is used in llvm/lib/Target.
I would start using this one instead of creating a new LLVM_EXPORT macro.
We can always rename the macro later if people like the name LLVM_EXPORT
better.
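To make the discussion concrete, here is a minimal sketch of what such an export macro could look like. The names DEMO_EXPORT and DEMO_BUILDING_LIBRARY are placeholders I made up for illustration; this is not the definition of LLVM_EXTERNAL_VISIBILITY in Compiler.h.

```cpp
#include <cassert>

// This single-file sketch plays the role of the library itself, so it
// pretends the build system passed -DDEMO_BUILDING_LIBRARY.
#define DEMO_BUILDING_LIBRARY 1

// Hypothetical export macro (placeholder names, not LLVM's).
#if defined(_WIN32)
#  if defined(DEMO_BUILDING_LIBRARY)
#    define DEMO_EXPORT __declspec(dllexport) // compiling the DLL
#  else
#    define DEMO_EXPORT __declspec(dllimport) // compiling a user of the DLL
#  endif
#elif defined(__GNUC__)
#  define DEMO_EXPORT __attribute__((visibility("default")))
#else
#  define DEMO_EXPORT
#endif

// Data symbol: users of the DLL must see the dllimport form so that the
// compiler emits an indirect access through the import table.
extern DEMO_EXPORT int DemoCounter;
int DemoCounter = 0;

// Function: callers work even without dllimport (at the cost of an extra
// hop through the import thunk), but the annotation makes the exported
// API explicit.
DEMO_EXPORT int bumpCounter() {
  assert(DemoCounter >= 0);
  return ++DemoCounter;
}
```

With this pattern the same header serves both sides: the library build defines the "building" macro and exports, while every other translation unit imports.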
- Tom
> There is already the LLVM_EXTERNAL_VISIBILITY macro defined in
> llvm/Support/Compiler.h, which is used in llvm/lib/Target.
> I would start using this one instead of creating a new LLVM_EXPORT macro.
> We can always rename the macro later if people like the name LLVM_EXPORT
> better.
Ok, this is fine with me too.
-Tom
> Personally, I like the *_EXPORT name, but the other widely used convention is *_API. I think ICU uses that.
>
> Here are some examples of existing export header templates:
> https://cmake.org/cmake/help/latest/module/GenerateExportHeader.html
> https://gitlab.kitware.com/cmake/cmake/-/blob/master/Modules/exportheader.cmake.in
> https://source.chromium.org/chromium/chromium/src/+/main:base/base_export.h;l=12?q=base_export%20&ss=chromium