[llvm-dev] Contributing LLD for Mach-O

71 views
Skip to first unread message

Shoaib Meenai via llvm-dev

unread,
Feb 28, 2020, 4:32:29 PM2/28/20
to llvm...@lists.llvm.org, Jez Ng

Hi all,

We’re planning to contribute a new implementation of LLD for Mach-O, using the same design as the COFF and ELF ports. This design has proven to work very well for those ports, and we’re keen to explore it for Mach-O as well. Our work is based on an initial prototype created by Peter Collingbourne and Rui Ueyama.

Our initial commit is up for review at https://reviews.llvm.org/D75382. We’ve intentionally stripped down this initial commit as much as possible to ease reviewing; we’ve kept it to the absolute minimum needed to produce and test a working macOS x86-64 executable for that prints “Hello World” via a syscall. We have several short-term follow-ups planned to add important functionality, such as linking against archives, universal binaries, dylibs, and tbd files, performing subsection splitting (atomization), and producing dylibs. The follow-ups should give a good sense of the overall design while still keeping each piece easily reviewable and testable individually. Our end goal is to create a full-featured Mach-O linker, and we’ll be working toward that goal over the next several months (and years, in all likelihood). We’d appreciate feedback and reviews.

James Y Knight via llvm-dev

unread,
Feb 28, 2020, 4:47:02 PM2/28/20
to Shoaib Meenai, llvm...@lists.llvm.org, Jez Ng
Nice!

Your plan sounds great, and it'll be awesome to finally have a good MachO LLD available.

_______________________________________________
LLVM Developers mailing list
llvm...@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev

Fangrui Song via llvm-dev

unread,
Feb 29, 2020, 1:40:22 AM2/29/20
to llvm-dev, Jez Ng

The existing Mach-O port https://reviews.llvm.org/D38290#882910
had been unmaintained when the ld64.lld alias was added.

If Jez and the team are committed to maintain the new Mach-O port and we
think the existing port is a dead end, we may assign the flavor `darwin` to
it (`lld -flavor darwin`) and rename the existing flavor to `darwin-old` or
`darwin-legacy`.

David Blaikie via llvm-dev

unread,
Feb 29, 2020, 5:15:10 PM2/29/20
to Shoaib Meenai, llvm...@lists.llvm.org, Jez Ng
Awesome :) out of curiosity, are there any particular features/characteristics/benefits you're hoping for compared to the MacOS system linker?

_______________________________________________

Shoaib Meenai via llvm-dev

unread,
Mar 2, 2020, 2:18:19 PM3/2/20
to Fangrui Song, llvm-dev, Jez Ng
We are committed to maintaining this. I believe there are some people who use the old port right now though, so I'd want us to be a bit more feature complete before we take over the Darwin flavor (e.g. we should at least be able to self-host).

On 2/28/20, 10:40 PM, "Fangrui Song" <mas...@google.com> wrote:

On 2020-02-28, James Y Knight via llvm-dev wrote:
>Nice!
>
>Your plan sounds great, and it'll be awesome to finally have a good MachO
>LLD available.
>
>On Fri, Feb 28, 2020 at 4:32 PM Shoaib Meenai via llvm-dev <
>llvm...@lists.llvm.org> wrote:
>
>> Hi all,
>>
>> We’re planning to contribute a new implementation of LLD for Mach-O, using
>> the same design as the COFF and ELF ports. This design has proven to work
>> very well for those ports, and we’re keen to explore it for Mach-O as well.
>> Our work is based on an initial prototype created by Peter Collingbourne
>> and Rui Ueyama.
>>
>> Our initial commit is up for review at https://urldefense.proofpoint.com/v2/url?u=https-3A__reviews.llvm.org_D75382&d=DwIDaQ&c=5VD0RTtNlTh3ycd41b3MUw&r=o3kDXzdBUE3ljQXKeTWOMw&m=ewsOF79X4LajoVfIDFOIrBvc7WiqUQNo7sNMnNPG48I&s=5X09zpWtQuKwp-Fsb1NHCSVFTJ3-63Gy1PcucIjSQTQ&e= .
>> We’ve intentionally stripped down this initial commit as much as possible
>> to ease reviewing; we’ve kept it to the absolute minimum needed to produce
>> and test a working macOS x86-64 executable for that prints “Hello World”
>> via a syscall. We have several short-term follow-ups planned to add
>> important functionality, such as linking against archives, universal
>> binaries, dylibs, and tbd files, performing subsection splitting
>> (atomization), and producing dylibs. The follow-ups should give a good
>> sense of the overall design while still keeping each piece easily
>> reviewable and testable individually. Our end goal is to create a
>> full-featured Mach-O linker, and we’ll be working toward that goal over the
>> next several months (and years, in all likelihood). We’d appreciate
>> feedback and reviews.

The existing Mach-O port https://urldefense.proofpoint.com/v2/url?u=https-3A__reviews.llvm.org_D38290-23882910&d=DwIDaQ&c=5VD0RTtNlTh3ycd41b3MUw&r=o3kDXzdBUE3ljQXKeTWOMw&m=ewsOF79X4LajoVfIDFOIrBvc7WiqUQNo7sNMnNPG48I&s=9tPjw9s64q0kszjhd9Yj4Shpc8NOYMOacqCyrWH5h5A&e=

Shoaib Meenai via llvm-dev

unread,
Mar 2, 2020, 2:22:38 PM3/2/20
to David Blaikie, llvm...@lists.llvm.org, Jez Ng

We’re hoping to get some linking speedups, similar to how LLD for ELF and COFF ended up being faster than the existing system linkers, though of course it’s hard to say how ld64’s design and efficiency compares to those other system linkers. In particular, we’re excited about the potential of speedups from parallelization and using the LLVM data structures instead of the STL ones.

Fangrui Song via llvm-dev

unread,
Mar 2, 2020, 5:11:46 PM3/2/20
to Shoaib Meenai, Jez Ng, llvm-dev
On 2020-03-02, Shoaib Meenai wrote:
>We are committed to maintaining this. I believe there are some people who use the old port right now though, so I'd want us to be a bit more feature complete before we take over the Darwin flavor (e.g. we should at least be able to self-host).

Thanks for the commitment. I know close to zero about Mach-O, but I'll try following your development.

You may already have known this, but the following two talks are helpful.

https://llvm.org/devmtg/2017-10/#talk16
https://archive.fosdem.org/2019/schedule/event/llvm_lld/

Shoaib Meenai via llvm-dev

unread,
Mar 2, 2020, 5:24:22 PM3/2/20
to Fangrui Song, Jez Ng, llvm-dev
I'd seen the first talk but not the second; I'll check that out. Thanks for the pointer.

On 3/2/20, 2:11 PM, "Fangrui Song" <mas...@google.com> wrote:

On 2020-03-02, Shoaib Meenai wrote:
>We are committed to maintaining this. I believe there are some people who use the old port right now though, so I'd want us to be a bit more feature complete before we take over the Darwin flavor (e.g. we should at least be able to self-host).

Thanks for the commitment. I know close to zero about Mach-O, but I'll try following your development.

You may already have known this, but the following two talks are helpful.

https://urldefense.proofpoint.com/v2/url?u=https-3A__llvm.org_devmtg_2017-2D10_-23talk16&d=DwIDaQ&c=5VD0RTtNlTh3ycd41b3MUw&r=o3kDXzdBUE3ljQXKeTWOMw&m=GT8uVlPGTQdvVjhHId_JrQo0z146SwJ5NNsuWB5Q2fs&s=XV1y4LJ9pubNdyDEOXQQXEjK3kKmamHDIYvRFtrfmlE&e=
https://urldefense.proofpoint.com/v2/url?u=https-3A__archive.fosdem.org_2019_schedule_event_llvm-5Flld_&d=DwIDaQ&c=5VD0RTtNlTh3ycd41b3MUw&r=o3kDXzdBUE3ljQXKeTWOMw&m=GT8uVlPGTQdvVjhHId_JrQo0z146SwJ5NNsuWB5Q2fs&s=4wk-TRzE3PfIOqYA2wp_deDIl4oAfwTvgb9qyvqYABM&e=

Dave Lee via llvm-dev

unread,
Mar 3, 2020, 10:37:02 AM3/3/20
to Shoaib Meenai, llvm...@lists.llvm.org, Jez Ng
Hi Shoaib,

Has there been any recent discussion with anyone on Apple's side? Is there a way forward that results in a single unified open source linker? If not at the start, how would it work if they later want to take maintainership?

Have you thought about a compatibility test suite? If so, I'm curious what the approach will be.

Also for what it's worth, there's a recent fork of ld64's recent source drop, which is optimized for incremental compilation. It also uses non-STL data structures, parallelism, and a disk cache. See https://github.com/michaeleisel/zld

_______________________________________________

Fangrui Song via llvm-dev

unread,
Mar 4, 2020, 9:00:58 PM3/4/20
to llvm-dev, Jez Ng
On 2020-03-03, Dave Lee via llvm-dev wrote:
>Hi Shoaib,
>
>Has there been any recent discussion with anyone on Apple's side? Is there
>a way forward that results in a single unified open source linker? If not
>at the start, how would it work if they later want to take maintainership?
>
>Have you thought about a compatibility test suite? If so, I'm curious what
>the approach will be.
>
>Also for what it's worth, there's a recent fork of ld64's recent source
>drop, which is optimized for incremental compilation. It also uses non-STL
>data structures, parallelism, and a disk cache. See
>https://github.com/michaeleisel/zld

% cat llvm-project/lld/CODE_OWNERS.TXT
...
N: Lang Hames, Nick Kledzik
E: lha...@gmail.com, kle...@apple.com
D: Mach-O backend
...

Lang resigned (https://reviews.llvm.org/D75382#1906108) and agreed to delete the existing Mach-O port,
but I think we should also get Nick's confirmation.

Shoaib Meenai via llvm-dev

unread,
Mar 4, 2020, 10:19:33 PM3/4/20
to Dave Lee, llvm...@lists.llvm.org, Jez Ng

We’ve discussed this with Jim Grosbach at Apple (CC’d). My understanding is that at this time, the officially supported linker for Apple’s platforms is ld64. We’d love to collaborate with Apple (and any other interested parties, for that matter) on LLD for Mach-O, and we’d be delighted if it were to become officially supported at some point, but there’s a lot of work to be done first on reaching feature parity with ld64 before that could even be considered :) Once we reach feature parity, I can envision several good reasons both for sticking with ld64 and for switching to LLD. On our end, we aim to create a feature-complete Mach-O linker. We also aim for the end product to be compelling enough that a switch could be considered, and we’d be happy to work with Apple on that front if it turns out to be.

 

What sort of compatibility test suite did you have in mind? We’re adding lit-style unit tests as we add features (as is standard for LLVM), but we didn’t have anything in mind beyond that right now.

 

We saw zld, and the speedups achieved by it make us hopeful of being able to achieve similar results with LLD (since we’ll also have better parallelization and better data structures in the form of the LLVM ones). (I’d also be curious if zld’s improvements could be contributed back to ld64 so that everyone can take advantage of them, but that’s completely tangential, of course.)

 

From: Dave Lee <davel...@gmail.com>
Date: Tuesday, March 3, 2020 at 7:37 AM
To: Shoaib Meenai <sme...@fb.com>
Cc: "llvm...@lists.llvm.org" <llvm...@lists.llvm.org>, Jez Ng <je...@fb.com>
Subject: Re: [llvm-dev] Contributing LLD for Mach-O

 

Hi Shoaib,

Jean-Daniel via llvm-dev

unread,
Mar 5, 2020, 2:01:35 AM3/5/20
to Shoaib Meenai, llvm...@lists.llvm.org

I tried to work on the Mach-O layer on LLD long-time ago (I got Dtrace User Probe working but as it was just before the LLD redesign and dev reboot, so never try to push the changes), and one thing I fond difficult was the complete lack of technical documentation on ld64 passes and flags.

I think that restarting a Mach-O parser from scratch (with ld64 feature parity as a goal) is a very good opportunity to also write a technical reference of the difference passes, especially the one that are Mach-O and Apple specific.
Do you plan something like that (as a page on lld.llvm.org for example). It would also be very helpful for potential contributors.

Reply all
Reply to author
Forward
0 new messages