Efficient storage of broadcasted tensor

31 views

Skip to first unread message

Joel Berkeley

unread,

May 14, 2026, 8:37:12 PMMay 14

to OpenXLA Discuss

Hi,

If I broadcast a tensor to match shapes, will XLA typically notice that and not duplicate my data, to save space (and possibly time)? For example, in

```

x = [1, 2, 3]

y = rand([100, 3])

```

I can do `broadcast x + y`, but I don't want to duplicate the data in x.

Kevin Gleason

unread,

May 15, 2026, 11:38:06 AMMay 15

to OpenXLA Discuss, Joel Berkeley

In general XLA aims to avoid materializing data when possible - it won't reverse engineer something that looks like a broadcasted i32[3] into an i32[3], but it should be able to avoid duplication in the i32[3]->i32[100,3] direction by throwing away induction variables on lowering (x[i, j] + y[i, j] == > x[j] + y[i, j]).

Also, I should note that this is technically implementation defined and I've only looked into this on the TPU side, but guessing its fairly similar mechanics across the board. Seems like a thing that would bite us quickly if we didn't do it.

Best,

Kevin

Reply all

Reply to author

Forward

0 new messages