LoGG tomorrow's paper: Does equivariance matter at scale?

0 views
Skip to first unread message

Hannes Stärk

unread,
Jan 5, 2025, 3:05:56 PMJan 5
to lo...@googlegroups.com
Hi together,

Long time no hear and happy new year! First dose of paper will be:

Paper:
Does equivariance matter at scale? https://arxiv.org/abs/2410.23179 (Johann Brehmer, Sönke Behrends, Pim de Haan, Taco Cohen)
Given large data sets and sufficient compute, is it beneficial to design neural architectures for the structure and symmetries of each problem? Or is it more efficient to learn them from data? We study empirically how equivariant and non-equivariant networks scale with compute and training samples. Focusing on a benchmark problem of rigid-body interactions and on general-purpose transformer architectures, we perform a series of experiments, varying the model size, training steps, and dataset size. We find evidence for three conclusions. First, equivariance improves data efficiency, but training non-equivariant models with data augmentation can close this gap given sufficient epochs. Second, scaling with compute follows a power law, with equivariant models outperforming non-equivariant ones at each tested compute budget. Finally, the optimal allocation of a compute budget onto model size and training duration differs between equivariant and non-equivariant models.

Speaker:
Johann Brehmer whos is a physicist turned machine learner and a research scientist at CuspAI in Amsterdam. There he works on machine learning–driven discovery of materials for carbon capture.

Meeting Details:
Every Monday at 12:00 ET / 9:00 PT / 18:00 CE(S)T.  
https://zoom.us/j/5775722530?pwd=ZzlGTXlDNThhUDZOdU4vN2JRMm5pQT09

Add it to your calendar:
Subscribe via Google Calendar, or subscribe via iCal.
Alternatively, add the events, or add this single event.

Slack Workspace for discussion and paper voting:
https://join.slack.com/t/logag/shared_invite/zt-2nmy5vsj7-8KEDXe~mU~p5fCxwupSv5w

All information: Schedule of upcoming papers, recordings, etc.:
https://portal.valencelabs.com/logg
Reply all
Reply to author
Forward
0 new messages