MLSys Seminar Episode 86: Dan Fu [Mon, 1:00 pm PT]

210 views
Skip to first unread message

Simran Arora

unread,
Dec 3, 2023, 8:27:39 PM12/3/23
to stanford-ml...@googlegroups.com, cs-se...@lists.stanford.edu, ai-...@cs.stanford.edu, stanf...@googlegroups.com, dawn-i...@lists.stanford.edu
Hi everyone, 

We're super excited to host Dan Fu, CS PhD Student at Stanford, for tomorrow's MLSys Seminar (December 4th) at 1:00 pm PT (please note the time change). The talk details are as follows:


Title: Monarch Mixer: Making Foundation Models More Efficient

Abstract: Machine learning models are increasingly being scaled in both sequence length and model dimension to reach longer contexts and better performance. However, existing architectures like Transformers scale quadratically along both these axes. In this talk I'll discuss Monarch Mixer (M2), a new architecture that uses the same sub-quadratic primitive along both sequence length and model dimension. M2 mixes information along the sequence and model dimensions using Monarch matrices, a simple class of expressive structured matrices that captures many linear transforms, achieves high hardware efficiency on GPUs, and scales sub-quadratically. 

Bio: Dan Fu is a PhD student in the Computer Science Department at Stanford University, where he is co-advised by Christopher Ré and Kayvon Fatahalian. His research is at the intersection of systems and machine learning and focuses on developing algorithms and architectures to make machine learning more efficient.

See everyone there!!

Best,
Simran


Simran Arora

unread,
Dec 4, 2023, 3:56:56 PM12/4/23
to stanford-ml...@googlegroups.com, cs-se...@lists.stanford.edu, ai-...@cs.stanford.edu, stanf...@googlegroups.com, dawn-i...@lists.stanford.edu
Reminder that this starts in 5 minutes!

Reply all
Reply to author
Forward
0 new messages