Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
Faster LLM Inference Seminar
Conversations
About
Groups keyboard shortcuts have been updated
Dismiss
See shortcuts
Faster LLM Inference Seminar
Contact owners and managers
1–9 of 9
Mark all as read
Report group
0 selected
Nadav Timor
Apr 10
Starting in <45min: “Optimizing attention for modern hardware” - Tri Dao (Princeton & Together AI)
Date & Time: Today, April 10, at 12:00 PM EST (Add to calendar) Abstract: Attention, as a core
unread,
Starting in <45min: “Optimizing attention for modern hardware” - Tri Dao (Princeton & Together AI)
Date & Time: Today, April 10, at 12:00 PM EST (Add to calendar) Abstract: Attention, as a core
Apr 10
Faster LLM Inference Seminar
Mar 17
On Thursday: “Adaptive Compute LLMs with Early Exits” - Tal Schuster (Google DeepMind)
Date & Time: Thursday, March 20, at 3:00 PM EST (Add to calendar) Abstract: Scaling LLMs is a
unread,
On Thursday: “Adaptive Compute LLMs with Early Exits” - Tal Schuster (Google DeepMind)
Date & Time: Thursday, March 20, at 3:00 PM EST (Add to calendar) Abstract: Scaling LLMs is a
Mar 17
Faster LLM Inference Seminar
Mar 4
Today: “Accelerating LLM Inference with vLLM (and SGLang)” - Ion Stoica (Berkeley & Anyscale & Databricks)
Date & Time: Today at 3:30 PM EST (Add to calendar) Abstract: Inference efficiency remains a
unread,
Today: “Accelerating LLM Inference with vLLM (and SGLang)” - Ion Stoica (Berkeley & Anyscale & Databricks)
Date & Time: Today at 3:30 PM EST (Add to calendar) Abstract: Inference efficiency remains a
Mar 4
Nadav Timor
Feb 12
Today at 3pm ET - Hao Zhang (UCSD & Snowflake) on “Efficiently Serving Reasoning Programs with Certaindex”
Date & Time: Today at 3:00 PM EST (Add to calendar) Abstract: The rapid evolution of large
unread,
Today at 3pm ET - Hao Zhang (UCSD & Snowflake) on “Efficiently Serving Reasoning Programs with Certaindex”
Date & Time: Today at 3:00 PM EST (Add to calendar) Abstract: The rapid evolution of large
Feb 12
Nadav Timor
Jan 9
This Monday - Tianqi Chen (OctoAI & CMU) on “Enabling LLM Deployment Across Cloud and Edge with ML Compilation”
Date & Time: Saturday, January 13, 2025, at 1:00 PM EST Abstract: In this talk, we will discuss
unread,
This Monday - Tianqi Chen (OctoAI & CMU) on “Enabling LLM Deployment Across Cloud and Edge with ML Compilation”
Date & Time: Saturday, January 13, 2025, at 1:00 PM EST Abstract: In this talk, we will discuss
Jan 9
Faster LLM Inference Seminar
11/22/24
This Monday - Hongyang Zhang (Waterloo & Vector Institute) on "EAGLE & EAGLE-2: Lossless Inference Acceleration for LLMs"
Registration: https://faster-llms.vercel.app See you then! Nadav
unread,
This Monday - Hongyang Zhang (Waterloo & Vector Institute) on "EAGLE & EAGLE-2: Lossless Inference Acceleration for LLMs"
Registration: https://faster-llms.vercel.app See you then! Nadav
11/22/24
Faster LLM Inference Seminar
11/14/24
Starting in 3 hrs - Ce Zhang (UChicago & Together AI) on "The Token/s Game and Beyond"
Registration: https://faster-llms.vercel.app See you soon! Nadav
unread,
Starting in 3 hrs - Ce Zhang (UChicago & Together AI) on "The Token/s Game and Beyond"
Registration: https://faster-llms.vercel.app See you soon! Nadav
11/14/24
Nadav Timor
9/23/24
Starting in 10 min - Sasha Rush (Cornell & Hugging Face) on "SSMs and The Foundation Model Design Space"
Registration: https://faster-llms.vercel.app/ See you soon! Nadav
unread,
Starting in 10 min - Sasha Rush (Cornell & Hugging Face) on "SSMs and The Foundation Model Design Space"
Registration: https://faster-llms.vercel.app/ See you soon! Nadav
9/23/24
Nadav Timor
8/28/24
Today at noon ET - Xupeng Miao (Purdue University) - "Towards Fast and Affordable Serving Systems for LLMs"
Hi all, Happy to invite you to our session featuring Xupeng Miao from Purdue University. Please
unread,
Today at noon ET - Xupeng Miao (Purdue University) - "Towards Fast and Affordable Serving Systems for LLMs"
Hi all, Happy to invite you to our session featuring Xupeng Miao from Purdue University. Please
8/28/24