Search
Clear search
Close search
Main menu
Google apps
Groups
Sign in
Groups
Faster LLM Inference Seminar
Conversations
About
Send feedback
Help
Faster LLM Inference Seminar
Contact owners and managers
1–8 of 8
Mark all as read
Report group
0 selected
Faster LLM Inference Seminar
Mar 17
On Thursday: “Adaptive Compute LLMs with Early Exits” - Tal Schuster (Google DeepMind)
Date & Time: Thursday, March 20, at 3:00 PM EST (Add to calendar) Abstract: Scaling LLMs is a
unread,
On Thursday: “Adaptive Compute LLMs with Early Exits” - Tal Schuster (Google DeepMind)
Date & Time: Thursday, March 20, at 3:00 PM EST (Add to calendar) Abstract: Scaling LLMs is a
Mar 17
Faster LLM Inference Seminar
Mar 4
Today: “Accelerating LLM Inference with vLLM (and SGLang)” - Ion Stoica (Berkeley & Anyscale & Databricks)
Date & Time: Today at 3:30 PM EST (Add to calendar) Abstract: Inference efficiency remains a
unread,
Today: “Accelerating LLM Inference with vLLM (and SGLang)” - Ion Stoica (Berkeley & Anyscale & Databricks)
Date & Time: Today at 3:30 PM EST (Add to calendar) Abstract: Inference efficiency remains a
Mar 4
Nadav Timor
Feb 12
Today at 3pm ET - Hao Zhang (UCSD & Snowflake) on “Efficiently Serving Reasoning Programs with Certaindex”
Date & Time: Today at 3:00 PM EST (Add to calendar) Abstract: The rapid evolution of large
unread,
Today at 3pm ET - Hao Zhang (UCSD & Snowflake) on “Efficiently Serving Reasoning Programs with Certaindex”
Date & Time: Today at 3:00 PM EST (Add to calendar) Abstract: The rapid evolution of large
Feb 12
Nadav Timor
Jan 9
This Monday - Tianqi Chen (OctoAI & CMU) on “Enabling LLM Deployment Across Cloud and Edge with ML Compilation”
Date & Time: Saturday, January 13, 2025, at 1:00 PM EST Abstract: In this talk, we will discuss
unread,
This Monday - Tianqi Chen (OctoAI & CMU) on “Enabling LLM Deployment Across Cloud and Edge with ML Compilation”
Date & Time: Saturday, January 13, 2025, at 1:00 PM EST Abstract: In this talk, we will discuss
Jan 9
Faster LLM Inference Seminar
11/22/24
This Monday - Hongyang Zhang (Waterloo & Vector Institute) on "EAGLE & EAGLE-2: Lossless Inference Acceleration for LLMs"
Registration: https://faster-llms.vercel.app See you then! Nadav
unread,
This Monday - Hongyang Zhang (Waterloo & Vector Institute) on "EAGLE & EAGLE-2: Lossless Inference Acceleration for LLMs"
Registration: https://faster-llms.vercel.app See you then! Nadav
11/22/24
Faster LLM Inference Seminar
11/14/24
Starting in 3 hrs - Ce Zhang (UChicago & Together AI) on "The Token/s Game and Beyond"
Registration: https://faster-llms.vercel.app See you soon! Nadav
unread,
Starting in 3 hrs - Ce Zhang (UChicago & Together AI) on "The Token/s Game and Beyond"
Registration: https://faster-llms.vercel.app See you soon! Nadav
11/14/24
Nadav Timor
9/23/24
Starting in 10 min - Sasha Rush (Cornell & Hugging Face) on "SSMs and The Foundation Model Design Space"
Registration: https://faster-llms.vercel.app/ See you soon! Nadav
unread,
Starting in 10 min - Sasha Rush (Cornell & Hugging Face) on "SSMs and The Foundation Model Design Space"
Registration: https://faster-llms.vercel.app/ See you soon! Nadav
9/23/24
Nadav Timor
8/28/24
Today at noon ET - Xupeng Miao (Purdue University) - "Towards Fast and Affordable Serving Systems for LLMs"
Hi all, Happy to invite you to our session featuring Xupeng Miao from Purdue University. Please
unread,
Today at noon ET - Xupeng Miao (Purdue University) - "Towards Fast and Affordable Serving Systems for LLMs"
Hi all, Happy to invite you to our session featuring Xupeng Miao from Purdue University. Please
8/28/24