Groups
Groups
Sign in
Groups
Groups
wg-serving
Conversations
Labels
About
Send feedback
Help
wg-serving
Contact owners and managers
1–30 of 32
Mark all as read
Report group
0 selected
Bob Tian (via Google Docs)
Jan 8
Document shared with you: "PoC: gRPC support using standalone EPP"
Bob Tian shared a document Bob Tian (bobz...@google.com) has invited you to comment on the
unread,
Document shared with you: "PoC: gRPC support using standalone EPP"
Bob Tian shared a document Bob Tian (bobz...@google.com) has invited you to comment on the
Jan 8
Abdullah Gharaibeh (via Google Docs)
10/23/25
Document shared with you: "Serving Online Batch via Inference Gateway"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "Serving Online Batch via Inference Gateway"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
10/23/25
Abdullah Gharaibeh (via Google Docs)
10/15/25
Document shared with you: "[Public] EPP As a Standalone Request Scheduler"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "[Public] EPP As a Standalone Request Scheduler"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
10/15/25
Yuan Tang
10/3/25
Zoom Meeting Updates for Contributors Meetings
Hi all, We have updated the Zoom meeting links for the following two contributors meetings on the WG
unread,
Zoom Meeting Updates for Contributors Meetings
Hi all, We have updated the Zoom meeting links for the following two contributors meetings on the WG
10/3/25
Yuan Tang
, …
Tu Xudong
20
9/4/25
Re: Proposal for a Kubernetes AI Conformance Working Group
Hi, everyone First, i would like to join. Second, we're working on 1) broadcast this import trend
unread,
Re: Proposal for a Kubernetes AI Conformance Working Group
Hi, everyone First, i would like to join. Second, we're working on 1) broadcast this import trend
9/4/25
Kante Yin
8/5/25
[ANNOUNCE] LeaderWorkerSet v0.7.0 is released
Hi folks, On behalf of LWS team, we're very excited to announce that we just released lws v0.7.0,
unread,
[ANNOUNCE] LeaderWorkerSet v0.7.0 is released
Hi folks, On behalf of LWS team, we're very excited to announce that we just released lws v0.7.0,
8/5/25
NIR ROZENBAUM
, …
Shane Utt
4
7/22/25
[ANNOUNCE] Gateway API Inference Extension v0.5.0 is released
Thanks to everyone for the hard work on the GIE! 🎉 On Mon, Jul 21, 2025 at 7:10 PM Xunzhuo <
unread,
[ANNOUNCE] Gateway API Inference Extension v0.5.0 is released
Thanks to everyone for the hard work on the GIE! 🎉 On Mon, Jul 21, 2025 at 7:10 PM Xunzhuo <
7/22/25
Daneyon Hansen
6/23/25
[ANNOUNCE] Inference Gateway v0.4.0 is Released
All, I am honored to announce the v0.4.0 release of Inference Gateway —our biggest update yet! This
unread,
[ANNOUNCE] Inference Gateway v0.4.0 is Released
All, I am honored to announce the v0.4.0 release of Inference Gateway —our biggest update yet! This
6/23/25
Abdullah Gharaibeh (via Google Docs)
5/29/25
Document shared with you: "Revisiting The InferenceModel API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "Revisiting The InferenceModel API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
5/29/25
Clayton Coleman (via Google Docs)
4/10/25
Document shared with you: "[PUBLIC] LLM Serving: Latency-Informed Saturation Regimes"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to
unread,
Document shared with you: "[PUBLIC] LLM Serving: Latency-Informed Saturation Regimes"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to
4/10/25
Chen Zicong
4/10/25
[KEP] Gang Scheduling Support for LWS
Hi all, I've submitted a new KEP PR (#496) to add gang scheduling support for LWS. This feature
unread,
[KEP] Gang Scheduling Support for LWS
Hi all, I've submitted a new KEP PR (#496) to add gang scheduling support for LWS. This feature
4/10/25
Kante Yin
3/31/25
[ANNOUNCE] LeaderWorkerSet v0.6.0 is released
Hi all, on behalf of the LWS team, we just released v0.6.0, please refer to the release note for full
unread,
[ANNOUNCE] LeaderWorkerSet v0.6.0 is released
Hi all, on behalf of the LWS team, we just released v0.6.0, please refer to the release note for full
3/31/25
Suyogya Khanal
3/18/25
NHX1@HACKERONE.COM SECURITY TEST
THIS IS A SECURITY TEST! PLEASE DO NOT REPLY Regards, nh...@wearehackerone.com
unread,
NHX1@HACKERONE.COM SECURITY TEST
THIS IS A SECURITY TEST! PLEASE DO NOT REPLY Regards, nh...@wearehackerone.com
3/18/25
Yuan Tang (via Google Docs)
2/12/25
Document shared with you: "[Public Draft] WG-Serving Annual Report 2024"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "[Public Draft] WG-Serving Annual Report 2024"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
2/12/25
Abdullah Gharaibeh (via Google Docs)
1/23/25
Document shared with you: "Gateway Inference Extension “Contracts”"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "Gateway Inference Extension “Contracts”"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
1/23/25
Ashok Chandrasekar
1/9/25
Request for repository inference-perf
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/inference-perf As per
unread,
Request for repository inference-perf
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/inference-perf As per
1/9/25
Abdullah Gharaibeh
,
Cindy Xing
2
1/9/25
Re: [ANNOUNCE] LeaderWorkerSet v0.5.0 released!
Awesome to see this! Congrats to the team and thanks for making it happen. If we want to understand
unread,
Re: [ANNOUNCE] LeaderWorkerSet v0.5.0 released!
Awesome to see this! Congrats to the team and thanks for making it happen. If we want to understand
1/9/25
Abdullah Gharaibeh (via Google Docs)
1/5/25
Document shared with you: "InferencePool Configuration API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "InferencePool Configuration API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
1/5/25
Rob Scott
12/5/24
Renaming Instance Gateway + APIs
Hey Everyone, I've written up a doc that proposes renaming our llm-instance-gateway subproject
unread,
Renaming Instance Gateway + APIs
Hey Everyone, I've written up a doc that proposes renaming our llm-instance-gateway subproject
12/5/24
Abdullah Gharaibeh (via Google Docs)
10/2/24
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
10/2/24
Clayton Coleman (via Google Sheets)
9/16/24
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
unread,
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
9/16/24
Yuan Tang (via Google Docs)
8/28/24
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
8/28/24
Sergey Kanzhelev
8/20/24
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
unread,
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
8/20/24
loong dai (via Google Docs)
8/12/24
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
8/12/24
Clayton Coleman
8/7/24
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
unread,
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
8/7/24
Yuan Tang (via Google Docs)
7/31/24
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
unread,
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
7/31/24
Abdullah Gharaibeh (via Google Docs)
7/18/24
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
7/18/24
Abdullah Gharaibeh (via Google Docs)
6/26/24
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
unread,
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
6/26/24
Dan Sun (via Google Docs)
6/24/24
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
6/24/24
Clayton Coleman (via Google Docs)
6/5/24
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
unread,
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
6/5/24