Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
wg-serving
Conversations
Labels
About
Groups keyboard shortcuts have been updated
Dismiss
See shortcuts
wg-serving
Contact owners and managers
1–24 of 24
Mark all as read
Report group
0 selected
Abdullah Gharaibeh (via Google Docs)
May 29
Document shared with you: "Revisiting The InferenceModel API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "Revisiting The InferenceModel API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
May 29
Clayton Coleman (via Google Docs)
Apr 10
Document shared with you: "[PUBLIC] LLM Serving: Latency-Informed Saturation Regimes"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to
unread,
Document shared with you: "[PUBLIC] LLM Serving: Latency-Informed Saturation Regimes"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to
Apr 10
Chen Zicong
Apr 10
[KEP] Gang Scheduling Support for LWS
Hi all, I've submitted a new KEP PR (#496) to add gang scheduling support for LWS. This feature
unread,
[KEP] Gang Scheduling Support for LWS
Hi all, I've submitted a new KEP PR (#496) to add gang scheduling support for LWS. This feature
Apr 10
Kante Yin
Mar 31
[ANNOUNCE] LeaderWorkerSet v0.6.0 is released
Hi all, on behalf of the LWS team, we just released v0.6.0, please refer to the release note for full
unread,
[ANNOUNCE] LeaderWorkerSet v0.6.0 is released
Hi all, on behalf of the LWS team, we just released v0.6.0, please refer to the release note for full
Mar 31
Suyogya Khanal
Mar 18
NHX1@HACKERONE.COM SECURITY TEST
THIS IS A SECURITY TEST! PLEASE DO NOT REPLY Regards, nh...@wearehackerone.com
unread,
NHX1@HACKERONE.COM SECURITY TEST
THIS IS A SECURITY TEST! PLEASE DO NOT REPLY Regards, nh...@wearehackerone.com
Mar 18
Yuan Tang (via Google Docs)
Feb 12
Document shared with you: "[Public Draft] WG-Serving Annual Report 2024"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "[Public Draft] WG-Serving Annual Report 2024"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
Feb 12
Abdullah Gharaibeh (via Google Docs)
Jan 23
Document shared with you: "Gateway Inference Extension “Contracts”"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "Gateway Inference Extension “Contracts”"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
Jan 23
Ashok Chandrasekar
Jan 9
Request for repository inference-perf
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/inference-perf As per
unread,
Request for repository inference-perf
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/inference-perf As per
Jan 9
Abdullah Gharaibeh
,
Cindy Xing
2
Jan 9
Re: [ANNOUNCE] LeaderWorkerSet v0.5.0 released!
Awesome to see this! Congrats to the team and thanks for making it happen. If we want to understand
unread,
Re: [ANNOUNCE] LeaderWorkerSet v0.5.0 released!
Awesome to see this! Congrats to the team and thanks for making it happen. If we want to understand
Jan 9
Abdullah Gharaibeh (via Google Docs)
Jan 5
Document shared with you: "InferencePool Configuration API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "InferencePool Configuration API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
Jan 5
Rob Scott
12/5/24
Renaming Instance Gateway + APIs
Hey Everyone, I've written up a doc that proposes renaming our llm-instance-gateway subproject
unread,
Renaming Instance Gateway + APIs
Hey Everyone, I've written up a doc that proposes renaming our llm-instance-gateway subproject
12/5/24
Abdullah Gharaibeh (via Google Docs)
10/2/24
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
10/2/24
Clayton Coleman (via Google Sheets)
9/16/24
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
unread,
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
9/16/24
Yuan Tang (via Google Docs)
8/28/24
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
8/28/24
Sergey Kanzhelev
8/20/24
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
unread,
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
8/20/24
loong dai (via Google Docs)
8/12/24
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
8/12/24
Clayton Coleman
8/7/24
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
unread,
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
8/7/24
Yuan Tang (via Google Docs)
7/31/24
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
unread,
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
7/31/24
Abdullah Gharaibeh (via Google Docs)
7/18/24
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
7/18/24
Abdullah Gharaibeh (via Google Docs)
6/26/24
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
unread,
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
6/26/24
Dan Sun (via Google Docs)
6/24/24
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
6/24/24
Clayton Coleman (via Google Docs)
6/5/24
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
unread,
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
6/5/24
Abdullah Gharaibeh
6/4/24
Re: [ANNOUNCE] LeaderWorkerSet v0.3.0 Released!
Great work, lots of enhancements in this release. On Tue, Jun 4, 2024 at 2:00 PM 'Rupeng Liu'
unread,
Re: [ANNOUNCE] LeaderWorkerSet v0.3.0 Released!
Great work, lots of enhancements in this release. On Tue, Jun 4, 2024 at 2:00 PM 'Rupeng Liu'
6/4/24
Ray Wainman
5/23/24
Invitation: WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up @ Thu May 30, 2024 11:30am - 12pm (EDT) (wg-serving@kubernetes.io)
WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up Zoom meeting: https://zoom.us/j/
unread,
Invitation: WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up @ Thu May 30, 2024 11:30am - 12pm (EDT) (wg-serving@kubernetes.io)
WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up Zoom meeting: https://zoom.us/j/
5/23/24