Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
wg-serving
Conversations
Labels
About
Groups keyboard shortcuts have been updated
Dismiss
See shortcuts
wg-serving
Contact owners and managers
1–19 of 19
Mark all as read
Report group
0 selected
Yuan Tang (via Google Docs)
Feb 12
Document shared with you: "[Public Draft] WG-Serving Annual Report 2024"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "[Public Draft] WG-Serving Annual Report 2024"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
Feb 12
Abdullah Gharaibeh (via Google Docs)
Jan 23
Document shared with you: "Gateway Inference Extension “Contracts”"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "Gateway Inference Extension “Contracts”"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
Jan 23
Ashok Chandrasekar
Jan 9
Request for repository inference-perf
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/inference-perf As per
unread,
Request for repository inference-perf
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/inference-perf As per
Jan 9
Abdullah Gharaibeh
,
Cindy Xing
2
Jan 9
Re: [ANNOUNCE] LeaderWorkerSet v0.5.0 released!
Awesome to see this! Congrats to the team and thanks for making it happen. If we want to understand
unread,
Re: [ANNOUNCE] LeaderWorkerSet v0.5.0 released!
Awesome to see this! Congrats to the team and thanks for making it happen. If we want to understand
Jan 9
Abdullah Gharaibeh (via Google Docs)
Jan 5
Document shared with you: "InferencePool Configuration API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "InferencePool Configuration API"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
Jan 5
Rob Scott
12/5/24
Renaming Instance Gateway + APIs
Hey Everyone, I've written up a doc that proposes renaming our llm-instance-gateway subproject
unread,
Renaming Instance Gateway + APIs
Hey Everyone, I've written up a doc that proposes renaming our llm-instance-gateway subproject
12/5/24
Abdullah Gharaibeh (via Google Docs)
10/2/24
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
unread,
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
10/2/24
Clayton Coleman (via Google Sheets)
9/16/24
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
unread,
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
9/16/24
Yuan Tang (via Google Docs)
8/28/24
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
8/28/24
Sergey Kanzhelev
8/20/24
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
unread,
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
8/20/24
loong dai (via Google Docs)
8/12/24
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
8/12/24
Clayton Coleman
8/7/24
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
unread,
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
8/7/24
Yuan Tang (via Google Docs)
7/31/24
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
unread,
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
7/31/24
Abdullah Gharaibeh (via Google Docs)
7/18/24
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
unread,
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
7/18/24
Abdullah Gharaibeh (via Google Docs)
6/26/24
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
unread,
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
6/26/24
Dan Sun (via Google Docs)
6/24/24
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
unread,
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
6/24/24
Clayton Coleman (via Google Docs)
6/5/24
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
unread,
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
6/5/24
Abdullah Gharaibeh
6/4/24
Re: [ANNOUNCE] LeaderWorkerSet v0.3.0 Released!
Great work, lots of enhancements in this release. On Tue, Jun 4, 2024 at 2:00 PM 'Rupeng Liu'
unread,
Re: [ANNOUNCE] LeaderWorkerSet v0.3.0 Released!
Great work, lots of enhancements in this release. On Tue, Jun 4, 2024 at 2:00 PM 'Rupeng Liu'
6/4/24
Ray Wainman
5/23/24
Invitation: WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up @ Thu May 30, 2024 11:30am - 12pm (EDT) (wg-serving@kubernetes.io)
WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up Zoom meeting: https://zoom.us/j/
unread,
Invitation: WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up @ Thu May 30, 2024 11:30am - 12pm (EDT) (wg-serving@kubernetes.io)
WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up Zoom meeting: https://zoom.us/j/
5/23/24