wg-serving
Abdullah Gharaibeh (via Google Docs)
Oct 2
Document shared with you: "[Public] Inference Gateway Scheduling Algorithm"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to comment
Clayton Coleman (via Google Sheets)
Sep 16
Spreadsheet shared with you: "[PUBLIC] Kubernetes LLM Instance Gateway MVP Prioritization"
Clayton Coleman shared a spreadsheet Clayton Coleman (smarter...@gmail.com) has invited you to
Yuan Tang (via Google Docs)
Aug 28
Document shared with you: "K8s WG Serving docs and problem exploration"
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to edit the following
Sergey Kanzhelev
Aug 20
Request for repository llm-instance-gateway
Hi, This is a request to create a repository https://github.com/kubernetes-sigs/llm-instance-gateway
loong dai (via Google Docs)
Aug 12
Document shared with you: "Extension for Kubernetes Gateway API"
loong dai shared a document loong dai (looon...@gmail.com) has invited you to edit the following
Clayton Coleman
Aug 7
Proposal for dense LLM serving on Kubernetes via gateway
At today's wg-serving Jiaxin and I want to present a proposal for a project that allows multiple
Yuan Tang (via Google Docs)
Jul 31
Document shared with you: "[Public] KServe vs. Blueprint "
Yuan Tang shared a document Yuan Tang (terryt...@gmail.com) has invited you to comment on the
Abdullah Gharaibeh (via Google Docs)
Jul 18
Document shared with you: "[Public] K8s LLM Serving Catalog"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (ahgha...@gmail.com) has invited you to
Abdullah Gharaibeh (via Google Docs)
Jun 26
Document shared with you: "[Public] Blueprint Is All You Need"
Abdullah Gharaibeh shared a document Abdullah Gharaibeh (a...@google.com) has invited you to edit the
Dan Sun (via Google Docs)
Jun 24
Document shared with you: "Cloud Native LLM Gateway"
Dan Sun shared a document Dan Sun (dansu...@gmail.com) has invited you to edit the following
Clayton Coleman (via Google Docs)
Jun 5
Document shared with you: "[PUBLIC] Kubernetes LLM Inference Autoscaling Examples"
Clayton Coleman shared a document Clayton Coleman (clayton...@google.com) has invited you to edit
Abdullah Gharaibeh
Jun 4
Re: [ANNOUNCE] LeaderWorkerSet v0.3.0 Released!
Great work, lots of enhancements in this release. On Tue, Jun 4, 2024 at 2:00 PM 'Rupeng Liu'
Ray Wainman
May 23
Invitation: WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up @ Thu May 30, 2024 11:30am - 12pm (EDT) (wg-serving@kubernetes.io)
WG-Serving: Autoscaling Deep Dive on Warm Replicas Follow-up Zoom meeting: https://zoom.us/j/