[blink-dev] Intent to Ship: Prompt API

Deepti Bogadi

unread,

Apr 1, 2026, 6:58:12 PMApr 1

to blink-dev, Mike Wasserman, Kenji Baheux, Etienne Noël, Rob Kochman

Contact emails

m...@chromium.org, rei...@chromium.org, kenji...@chromium.org, dbo...@chromium.org

Explainer

https://github.com/webmachinelearning/prompt-api/blob/main/README.md

Specification

http://webmachinelearning.github.io/prompt-api

Summary

The Prompt API gives web developers direct access to a browser-provided on-device AI language model. The API design offers fine-grained control, aligned with cloud API shapes, for progressively enhancing sites with model interactions tailored to individualized use cases. This compliments task-based language model APIs (e.g. Summarizer API), and varied APIs and frameworks for generalized on-device inference with developer-supplied ML models. The initial implementation supports text, image, and audio inputs, as well as response constraints that ensure generated text conforms with predefined regex and JSON schema formats.

This supports a variety of use cases, from generating image captions and performing visual searches to transcribing audio, classifying sound events, generating text following specific instructions, and extracting information or insights from multimodal source material.

This API has already been shipped in Chrome Extensions; this intent tracks the shipping on the web. An enterprise policy GenAILocalFoundationalModelSettings is available to disable the underlying model downloading, which would render this API unavailable. Enterprise admins can also set the BuiltInAIAPIsEnabled policy to block Built-In AI API usage, while still permitting other on-device GenAI features.

Language support log:

Chrome M139 and earlier only supported English ('en')
Chrome M140 added support for Spanish and Japanese ('es' and 'ja')

Blink component

Blink > AI > Prompt

Web Feature ID

https://github.com/web-platform-dx/web-features/issues/3530

Motivation

Direct access to a language model can help web developers accomplish tasks beyond those with dedicated APIs (e.g. Summarizer API) , and tailor their usage for site-specific requirements. Compared to the low-level APIs approach (e.g a custom AI model run via WebGPU, WASM, or WebNN), using the built-in language model can save the user's bandwidth and disk space, and has a lower barrier to entry. The design offers simple shorthands for common patterns (e.g. await session.prompt(‘write a haiku’)), and supports more complex use cases for handling structured content sequences, streaming responses, availability checks, session management, and response constraints.

Initial public proposal

https://github.com/webmachinelearning/charter/pull/9

Search tags

LanguageModel, Language Model, Prompt API, Built-in AI

TAG review

https://github.com/w3ctag/design-reviews/issues/1093

TAG review status

Issues addressed

WebFeature UseCounter name

kLanguageModel_Create

Risks

Interoperability and Compatibility

The Prompt API is designed to provide a stable and interoperable surface for language model interactions, acknowledging the inherent diversity and non-deterministic nature of underlying models. Variance in behaviors and responses is a well understood expectation amongst developers employing this technology, and this API aims to provide an interoperable framework for consistent web platform access across browsers and models.

The Prompt API specifically aims to maximize compatibility by:

- Codifying an interoperable API surface for generalized language model interactions, so developers can write code that works across different browser engines and models. This surface has demonstrated compatibility with models from Google and Microsoft, and been polyfilled by extensions and JS frameworks, using different backends.

- Enforcing objective response conformance with constraints that ensure output adheres to known JSON schemas or regexes for interoperable processing of generated text.

- Supporting progressive enhancement patterns, by offering availability signals that encapsulate device and model support dimensions, and encourage developers to consider this API as one option among varied compatible AI offerings, including developer supplied models and cloud-based services.

Shipping this API provides a critical opportunity to broaden real-world implementation experience, explore future refinements, and collaborate with the web community on interoperable model diversity within a robust, predictable platform surface.

Gecko: Negative (https://github.com/mozilla/standards-positions/issues/1213)

WebKit: No signal (https://github.com/WebKit/standards-positions/issues/495)

Web developers: Strongly positive (https://github.com/webmachinelearning/prompt-api/blob/main/README.md#stakeholder-feedback)

Other signals: Microsoft Edge developers have been strong collaborators with notable contributions including structured output, and experimental tool use enhancements. Edge will be shipping this API using a different underlying model.

Ergonomics

The API deprecated parameters and renamed identifiers, leaving legacy access for previously launched extension contexts. We plan to align the web and extension surfaces through careful additive changes and cautious deprecation processes. Developers are encouraged to use the new identifier names in both contexts and observe deprecation messages regarding planned API alignments.

Activation

This feature would definitely benefit from having polyfills, backed by any of: cloud services, lazily-loaded client-side models using WebGPU/WASM/WebNN, or the web developer's own server. We anticipate seeing an ecosystem of polyfills and client frameworks grow as more developers experiment with this API.

WebView application risks

Does this intent deprecate or change behavior of existing APIs, such that it has potentially high risk for Android WebView-based applications?

Not Applicable; this API is not available in WebView.

Debuggability

The API surface supports basic DevTools debugging. Perfetto tracing (via optimization_guide and other events) is useful, and internal debugging pages which give more detail on the model's status, e.g. chrome://on-device-internals might be suitable to port into DevTools. The team is maintaining extensions of DevTools panels for improving debuggability. It is possible that giving more insight into the nondeterministic states of the model, e.g. random seeds, could help with debugging.

Will this feature be supported on all six Blink platforms (Windows, Mac, Linux, ChromeOS, Android, and Android WebView)?

No

The initial launch focuses on Windows, Mac, Linux, and ChromeOS (on Chromebook Plus devices). An implementation for Android using that platform’s OS-level built-in language model is being prototyped and will ship after the initial launch.

Is this feature fully tested by web-platform-tests?

No

Web platform tests cover the API surface adequately: https://wpt.fyi/results/ai/language-model These attempt to mitigate execution environments differences, e.g. stub/full implementations (content_shell, chrome), and device/model states (unavailable, downloadable, downloaded). The core responses of real models can be unpredictable (especially without sampling parameters) and may cause inconsistent test results, but some facets are more readily testable, e.g. the adherence to structured output response constraints. Test coverage and reliability improvements are ongoing, including planning for WebDriver extensions.

DevTrial instructions

https://developer.chrome.com/docs/ai/prompt-api

Flag name on about://flags

prompt-api-for-gemini-nano-multimodal-input

Finch feature name

AIPromptAPIMultimodalInput

Rollout plan

Will ship enabled for all users

Requires code in //chrome?

True

Tracking bug

https://crbug.com/417526788

Launch bug

https://launch.corp.google.com/launch/4461863

Measurement

The API has use counters for all methods and attributes e.g.: LanguageModel_Create LanguageModel_Availability LanguageModel_Prompt LanguageModel_PromptStreaming LanguageModel_Append LanguageModel_MeasureContextUsage LanguageModel_OnContextOverflow LanguageModel_ContextUsage LanguageModel_ContextWindow LanguageModel_Clone LanguageModel_Destroy

Non-OSS dependencies

Does the feature depend on any code or APIs outside the Chromium open source repository and its open-source dependencies to function?

Yes: this feature depends on a language model, which is bridged to the open-source parts of the implementation via the interfaces in //services/on_device_model.

Estimated milestones

Shipping on desktop	148
Origin trial desktop first	139
Origin trial desktop last	144
Origin trial extension 1 end milestone	147
DevTrial on desktop	137

Anticipated spec changes

Open questions about a feature may be a source of future web compat or interop issues. Please list open issues (e.g. links to known github issues in the project for the feature specification) whose resolution may introduce web compat/interop risk (e.g., changing to naming or structure of the API in a non-backward-compatible way).

Params may be re-added after addressing interop concerns: https://github.com/webmachinelearning/prompt-api/issues/170 Identifiers have been renamed for clarity before Web GA launch: https://github.com/webmachinelearning/prompt-api/issues/177 Any post-launch additive changes should be backwards compatible: e.g. tool use, multimodal sampling info/options and outputs, session history access, model info/options, etc.

Link to entry on the Chrome Platform Status

https://chromestatus.com/feature/5134603979063296?gate=5123192519393280

Links to previous Intent discussions

Intent to Prototype: https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAM0wra_LXU8KkcVJ0x%3DzYa4h_sC3FaHGdaoM59FNwwtRAsOALQ%40mail.gmail.com

Intent to Experiment: https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAM0wra9oT0jygAYT00WPp0_wtZ-znrB2OdZ6GQb%2B3thFLP19pA%40mail.gmail.com

Intent to Extend Experiment 1: https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAJcT_ZhyheBntZHMEwFJA%3DuhpkWmDx8yFieL5E5g%2Bwp5UA0mzQ%40mail.gmail.com

This intent message was generated by Chrome Platform Status.

Mike Taylor

unread,

Apr 3, 2026, 3:53:12 PMApr 3

to Deepti Bogadi, blink-dev, Mike Wasserman, Kenji Baheux, Etienne Noël, Rob Kochman

LGTM1

The only thing I was going to request was already taken care of: pinging the Mozilla issue and providing a pointer to this thread (since it's been about a year - maybe their position has evolved).

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAJcT_Zj73wjXZfmMcpQRWePp-H%3D5LzxYBOnasViYcn%3DFzY2vVQ%40mail.gmail.com.

Rick Byers

unread,

Apr 6, 2026, 4:23:22 PMApr 6

to Mike Taylor, Deepti Bogadi, blink-dev, Mike Wasserman, Kenji Baheux, Etienne Noël, Rob Kochman

Can you elaborate on this? I see 14 issues were filed in the spec repo based on TAG feedback, but all are still 'open' with little comment. As you mentioned below, the interoperable model parameters one was addressed by leaving these as "experimental", so I assume they are not covered by this I2S.

Many of these issues don't seem particularly actionable to me - they represent inherent risks in this space which, personally, I think we should be comfortable taking, learning from and iterating on post-ship. It's fine to disagree with TAG on this (especially since even TAG couldn't agree among themselves - the issue is formally 'lacks consensus'). But maybe there are still one or two things we should be doing now to reduce future interop risk?

Can you do a triage pass over the issues and flag which are fine to leave for future consideration (as a non-breaking enhancement), which have been mitigated for now (eg. model params), and which are ones the group just disagrees with and wants to close? If there are any which seem like they have a good chance of leading to future interop risk, just call those out. Personally interop risk here seem unavoidable and I appreciate the amount of effort that's gone into collaborating between two different implementations (Google and Microsoft). Given the developer demand I do think we should be shipping, I just want the transparency of being clear on our thinking at this point to maximize learning.

WebFeature UseCounter name

kLanguageModel_Create

Risks

Interoperability and Compatibility

The Prompt API is designed to provide a stable and interoperable surface for language model interactions, acknowledging the inherent diversity and non-deterministic nature of underlying models. Variance in behaviors and responses is a well understood expectation amongst developers employing this technology, and this API aims to provide an interoperable framework for consistent web platform access across browsers and models.

The Prompt API specifically aims to maximize compatibility by:

- Codifying an interoperable API surface for generalized language model interactions, so developers can write code that works across different browser engines and models. This surface has demonstrated compatibility with models from Google and Microsoft, and been polyfilled by extensions and JS frameworks, using different backends.

- Enforcing objective response conformance with constraints that ensure output adheres to known JSON schemas or regexes for interoperable processing of generated text.

- Supporting progressive enhancement patterns, by offering availability signals that encapsulate device and model support dimensions, and encourage developers to consider this API as one option among varied compatible AI offerings, including developer supplied models and cloud-based services.

Shipping this API provides a critical opportunity to broaden real-world implementation experience, explore future refinements, and collaborate with the web community on interoperable model diversity within a robust, predictable platform surface.

Agreed, thank you!

Gecko: Negative (https://github.com/mozilla/standards-positions/issues/1213)

WebKit: No signal (https://github.com/WebKit/standards-positions/issues/495)

Web developers: Strongly positive (https://github.com/webmachinelearning/prompt-api/blob/main/README.md#stakeholder-feedback)

Other signals: Microsoft Edge developers have been strong collaborators with notable contributions including structured output, and experimental tool use enhancements. Edge will be shipping this API using a different underlying model.

Ergonomics

The API deprecated parameters and renamed identifiers, leaving legacy access for previously launched extension contexts. We plan to align the web and extension surfaces through careful additive changes and cautious deprecation processes. Developers are encouraged to use the new identifier names in both contexts and observe deprecation messages regarding planned API alignments.

Activation

This feature would definitely benefit from having polyfills, backed by any of: cloud services, lazily-loaded client-side models using WebGPU/WASM/WebNN, or the web developer's own server. We anticipate seeing an ecosystem of polyfills and client frameworks grow as more developers experiment with this API.

WebView application risks

Does this intent deprecate or change behavior of existing APIs, such that it has potentially high risk for Android WebView-based applications?

Not Applicable; this API is not available in WebView.

Debuggability

The API surface supports basic DevTools debugging. Perfetto tracing (via optimization_guide and other events) is useful, and internal debugging pages which give more detail on the model's status, e.g. chrome://on-device-internals might be suitable to port into DevTools. The team is maintaining extensions of DevTools panels for improving debuggability. It is possible that giving more insight into the nondeterministic states of the model, e.g. random seeds, could help with debugging.

Will this feature be supported on all six Blink platforms (Windows, Mac, Linux, ChromeOS, Android, and Android WebView)?

No

The initial launch focuses on Windows, Mac, Linux, and ChromeOS (on Chromebook Plus devices). An implementation for Android using that platform’s OS-level built-in language model is being prototyped and will ship after the initial launch.

Is this feature fully tested by web-platform-tests?

No

Web platform tests cover the API surface adequately: https://wpt.fyi/results/ai/language-model These attempt to mitigate execution environments differences, e.g. stub/full implementations (content_shell, chrome), and device/model states (unavailable, downloadable, downloaded).

Yeah that's probably the most we can ask for in this intent. However most of these tests appear to be failing on wpt.fyi. Can you please triage all the failures on wpt.fyi and either fix them so they're passing or explain why the failure doesn't represent a real interop issue?

The core responses of real models can be unpredictable (especially without sampling parameters) and may cause inconsistent test results, but some facets are more readily testable, e.g. the adherence to structured output response constraints. Test coverage and reliability improvements are ongoing, including planning for WebDriver extensions.

Are there tracking issues in the CG regarding conformance testing that you can point to? Are there discussions somewhere around benchmarks / eval suites? Just like with our other big non-determinstic area of competition in browsers, web performance, having open benchmarks can be very useful for promoting compatibility.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/c257bea5-b444-42ab-b195-4025930206f2%40chromium.org.

Mike Wasserman

unread,

Apr 6, 2026, 6:04:47 PMApr 6

to blink-dev, Rick Byers, Deepti Bogadi, blink-dev, Mike Wasserman, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor

Regarding TAG review status:
The scope of issues raised is broad. We conducted thoughtful google-internal reviews and triage, and prioritized addressing issues that couldn't be addressed after launch by backwards-compatible spec and implementation changes. In that regard, sampling parameters are not part of this initial launch proposal, and various API identifiers have been renamed. We still need to respond to the rest; some touch on inherent risks of platform advancements in this nascent domain; many could benefit from outlining non-normative best practices for implementers to respect device resources and real world costs. We will expedite sharing those responses.

Regarding wpt.fyi test failures:

Almost all failures are model download timeouts; we hope to devise a way to retain or sideload models in wpt.fyi's continuous integration infrastructure.

Regarding conformance testing and benchmarks / eval suites:
Web platform tests should continually expand to include more substantive conformance coverage, ideally including support for more objective analysis of natural language. We have also conducted initial research on cross-browser interop and are collaborating on followups, including Microsoft fixing several bugs and planning model training around those shared use cases of interest. Microsoft collaborators also explored a very rough quality benchmark. We are also discussing internally and with partners about use cases and data sets that can be shared publicly and use to represent performance baselines on different representative devices. We don't have yet a formal open/public discussion yet, but we have been iterating on some eval tests, e.g. https://web-ai.studio/cortex. We will continue to iterate and engage with the goal of formalizing open quality and performance eval suites.

Adding color to the "strongly positive" web developers feedback statement:

We have run a number of hackathons, especially the Google Chrome Built-in AI Challenge 2025, and have a good number of engagement from specific partner explorations. We also get anecdotal positive organic signals from press articles, forum threads, and direct feedback.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAJcT_Zj73wjXZfmMcpQRWePp-H%3D5LzxYBOnasViYcn%3DFzY2vVQ%40mail.gmail.com.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

Vladimir Levin

unread,

Apr 7, 2026, 10:45:10 AMApr 7

to Mike Wasserman, blink-dev, Rick Byers, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor

On Mon, Apr 6, 2026 at 6:04 PM Mike Wasserman <m...@chromium.org> wrote:

Regarding TAG review status:
The scope of issues raised is broad. We conducted thoughtful google-internal reviews and triage, and prioritized addressing issues that couldn't be addressed after launch by backwards-compatible spec and implementation changes. In that regard, sampling parameters are not part of this initial launch proposal, and various API identifiers have been renamed.

Is the explainer up-to-date with the proposed changes? I'm struggling to understand exactly what will ship if this intent is approved. As an aside, the explainer starts with "This proposal is an early design sketch ...", which I suspect should be updated to indicate that this is close to shipping.

Also in the explainer: "The following features have been recently renamed. The legacy aliases are deprecated, and clients should update their code ...". Was this an OT or developer trial or just people trying this out?

Generally, I agree that we should ship something in this space like this API so thank you for working on this! I'm a little worried about stability of the API shape though. Specifically, I'm wondering if we've had developer feedback on the shape of the API and that it accomplishes the needed goals. Can you comment on this? As a concrete example, specifying "role" as hard-coded strings either "user" or "assistant" seems a little brittle to me as compared to some declared constant in LanguageModel. Again, I'm not suggesting that these should change, but simply asking if real-world developers had a chance to explore this feature and provide feedback

Thanks!

Vlad

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAJcT_Zj73wjXZfmMcpQRWePp-H%3D5LzxYBOnasViYcn%3DFzY2vVQ%40mail.gmail.com.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/c257bea5-b444-42ab-b195-4025930206f2%40chromium.org.

--

You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/fd88dc6e-a5fd-4dfd-9a8a-78971c13d363n%40chromium.org.

Mike Wasserman

unread,

Apr 7, 2026, 3:00:02 PMApr 7

to blink-dev, Vladimir Levin, blink-dev, Rick Byers, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Mike Wasserman

See responses inline, thanks!

On Tuesday, April 7, 2026 at 7:45:10 AM UTC-7 Vladimir Levin wrote:

On Mon, Apr 6, 2026 at 6:04 PM Mike Wasserman <m...@chromium.org> wrote:
Regarding TAG review status:
The scope of issues raised is broad. We conducted thoughtful google-internal reviews and triage, and prioritized addressing issues that couldn't be addressed after launch by backwards-compatible spec and implementation changes. In that regard, sampling parameters are not part of this initial launch proposal, and various API identifiers have been renamed.

Is the explainer up-to-date with the proposed changes? I'm struggling to understand exactly what will ship if this intent is approved. As an aside, the explainer starts with "This proposal is an early design sketch ...", which I suspect should be updated to indicate that this is close to shipping.

Thanks for this question. While we've kept many aspects of the explainer updated with changes for the initial web platform API launch, we've also been representing enhancements slated for experimentation after the initial launch (e.g. tool use, sampling parameters), and other parts have grown stale. I'm preparing an explainer PR to update and clarify this document in the next day or two.

Also in the explainer: "The following features have been recently renamed. The legacy aliases are deprecated, and clients should update their code ...". Was this an OT or developer trial or just people trying this out?

The API has been in Web Origin Trial from Chrome 139 (August 2025), and launched general availability for Chrome Extensions in Chrome 138 (June 2025). TAG feedback provided in Nov 2025 suggested these renames; spec and impl renames were performed in Feb 2026. Their legacy aliases are deprecated in extension contexts and will be removed in due course to align with the Web API.

Generally, I agree that we should ship something in this space like this API so thank you for working on this! I'm a little worried about stability of the API shape though. Specifically, I'm wondering if we've had developer feedback on the shape of the API and that it accomplishes the needed goals. Can you comment on this? As a concrete example, specifying "role" as hard-coded strings either "user" or "assistant" seems a little brittle to me as compared to some declared constant in LanguageModel. Again, I'm not suggesting that these should change, but simply asking if real-world developers had a chance to explore this feature and provide feedback

We believe API shape has stabilized through comprehensive experimentation phases (lengthy Extension+Web Dev Trial, early partner programs and hackathons, Extension Origin Trial and GA Launch, extended Web Origin Trial), and is ready for shipping to the open Web in Chrome. Most developer feedback about the API shape was in early Dev Trial stages, which precipitated many improvements for ergonomics and extensibility, reflected in commits during the first half of 2025 by our retired spec editor. These included the addition of an append() method, message and content sequence refinements (e.g. interleaving multimodal content of the same role), continued support for `prompt("write a poem")` shorthand, and more.

Since then, many real world developers have integrated this API in real sites and exploratory demos, while others have built polyfills and incorporated use in JS toolkits. We've additionally solicited feedback through additional channels (e.g. devtools console messages on session creation), and recent API shape feedback has generally been positive; no issues have been raised regarding the LanguageModelMessageRole enum and its use in the LanguageModelMessage dictionary. Our team is happy to continue discussing any thoughtful feedback through issues filed in the spec repository or Chromium issue tracker.

Thanks!
Vlad

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAJcT_Zj73wjXZfmMcpQRWePp-H%3D5LzxYBOnasViYcn%3DFzY2vVQ%40mail.gmail.com.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/c257bea5-b444-42ab-b195-4025930206f2%40chromium.org.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

Rick Byers

unread,

Apr 7, 2026, 3:43:45 PMApr 7

to Mike Wasserman, blink-dev, Vladimir Levin, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor

On Tue, Apr 7, 2026 at 3:00 PM Mike Wasserman <m...@chromium.org> wrote:

See responses inline, thanks!

On Tuesday, April 7, 2026 at 7:45:10 AM UTC-7 Vladimir Levin wrote:
On Mon, Apr 6, 2026 at 6:04 PM Mike Wasserman <m...@chromium.org> wrote:
Regarding TAG review status:
The scope of issues raised is broad. We conducted thoughtful google-internal reviews and triage, and prioritized addressing issues that couldn't be addressed after launch by backwards-compatible spec and implementation changes. In that regard, sampling parameters are not part of this initial launch proposal, and various API identifiers have been renamed.

Thanks Mike. From the triage doc I see there were three ship blockers identified and now addressed:

[Tag Review] - Quota vs. context window

[Tag Review] - Interoperable model parameters

measureInputUsage includes <model> start response token

I agree the others are non-blocking. Thank you! Broadly from all the public dialog and collaboration I've seen I trust you and the team to keep investing in the tricky tradeoffs here and expanding the industry alignment. So this is good enough for me for now.

Is the explainer up-to-date with the proposed changes? I'm struggling to understand exactly what will ship if this intent is approved. As an aside, the explainer starts with "This proposal is an early design sketch ...", which I suspect should be updated to indicate that this is close to shipping.

Thanks for this question. While we've kept many aspects of the explainer updated with changes for the initial web platform API launch, we've also been representing enhancements slated for experimentation after the initial launch (e.g. tool use, sampling parameters), and other parts have grown stale. I'm preparing an explainer PR to update and clarify this document in the next day or two.

Also in the explainer: "The following features have been recently renamed. The legacy aliases are deprecated, and clients should update their code ...". Was this an OT or developer trial or just people trying this out?

The API has been in Web Origin Trial from Chrome 139 (August 2025), and launched general availability for Chrome Extensions in Chrome 138 (June 2025). TAG feedback provided in Nov 2025 suggested these renames; spec and impl renames were performed in Feb 2026. Their legacy aliases are deprecated in extension contexts and will be removed in due course to align with the Web API.

Generally, I agree that we should ship something in this space like this API so thank you for working on this! I'm a little worried about stability of the API shape though. Specifically, I'm wondering if we've had developer feedback on the shape of the API and that it accomplishes the needed goals. Can you comment on this? As a concrete example, specifying "role" as hard-coded strings either "user" or "assistant" seems a little brittle to me as compared to some declared constant in LanguageModel. Again, I'm not suggesting that these should change, but simply asking if real-world developers had a chance to explore this feature and provide feedback

We believe API shape has stabilized through comprehensive experimentation phases (lengthy Extension+Web Dev Trial, early partner programs and hackathons, Extension Origin Trial and GA Launch, extended Web Origin Trial), and is ready for shipping to the open Web in Chrome. Most developer feedback about the API shape was in early Dev Trial stages, which precipitated many improvements for ergonomics and extensibility, reflected in commits during the first half of 2025 by our retired spec editor. These included the addition of an append() method, message and content sequence refinements (e.g. interleaving multimodal content of the same role), continued support for `prompt("write a poem")` shorthand, and more.

Since then, many real world developers have integrated this API in real sites and exploratory demos, while others have built polyfills and incorporated use in JS toolkits. We've additionally solicited feedback through additional channels (e.g. devtools console messages on session creation), and recent API shape feedback has generally been positive; no issues have been raised regarding the LanguageModelMessageRole enum and its use in the LanguageModelMessage dictionary. Our team is happy to continue discussing any thoughtful feedback through issues filed in the spec repository or Chromium issue tracker.

Thanks!
Vlad

We still need to respond to the rest; some touch on inherent risks of platform advancements in this nascent domain; many could benefit from outlining non-normative best practices for implementers to respect device resources and real world costs. We will expedite sharing those responses.

Regarding wpt.fyi test failures:
Almost all failures are model download timeouts; we hope to devise a way to retain or sideload models in wpt.fyi's continuous integration infrastructure.

Thanks. Do you have a report of some kind about what's passing vs. failing in a harness that avoids the model download issue? I ran a few of the tests myself manually (on wpt.live) and they passed. So I'm assuming we're broadly in good shape, right? Just want to know details (a few known failures with bugs is fine).

Mike Wasserman

unread,

Apr 7, 2026, 7:16:35 PMApr 7

to blink-dev, Rick Byers, blink-dev, Vladimir Levin, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Mike Wasserman

Thanks for asking. Internally, we have a separate python script that side-loads pre-downloaded models and configurations to run WPTs on Chrome builds, those are run continuously on FYI internal.optimization_guide "wpt bots" with a separate set of /third_party/blink/web_tests/AIExpectations, which enumerates known failures (inherent side-loading availability values, and hopefully solvable platform-specific timeouts). There's a single new Mac-x64 failure there that I've been meaning to address this week.

I also took the liberty of manually running the 28 WPT files in https://wpt.live/ai/language-model/ on Chrome Canary and observed: 90/95 tests pass that pertain to proposed stable API surfaces. Two failures are missing `gc()` WPT method references, two pertain to multimodal session creation, and the last is a context usage measurement OBO. I will file issues and followup with all of these. There are another 43 tests related to experiments for params and tool use that aren't part of the initial API launch.

Vladimir Levin

unread,

Apr 8, 2026, 10:47:23 AMApr 8

to Mike Wasserman, blink-dev, Rick Byers, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor

Thank you for your responses.

LGTM2

Thanks,

Vlad

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

Rick Byers

unread,

Apr 8, 2026, 11:12:11 AMApr 8

to Vladimir Levin, Mike Wasserman, blink-dev, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor

Thank you for the additional details. Sounds good, LGTM3

Alex Russell

unread,

Apr 30, 2026, 5:19:02 PMApr 30

to blink-dev, Rick Byers, Mike Wasserman, blink-dev, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

Hey folks,

First, an apology: I've had concerns about this API for a long while but simply missed that we had 3 LGTMs for 148. We're now hours from branch, and it would be pretty anti-social to -1 at this point. However, I think we need to consider finching the Prompt API off for 148 and moving back to OT for a few reasons.

First, none of the OT feedback I've seen, or engagement that the Edge team has separately had with developers, suggests that an unversioned multi-modal client-side model will be able to avoid serious application breakage if used as the basis for core app functionality. I've mentioned to various folks overthe past year+ that it would be useful if we had a way to call out capabilities, versions, or other parameters at construction time, but the spec doesn't seem to have any discussion of that flexibility today:

https://webmachinelearning.github.io/prompt-api/#dictdef-languagemodelcreateoptions

Microsoft's own experiments with alternative models suggest getting to interoperability with multi-modality via a different training stack is not a given, but feedback about the API only suggests workarounds, rather than API enhancements we'd expect in response to OT feedback:

https://github.com/webmachinelearning/prompt-api/issues/202

This bleeds into a second, related concern: there hasn't been much in the way of what I'd characterise as "pull"-based feedback from developers about this API. But it's hard to judge based on the public record. I might have missed it, but OT interest hasn't been captured here, the original I2E, or the extension request thread:

https://groups.google.com/a/chromium.org/g/blink-dev/c/6uBwiiFohAU/m/WhaKAB9fAAAJ
https://groups.google.com/a/chromium.org/g/blink-dev/c/qs059tBaQMI/m/dWDP2ZvpAQAJ

Should we be looking somewhere else to get a sense for developer enthusiasm?

It's also a bit concerning re: our process for features we're the first to ship to see responses like this, rather than proposed enhancements (even if they're for a future iteration):

https://github.com/webmachinelearning/prompt-api/issues/187

Shipping subsets for v1 is fine; avoiding design work in response to feedback, less so.

If partners are interested in seeing this ship ASAP, I'd be swayed (per usual) by them weighing in here. It would be particularly convincing if they could explain why they are not worried about the API changing behaviour in response to model upgrades, and it would be *most* persuasive if they'd tried both Edge's OTs using Phi in addition to Google's Gemini-based behaviour.

Without that, I'm not sure what there is to do but flip the "Needs Work" flag, avoid shipping downstream in Edge, and suggest we all go back to OT and sort out interop, versioning, and modality questions.

Given that time is short for 148, I hope we can get some useful on-the-record input from developers and/or persuasive details from Google's OT partners.

Thanks,

Alex

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

Reilly Grant

unread,

May 1, 2026, 12:04:31 PMMay 1

to Alex Russell, blink-dev, Rick Byers, Mike Wasserman, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

On Thu, Apr 30, 2026 at 2:19 PM Alex Russell <sligh...@chromium.org> wrote:

Hey folks,

First, an apology: I've had concerns about this API for a long while but simply missed that we had 3 LGTMs for 148. We're now hours from branch, and it would be pretty anti-social to -1 at this point. However, I think we need to consider finching the Prompt API off for 148 and moving back to OT for a few reasons.

First, none of the OT feedback I've seen, or engagement that the Edge team has separately had with developers, suggests that an unversioned multi-modal client-side model will be able to avoid serious application breakage if used as the basis for core app functionality. I've mentioned to various folks overthe past year+ that it would be useful if we had a way to call out capabilities, versions, or other parameters at construction time, but the spec doesn't seem to have any discussion of that flexibility today:

https://webmachinelearning.github.io/prompt-api/#dictdef-languagemodelcreateoptions

The parameters in LanguageModelCreateCoreOptions (in particular, expectedInputs and expectedOutputs) which can be passed to both availability() and create() allow the developer to check for the availability of and request both language support and multi-modality capabilities. For example:

const session = await LanguageModel.create({

expectedInputs: [

{ type: "text", languages: ["en"] },

{ type: "audio" },

{ type: "image" },

],

expectedOutputs: [{ type: "text", languages: ["en"] }],

});

This seems to cover everything you’re asking for other than versions, which is something I think we should be opinionated about: The browser picks the model.

This is the fundamental difference between these “built-in AI” APIs versus other options like WebNN. The tradeoff we’re making here is that the user benefits from having a single model (or small set of models focused on specific tasks, e.g. translation) rather than each developer downloading their own large model consuming bandwidth, disk and memory. I could be convinced to reveal the model version to the developer however I am concerned this would result in a similar situation we see today with user-agent sniffing rather than capability detection-based progressive enhancement.

Microsoft's own experiments with alternative models suggest getting to interoperability with multi-modality via a different training stack is not a given,

We meet regularly with our counterparts on the Edge team and this concern is news to me. I’d like to hear more details.

but feedback about the API only suggests workarounds, rather than API enhancements we'd expect in response to OT feedback:

https://github.com/webmachinelearning/prompt-api/issues/202

This feature request is quite fresh but our team agrees with the reporter that there should be better capability detection here. We suggested they file that issue based on a discussion in the Chromium issue tracker.

This bleeds into a second, related concern: there hasn't been much in the way of what I'd characterise as "pull"-based feedback from developers about this API. But it's hard to judge based on the public record. I might have missed it, but OT interest hasn't been captured here, the original I2E, or the extension request thread:

https://groups.google.com/a/chromium.org/g/blink-dev/c/6uBwiiFohAU/m/WhaKAB9fAAAJ
https://groups.google.com/a/chromium.org/g/blink-dev/c/qs059tBaQMI/m/dWDP2ZvpAQAJ

Should we be looking somewhere else to get a sense for developer enthusiasm?

Agreed, we dropped the ball on publishing developer feedback and are working on a better summary.

It's also a bit concerning re: our process for features we're the first to ship to see responses like this, rather than proposed enhancements (even if they're for a future iteration):

https://github.com/webmachinelearning/prompt-api/issues/187

Shipping subsets for v1 is fine; avoiding design work in response to feedback, less so.

Batched inference is a good feature request but supporting it well (versus conversational inference) will require a different API and underlying inference engine features which may not be available. I think it’s reasonable to acknowledge the feature request but flag it as out-of-scope for v1.

If partners are interested in seeing this ship ASAP, I'd be swayed (per usual) by them weighing in here. It would be particularly convincing if they could explain why they are not worried about the API changing behaviour in response to model upgrades, and it would be *most* persuasive if they'd tried both Edge's OTs using Phi in addition to Google's Gemini-based behaviour.

I agree. It sounds like Edge has some developer feedback (or maybe it’s just internal testing) that we haven’t seen yet. I’ve shared your concerns about the interoperability issues inherent to this API. I think developers successfully building sites that work in Chrome and Edge, as well as our ability to upgrade the model (you’ll note that Google just released Gemma 4, stay tuned) without breaking existing sites is the only true test of whether the concerns are well-founded.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

--

You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/ec9da2ca-20a1-41f2-b60b-ca0ecd959215n%40chromium.org.

Rick Byers

unread,

May 1, 2026, 4:36:18 PMMay 1

to Reilly Grant, Alex Russell, blink-dev, Mike Wasserman, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

From a procedural point of view I don't personally think we should be considering flagging this off at this stage. It already went out in our early stable release last week and will be deployed to all Chrome stable users in a matter of days. Using a kill switch at this late stage is generally reserved for emergencies to mitigate significant user harm (and comes with its own risks, since we have not been testing the off state in our canary/dev/beta population). While the strength of the Edge team's concern is apparently a surprise, I don't think there's actually any significant new information about the cost/benefit risk tradeoff here.

Instead I suggest we try to focus on finding and fixing any interop issues in practice. As I said on the Mozilla standards position thread, I'm personally supportive of Chrome making breaking changes where necessary to break calcification around model-specific quirks (as we've done in the past in the name of interop, eg. when fixing WebKit border image quirks, breaking mobile Gmail in order to match the spec and Firefox). Reilly says the team is working to share more developer signals. Once we have some real-world use cases to point at we can focus together on driving interop in practice.

I'll note that there's some parallels here to how the browser community approaches performance - the one big longstanding aspect of probabilistic platform behavior (alongside snaller examples like the web speech API). Most applications aren't that sensitive to performance differences, and I expect the same here (but could be wrong!). But for some applications, a performance difference can mean the difference between an app being usable or not. Browsers largely compete on performance implementations but collaborate on benchmarks and API design. It has its downsides, but I think it's also done a lot of good to drive user-positive innovation and investment through constructive competition.

Rick

Mike Wasserman

unread,

May 1, 2026, 6:31:16 PMMay 1

to blink-dev, Rick Byers, Alex Russell, blink-dev, Mike Wasserman, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin, Reilly Grant

To address some additional requests from Alex:

Regarding open API issues and feature requests:

This quote from earlier in the thread might have been overlooked: “We conducted thoughtful google-internal reviews and triage, and prioritized addressing issues that couldn't be addressed after launch”

We know we have work to respond more thoughtfully to each issue, but we have exercised diligence to ensure all open issues could be addressed by additive API enhancements and backwards-compatible implementation improvements.

Please do raise any concrete counter-examples of API design traits or specified functionality that preclude interoperable implementations. As Rick says, we will break APIs after launch for the unequivocal health of the platform.

Regarding developer feedback:

I acknowledge that the initial evidence provided regarding developer sentiment could be more fresh and complete. We are compiling more recent examples of developer feedback to better reflect the broad demand we are seeing for this capability.

Unfortunately, I’m unable to name and quote partners in this particular forum without advance consent; we are actively working to provide an approved list.

Meanwhile, here are a couple published case studies:

https://developer.chrome.com/blog/prompt-api-blog-cyberagent

https://developer.chrome.com/blog/ai-guessing-game

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/ec9da2ca-20a1-41f2-b60b-ca0ecd959215n%40chromium.org.

Alex Russell

unread,

May 4, 2026, 2:48:47 PMMay 4

to blink-dev, Mike Wasserman, Rick Byers, Alex Russell, blink-dev, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin, Reilly Grant

Thanks for weighing in, all.

Rick: on process, given the import, every downstream vendor is going to make a call about this independently. We (Edge) are going to flag it off for now, and it seems like others will too. Not sure what that implies for API OWNERS process, but it at least suggests my failure to spot that this had 3 LGTMs is not a costless mistake. For that, I apologize.

Reilly: thanks for the example code; it's roughly what Mike had posted in response to developers asking about feature detection, right? Do I misunderstand that it potentially has the cost of cons'ing up a multi-gigabyte memory allocation?

On the ability to transition versions painlessly, is there a downside to letting the Gemini/Gemma 3 --> 4 transition play out behind OT to get more data on that front? Or is the set of OT users too small for us to derive that information?

Best,

Alex

Rick Byers

unread,

May 4, 2026, 4:49:27 PMMay 4

to Alex Russell, blink-dev, Mike Wasserman, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin, Reilly Grant

On Mon, May 4, 2026 at 2:48 PM Alex Russell <sligh...@chromium.org> wrote:

Thanks for weighing in, all.

Rick: on process, given the import, every downstream vendor is going to make a call about this independently. We (Edge) are going to flag it off for now, and it seems like others will too. Not sure what that implies for API OWNERS process, but it at least suggests my failure to spot that this had 3 LGTMs is not a costless mistake. For that, I apologize.

I wonder if perhaps this is another example of a feature that should really be off by default in content and enabled explicitly by each embedder? We've seen this pattern before with full-stack features like FedCM and PaymentRequest which tend to be broken in downstream browsers until downstream work is done to explicitly add UI for them (despite all Chrome's code being open source). Then there are the rare examples like the Web Speech API which rely on some Google service or other closed source component which definitely require downstream work to function and validate. I don't think such features are necessarily a failure of the API owners process, but maybe more a failure of our upstream/downstream engineering systems? With my ATL hat on (not API owners), I'm supportive of finding some way to make cases like this off-by-default for downstream.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

Reilly Grant

unread,

May 4, 2026, 5:24:00 PMMay 4

to Alex Russell, blink-dev, Mike Wasserman, Rick Byers, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

On Mon, May 4, 2026 at 11:48 AM Alex Russell <sligh...@chromium.org> wrote:

Thanks for weighing in, all.

Rick: on process, given the import, every downstream vendor is going to make a call about this independently. We (Edge) are going to flag it off for now, and it seems like others will too. Not sure what that implies for API OWNERS process, but it at least suggests my failure to spot that this had 3 LGTMs is not a costless mistake. For that, I apologize.

My intuition is that with the API available with both Chrome and Edge's built-in models then developers will be able to target both. Otherwise you guarantee that developers will only test with Chrome. I understand the reasons you've stated for this decision but they seem based on a misunderstanding of the API's feature detection capabilities we've been discussing here. Therefore, I ask you to research this further before committing to this course.

Reilly: thanks for the example code; it's roughly what Mike had posted in response to developers asking about feature detection, right? Do I misunderstand that it potentially has the cost of cons'ing up a multi-gigabyte memory allocation?

My example should've used LanguageModel.availability() instead of create() (these take the same base arguments) to better demonstrate this. Checking model availability doesn't require loading the model to tell you whether the currently downloaded model supports the capabilities you're asking for. However, Chrome currently doesn't implement this optimally because if no model is downloaded it doesn't know whether the model configuration it will download will support specific modalities. It will therefore return "downloadable" instead of "unavailable". We're currently refactoring our model configuration management infrastructure to fix this.

On the ability to transition versions painlessly, is there a downside to letting the Gemini/Gemma 3 --> 4 transition play out behind OT to get more data on that front? Or is the set of OT users too small for us to derive that information?

We already transitioned from Gemini Nano 2 to 3 during the OT. The set of users is still quite small. I think a post-launch transition will gather better feedback.

Best,

Alex

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+...@chromium.org.

Alex Russell

unread,

May 5, 2026, 3:00:54 PMMay 5

to blink-dev, Reilly Grant, blink-dev, Mike Wasserman, Rick Byers, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin, Alex Russell

On Monday, May 4, 2026 at 2:24:00 PM UTC-7 Reilly Grant wrote:

On Mon, May 4, 2026 at 11:48 AM Alex Russell <sligh...@chromium.org> wrote:
Thanks for weighing in, all.

Rick: on process, given the import, every downstream vendor is going to make a call about this independently. We (Edge) are going to flag it off for now, and it seems like others will too. Not sure what that implies for API OWNERS process, but it at least suggests my failure to spot that this had 3 LGTMs is not a costless mistake. For that, I apologize.

My intuition is that with the API available with both Chrome and Edge's built-in models then developers will be able to target both. Otherwise you guarantee that developers will only test with Chrome. I understand the reasons you've stated for this decision but they seem based on a misunderstanding of the API's feature detection capabilities we've been discussing here. Therefore, I ask you to research this further before committing to this course.

We're happy to keep running the OT to enable developers to continue to test, but we don't have confidence about the multi-modal cliff here, and I don't think there's anything to do but to flag it off. You mentioned that there are ongoing meetings, and my colleagues at MSFT mentioned the same, but I haven't been invited to them. If you'd like to include me or Dan in those calls, happy to join.

Reilly: thanks for the example code; it's roughly what Mike had posted in response to developers asking about feature detection, right? Do I misunderstand that it potentially has the cost of cons'ing up a multi-gigabyte memory allocation?

My example should've used LanguageModel.availability() instead of create() (these take the same base arguments) to better demonstrate this. Checking model availability doesn't require loading the model to tell you whether the currently downloaded model supports the capabilities you're asking for. However, Chrome currently doesn't implement this optimally because if no model is downloaded it doesn't know whether the model configuration it will download will support specific modalities. It will therefore return "downloadable" instead of "unavailable". We're currently refactoring our model configuration management infrastructure to fix this.

This suggests to me that we probably need more bake time. Is that wrong?

On the ability to transition versions painlessly, is there a downside to letting the Gemini/Gemma 3 --> 4 transition play out behind OT to get more data on that front? Or is the set of OT users too small for us to derive that information?

We already transitioned from Gemini Nano 2 to 3 during the OT. The set of users is still quite small. I think a post-launch transition will gather better feedback.

As you know, from a risk-management perspective, I2S is permission to pour the API concrete to avoid re-litigation in the future. It's easier to relax an API's shape with future additions than to make it less restrictive. It's hard to read this transition + stable plan as anything but premature when we could do the same thing behind an OT. Why is that wrong?

Best,

Alex

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/25e6f203-87b6-4844-9edf-fa301ad36e3an%40chromium.org.

--
You received this message because you are subscribed to the Google Groups "blink-dev" group.

To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.

To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/ec9da2ca-20a1-41f2-b60b-ca0ecd959215n%40chromium.org.

Rick Byers

unread,

May 22, 2026, 5:19:11 PMMay 22

to Alex Russell, blink-dev, Reilly Grant, Mike Wasserman, Deepti Bogadi, Kenji Baheux, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

The API owners met twice to discuss the feedback on this intent. We agree there is significant cause for concern about plausible interoperability in the design of this API since LLMs are non-deterministic and evolving rapidly. We appreciate that it will likely take years of deployment experience before we really have all the answers here, but believe there are several things which could be done to mitigate the risk now before usage grows and reinforce Blink’s commitment to interoperability. Mike, can you please work with your team to evaluate the following ideas and let us know what the team can plan to deliver over the next 3-6 months in this regard?

First we agree that the biggest opportunity for promoting Prompt API interop is an open, high quality eval suite. For most features we require extensive conformance tests to exist in a neutral location with test scores easily available (like wpt.fyi). Is there some reason that isn’t possible or realistic here? We know that the Edge team is eager to collaborate here, and we’re thrilled with the aligned results from the latest manual eval they did. While Chrome can’t necessarily share details of who we know to be using any API, can we synthesize real-world use cases into an eval suite that we work to keep representative, high-quality, and under an open-governance umbrella? Can we automate or semi-automate results collection so that stakeholders can easily track the scores over time?

Secondly, since we know promoting interop may require Chrome to make changes to our behavior (and possibly even the API), this seems to fall under our web compat change principle of “accepted interop risk”. API owners are willing to accept a higher compat risk resulting from changing this API (and models behind it) in ways which increase interoperability. To what extent is the team willing to take on this risk as well? For example, if the community agrees that a breaking change in the API would promote interoperability, can we plan to do a phased transition over a few months where we support both and then deprecate the original form? Since there are lots of scenarios where this API won’t exist (eg. mobile browsers), we imagine all sites in practice will rely on feature detection and so worst case we always have the option to change the API name in order to do a breaking change transition, right?

Third, to the extent that interop may be hard for some smaller browsers to achieve with their own model, it would be helpful if upstream Chromium had a fully open-source implementation that performed similarly to Chrome on the eval suite. We don’t mind if Chrome wants to use a different implementation than Chromium otherwise, but of course sharing one implementation would be the easiest way to achieve this goal. Is this something the team can plan to invest in? Do we have any data on, for example, the gap between Google’s open-source Gemma model and Gemini Nano in the context of how prompt API is being used today?

Fourth, any model updates or changes Chrome makes are a golden opportunity to learn about web compat risk in practice, and hence inform other browsers’ interoperability strategy. To what extent can the team commit to sharing developer feedback (anonymized if necessary) and bug reports about compatibility issues when Chrome’s behavior changes? Will the team plan to promptly propose improvements to the eval to align with any lessons learned from real-world compat issues?

Thanks,

Rick on behalf of blink API owners

Kenji Baheux

unread,

Jun 3, 2026, 7:02:05 PMJun 3

to Rick Byers, Alex Russell, blink-dev, Reilly Grant, Mike Wasserman, Deepti Bogadi, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

Hi Rick,

Thank you to the API owners for the thoughtful feedback and for evaluating these vectors. Please accept our apologies for the delay in getting back to you; we wanted to ensure we took the time to sync thoroughly with our engineering leads and align closely with the recent Web ML CG discussions to provide concrete plans.

As we evaluated how to best address the API owners' questions, we also realized that our original Intent to Ship thread didn't adequately capture the momentum we see from the ecosystem for a generalized Prompt API.

What began as early experimentation has grown into significant developer interest and matured into live deployments.

Our Built-in AI Early Preview Program (EPP) scaled past 50,000 participants (over 57% representing commercial entities), and we observed compelling signals for the Prompt API across the board—whether it’s the 800 public GitHub files integrating the API, the steady stream of weekly downloads for type definitions and framework SDK providers, or the fact that the Prompt API is three times more popular than the second most requested capability among EPP members.
We are also seeing a growing number of major ecosystem partners—including Drupal, Temu, Trip.com, and Yahoo! Taiwan—who are actively evaluating or testing live implementations against this surface to handle the diverse, long-tail use cases that narrow task APIs cannot realistically cover. Those are just a few examples of the signals we are observing and that we could have shared earlier.

With that foundational context in mind, below is our evaluation and current plan of record for each of the areas raised by API owners:

On Sat, May 23, 2026 at 6:19 AM Rick Byers <rby...@chromium.org> wrote:

The API owners met twice to discuss the feedback on this intent. We agree there is significant cause for concern about plausible interoperability in the design of this API since LLMs are non-deterministic and evolving rapidly. We appreciate that it will likely take years of deployment experience before we really have all the answers here, but believe there are several things which could be done to mitigate the risk now before usage grows and reinforce Blink’s commitment to interoperability. Mike, can you please work with your team to evaluate the following ideas and let us know what the team can plan to deliver over the next 3-6 months in this regard?

First we agree that the biggest opportunity for promoting Prompt API interop is an open, high quality eval suite. For most features we require extensive conformance tests to exist in a neutral location with test scores easily available (like wpt.fyi). Is there some reason that isn’t possible or realistic here? We know that the Edge team is eager to collaborate here, and we’re thrilled with the aligned results from the latest manual eval they did. While Chrome can’t necessarily share details of who we know to be using any API, can we synthesize real-world use cases into an eval suite that we work to keep representative, high-quality, and under an open-governance umbrella? Can we automate or semi-automate results collection so that stakeholders can easily track the scores over time?

To address the limitations of traditional Web Platform Tests (WPTs) when evaluating non-deterministic surfaces, we intend to work with the ecosystem and within the Web Machine Learning Community Group (CG) to establish an open use-case benchmark for the Prompt API. This benchmark suite will represent what the ecosystem is actively building or interested in, providing clear direction on viability and interoperability across different browser-and-model combinations as our respective AI runtimes and models improve over time. We welcome collaboration with other ecosystem stakeholders, including Microsoft Edge, to explore how results collection can be automated or semi-automated over time to allow public tracking against structured, canonical benchmark use cases.

Secondly, since we know promoting interop may require Chrome to make changes to our behavior (and possibly even the API), this seems to fall under our web compat change principle of “accepted interop risk”. API owners are willing to accept a higher compat risk resulting from changing this API (and models behind it) in ways which increase interoperability. To what extent is the team willing to take on this risk as well? For example, if the community agrees that a breaking change in the API would promote interoperability, can we plan to do a phased transition over a few months where we support both and then deprecate the original form? Since there are lots of scenarios where this API won’t exist (eg. mobile browsers), we imagine all sites in practice will rely on feature detection and so worst case we always have the option to change the API name in order to do a breaking change transition, right?

We are aligned with the principle of "accepted interop risk" and believe that the API surface is structurally flexible enough to absorb breaking changes if the community deems them necessary. For instance, we can naturally leverage the API’s capability detection pattern (i.e., utilizing the availability() check alongside the create() call to initialize a session with specific use-case requirements) to safely execute these adjustments.

Third, to the extent that interop may be hard for some smaller browsers to achieve with their own model, it would be helpful if upstream Chromium had a fully open-source implementation that performed similarly to Chrome on the eval suite. We don’t mind if Chrome wants to use a different implementation than Chromium otherwise, but of course sharing one implementation would be the easiest way to achieve this goal. Is this something the team can plan to invest in? Do we have any data on, for example, the gap between Google’s open-source Gemma model and Gemini Nano in the context of how prompt API is being used today?

Investing in an aligned open-source implementation within upstream Chromium is already part of our plan of record. Our shift toward open-weight models like Gemma 4 (released under a permissive license) is another element which will inherently lower the barrier for upstream Chromium, and other browsers, to offer aligned alternative implementations.

Fourth, any model updates or changes Chrome makes are a golden opportunity to learn about web compat risk in practice, and hence inform other browsers’ interoperability strategy. To what extent can the team commit to sharing developer feedback (anonymized if necessary) and bug reports about compatibility issues when Chrome’s behavior changes? Will the team plan to promptly propose improvements to the eval to align with any lessons learned from real-world compat issues?

To ensure the ecosystem stays informed, we commit to full transparency regarding the impact and learnings from model updates. We will encourage partners and the community to use the public Chromium issue tracker to report production breakages, while concurrently using the Web Machine Learning Community Group to address spec gaps, support transparent compatibility work, and translate real-world learnings into benchmark and tooling improvements.

Kenji on behalf of the Chrome Built-in AI team.

Rick Byers

unread,

Jun 4, 2026, 4:56:28 PMJun 4

to Kenji Baheux, Alex Russell, blink-dev, Reilly Grant, Mike Wasserman, Deepti Bogadi, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

Thank you Kenji, this sounds good to me!

Can you share anything about planned timeframe for next milestones on the eval suite and open sourcing work? When should API owners expect to check-in with the team on material progress?

Thanks,

Rick

Kenji Baheux

unread,

Jun 21, 2026, 9:49:26 PMJun 21

to Rick Byers, Alex Russell, blink-dev, Reilly Grant, Mike Wasserman, Deepti Bogadi, Etienne Noël, Rob Kochman, Mike Taylor, Vladimir Levin

Hi Rick,

We don't have a concrete answer yet regarding a check-in time.

The team is currently breaking the project down into workstreams and next steps while developing a prioritization plan.

We will update this thread with more details once this planning step is complete.

Best regards,

Reply all

Reply to author

Forward