Contact emails
mca...@chromium.org, owe...@chromium.org
Spec
https://wicg.github.io/shape-detection-api/
Summary
OEMs have long been providing hardware-accelerated detection of image features such as faces and QR codes, given their high computational complexity. The Shape Detection API provides access to these hardware-accelerated detectors where available. (Note that it does _not_ provide any software fallback)
Link to “Intent to Implement” blink-dev discussion
Goals for experimentation
We would like to collect feedback on:
Ergonomics and amount of metadata provided by the API, e.g. face detection results include eyes and mouth currently, but some APIs provide more, which ones would be interesting to surface?
Performance of the API in general (via developer feedback)
Relative performance of the different DOM sources (the API works with a number of data inputs, e.g. <canvas>, <video>, <img>)
Any impact on bug/crash rate due to hardware/software combinations
Whether there are use cases we didn’t already consider
UMA usage count: https://uma.googleplex.com/timeline_v2?sid=5febed43e9e109b411b4bee056314100
Bug category: Blink>ImageCapture && label:ShapeDetection
Experimental timeline
Enabled in Chrome 61, 62, 63
Any risks when the experiment finishes?
None
Ongoing technical constraints
None.
Will this feature be supported on all five Blink platforms supported by Origin Trials (Windows, Mac, Linux, Chrome OS, and Android)?
Android and Mac are supported initially. Support for Windows 10 is on it’s way, expected to land during the trial.
ChromeOS support will need to rely on libraries, depending on the demand we gauge for this platform during the trial.
OWP launch tracking bug
Link to entry on the feature dashboard
https://www.chromestatus.com/features/4757990523535360--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to blink-dev+unsubscribe@chromium.org.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/b49b26bc-0571-43de-9744-71a3052b6610%40chromium.org.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAOMQ%2Bw_ymCMB8hjRM9pfAcg3bYvspM3jg-G0bXRHd%3DU15op5hw%40mail.gmail.com.
Looks cool!What's your thinking for how we might write interop tests for this API? Since this is relying on underlying platform APIs (even harder) I suppose we can't test the exact results of example test cases. But maybe we could have tests with some kind of fuzzy matching (eg. don't test the exact bounding box, just that a box was found or not containing a given point) and land test cases known to pass in all major OS implementations? How hard is it to find a set of test cases that pass in the Android, Mac and Windows implementations?
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/CAFUtAY8n3u8tRFjmtT%3DZav2%3DqeQDCeLpps-aTMjT8-VCYP2wuw%40mail.gmail.com.
Looks cool! This would be a great API to help bootstrap some AR-type scenarios.A couple of questions:
- The spec doesn't seem to provide a straightforward way to use a live camera feed as an image source. Is that intentional?
- The spec also doesn't seem to support object detection (a la Google Cloud Vision API, Amazon Rekognition, etc.). Is that something you'd see coming?
--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/ad908d9b-3d60-44e1-9b51-6f7d5288bc23%40chromium.org.
On 1 July 2017 at 04:42, Rick Byers <rby...@chromium.org> wrote:Looks cool!What's your thinking for how we might write interop tests for this API? Since this is relying on underlying platform APIs (even harder) I suppose we can't test the exact results of example test cases. But maybe we could have tests with some kind of fuzzy matching (eg. don't test the exact bounding box, just that a box was found or not containing a given point) and land test cases known to pass in all major OS implementations? How hard is it to find a set of test cases that pass in the Android, Mac and Windows implementations?It's true that every implementation is going to be a little different, but they should be comparable to within certain limits, i.e. GMS core and Android Face provide different bounding boxes (see e.g. the CL to sort this out w/ tests); IOW we expect to have repeatable tests enforcing there's a face in-or-around these coordinates, with eyes and mouth here and there, and similarly with barcode/qr codes etc.FTR there's a relevant discussion on the TAG review, which raised a similar point among others, and my reply was that current OS implementations are based on the same set of algorithms, so the results are quite comparable.
The biggest catch IMHO is not repeatability but accuracy: some implementations might detect faces better than others, e.g. in this codepen, not all implementations detect the faces w/ glasses.
On Tue, Jul 4, 2017 at 3:00 AM, Miguel Casas <mca...@google.com> wrote:On 1 July 2017 at 04:42, Rick Byers <rby...@chromium.org> wrote:Looks cool!What's your thinking for how we might write interop tests for this API? Since this is relying on underlying platform APIs (even harder) I suppose we can't test the exact results of example test cases. But maybe we could have tests with some kind of fuzzy matching (eg. don't test the exact bounding box, just that a box was found or not containing a given point) and land test cases known to pass in all major OS implementations? How hard is it to find a set of test cases that pass in the Android, Mac and Windows implementations?It's true that every implementation is going to be a little different, but they should be comparable to within certain limits, i.e. GMS core and Android Face provide different bounding boxes (see e.g. the CL to sort this out w/ tests); IOW we expect to have repeatable tests enforcing there's a face in-or-around these coordinates, with eyes and mouth here and there, and similarly with barcode/qr codes etc.FTR there's a relevant discussion on the TAG review, which raised a similar point among others, and my reply was that current OS implementations are based on the same set of algorithms, so the results are quite comparable.Great, thanks!
--
You received this message because you are subscribed to the Google Groups "blink-dev" group.
To view this discussion on the web visit https://groups.google.com/a/chromium.org/d/msgid/blink-dev/b49b26bc-0571-43de-9744-71a3052b6610%40chromium.org.