New API proposal: Document Scan

Paul Stewart

unread,

May 22, 2014, 2:35:25 PM5/22/14

to apps...@chromium.org, securit...@chromium.org

I have a new API I'd like to have reviewed. This provides a method
for performing paper document scanning. The design doc for this
feature is at https://docs.google.com/a/google.com/document/d/1x_Iu8EKEUL8Mc35t22xUguh1cpTiPITgDR6L7S6VyGQ
, and a CL displaying the details of the API is at
https://codereview.chromium.org/286933006/

Mustafa Emre Acer

unread,

May 23, 2014, 8:40:59 PM5/23/14

to Paul Stewart, apps-dev, security-enamel

I can't seem to comment on the doc so listing a couple of questions/thoughts here specifically about the CRX bits:

- Is this API going to be public, can any extension/app use it? If so you'd want to add a permission for it.

- Historically extension/app APIs that had methods to list and access devices have had two separate permissions for these two operations. I think a single one is sufficient for this API though (e.g. "documentScan" or something like that).

- You might also consider designing a UI for this API so that the user gets to pick the scanning device, sets the parameters and scans the document all through native Chrome UI without passing anything to the API, and the API only gets the end result.

To unsubscribe from this group and stop receiving emails from it, send an email to apps-dev+u...@chromium.org.

Benjamin Kalman

unread,

May 27, 2014, 2:08:52 PM5/27/14

to Mustafa Emre Acer, Paul Stewart, apps-dev, security-enamel

IIRC the cloud print people were work a general framework for interacting with devices, which (IIRC) included ways to read from them. How does this related?

Benjamin Kalman

unread,

May 28, 2014, 6:34:02 PM5/28/14

to Mustafa Emre Acer, Paul Stewart, apps-dev, security-enamel

(Picking this up again after some offline discussion)

The design doc is very implementation focused and I'd rather see the API framed in terms of a formal proposal.

Paul Stewart

unread,

May 28, 2014, 8:12:07 PM5/28/14

to Benjamin Kalman, Mustafa Emre Acer, apps-dev, security-enamel

I've created a formal proposal per the template at
https://docs.google.com/a/chromium.org/document/d/1FkgGF3l8wcK9SWllPGHHGwWlrEXqyh9jauHHFKiCSnw

Paul Stewart

unread,

Aug 4, 2014, 6:54:41 PM8/4/14

to Benjamin Kalman, Mustafa Emre Acer, apps-dev, security-enamel

I'm not sure how you do a "PTAL" in Google Docs, so I'm pinging here to indicate that I've incorporated suggestions (so far no details on web API vs Chrome) and updated the IDL in my CL to match. The rest of the implementation (e.g., the rendering of the new scan UI elements) is lagging though.

Reilly Grant

unread,

Aug 4, 2014, 7:05:32 PM8/4/14

to Paul Stewart, apps-dev, security-enamel

Unless more information about the scanner is made available through the API the scan options parameter is not very useful. It would better divide responsibilities if the app provided only a hint to the scanner UI (provided by Chrome or a 3rd party "driver" extension) about what kind of object is about to be scanned and what format is expected (document to PDF, photo to JPEG, etc.) and let the rest of the parameters be set out of band (probably with sensible defaults from the hardware plus tweaks from the user).

I think this would pair very well with an API that allowed scanner manufacturers to write drivers as extensions using the USB API.

~ Reilly

Paul Stewart

unread,

Aug 4, 2014, 7:36:31 PM8/4/14

to Reilly Grant, apps-dev, security-enamel

True. I'll scrap the options entirely, have the application provide a list of mime types it can handle, and have the API return what mime type the image(s) are encoded in.

Paul Stewart

unread,

Sep 22, 2014, 1:40:32 PM9/22/14

to Reilly Grant, apps-dev, security-enamel

Friendly ping. I think I've addressed the IDL issues, and from an API perspective I believe things are streamlined. The linked design docs have mocks for the scanning UI. I'm checking to see if it's okay to move forward with the implementation.

Reilly Grant

unread,

Sep 22, 2014, 1:53:37 PM9/22/14

to Paul Stewart, apps-dev, security-enamel

lgtm from an API perspective.

Mustafa Emre Acer

unread,

Sep 22, 2014, 8:00:53 PM9/22/14

to Reilly Grant, Paul Stewart, apps-dev, security-enamel

Hi Paul,

I see two documents in this thread (google.com and chromium.org) and the contents are different. The one for google.com has the UI mocks, is it the latest proposal?

Mustafa Emre Acer

unread,

Sep 22, 2014, 8:02:07 PM9/22/14

to Reilly Grant, Paul Stewart, apps-dev, security-enamel

Ah, never mind. Looks like the google.com doc points to the chromium.org doc which contains the API proposal.

Mustafa Emre Acer

unread,

Sep 26, 2014, 6:41:29 PM9/26/14

to Reilly Grant, Paul Stewart, apps-dev, security-enamel

Security lgtm.

Paul Stewart

unread,

Sep 26, 2014, 6:50:45 PM9/26/14

to Mustafa Emre Acer, Reilly Grant, apps-dev, security-enamel

One last minor change to the IDL: We need to be able to return multiple images so the ScanCallback now takes an array of DOMString. Sorry for the late change.

Benjamin Kalman

unread,

Oct 20, 2014, 6:37:43 PM10/20/14

to Paul Stewart, Mustafa Emre Acer, Reilly Grant, apps-dev, security-enamel

lgtm but I made some nits on the API itself. I would have liked to see this as a web API, what happened to that discussion (if any)?

Reply all

Reply to author

Forward