Document AI with AutoML model

Henrique Delgado

Aug 26, 2020, 5:37:29 AM
to Google Cloud Developers
Hey,

How do you train an AutoML model to use with Document AI?

Since there is not much information on this topic, I don't know whether I should use an AutoML Natural Language Classification model or an Entity Extraction one. Also, how should I train each one to get better results than the pre-trained model provided by Document AI?

Ismail (Cloud Platform Support)

Aug 27, 2020, 1:43:49 PM
to Google Cloud Developers
Hi Henrique,

Here's a good start on how to use a custom AutoML model with any Document AI feature.  

Henrique Delgado

Aug 28, 2020, 5:34:22 AM
to Google Cloud Developers
Hey Ismail,

Thanks for the response. I've checked the documentation, but my question was more specifically about how to train my AutoML model to identify and extract my key-value pairs and tables.
From what I understood, you can only use a classification or an entity extraction AutoML model with Document AI.
That being said, I don't see how to label/annotate my inputs so that all the key-value pairs and tables are extracted correctly.

Thanks

Ismail (Cloud Platform Support)

Aug 31, 2020, 4:26:05 PM
to Google Cloud Developers
Hi Henrique,

AutoML does not support key-value pair or table extraction. It only supports entities (e.g., values within text, form fields, or table cells).

Henrique Delgado

Sep 1, 2020, 4:23:02 AM
to Google Cloud Developers
Hey Ismail,
Yeah, I get that, but then how can you use a custom-trained AutoML model with Document AI and still achieve good results for key-value pair or table extraction?

Ismail (Cloud Platform Support)

Sep 8, 2020, 5:02:31 PM
to Google Cloud Developers
Hi Henrique,

AutoML features for key-value pairs and tables, comparable to AutoML Entity Extraction, are not yet supported. Therefore, I would highly recommend that you follow the steps to submit a feature request to the Document AI team.

Thank you

Constantine Vassilev

Oct 19, 2020, 9:08:32 AM
to Google Cloud Developers
I would like to create a Google Cloud Function in GCP that receives a PDF, calls Document AI from there, and returns the text as JSON. Is that functionality possible now? Could I use Golang?

Olu

Oct 20, 2020, 9:26:30 AM
to Google Cloud Developers

Cloud Functions supports Go runtimes, as per this documentation[1], so I see no reason why your implementation could not be done in Go. Likewise, calling Document AI from your Cloud Function should be possible through the Google Client Libraries[2].
