Training pipeline failed with error message: Invalid column names:


Steve Walker

Jan 13, 2021, 10:06:52 AM
to cloud-automl-tables-discuss
Is there a way to find out more detailed information on this error? I'm using the TF Titanic dataset, nothing fancy.


Chenyu Zhao

Jan 13, 2021, 12:42:04 PM
to Steve Walker, cloud-automl-tables-discuss
I suspect you're using CSV and there's an erroneous comma somewhere causing there to be an extra column with the name "" (i.e. empty string)

Can you double-check? If you attach the CSV file, we can help take a look too.


Chenyu Zhao

Jan 13, 2021, 12:42:21 PM
to Steve Walker, cloud-automl-tables-discuss
*an erroneous comma in the header line
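
One quick way to check for this is to read only the header row and look for empty names. The following is a minimal sketch using Python's standard csv module; the function name `find_empty_header_columns` and the sample data are made up for illustration and are not part of any AutoML tooling.

```python
import csv
import io


def find_empty_header_columns(csv_text):
    """Return the 0-based positions of header cells that are empty or whitespace-only."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader, [])  # an empty input yields an empty header
    return [i for i, name in enumerate(header) if name.strip() == ""]


# Illustrative data: the trailing comma in the header creates a third, unnamed column.
sample = "survived,age,\n1,29,\n0,35,\n"
print(find_empty_header_columns(sample))
```

For a file on disk, the same check can be run on the text returned by `open(path, newline="").read()`.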

Pratap Ramamurthy

Feb 11, 2022, 6:20:43 PM
to cloud-automl-tables-discuss
I am hitting the same error. Apparently the Vertex AI Python SDK throws this error when column names contain spaces, like "Column 1".

I was able to get past this error by renaming the columns in the CSV file.

thanks
Pratap
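
The renaming can be scripted rather than done by hand. Below is a rough sketch, assuming that names made only of letters, digits, and underscores are safe; `sanitize_column_names` is a hypothetical helper, not an SDK function, and the exact set of characters AutoML accepts is not confirmed in this thread.

```python
import re


def sanitize_column_names(names):
    """Rewrite names so they use only letters, digits, and underscores, and stay unique."""
    cleaned = []
    seen = set()
    for name in names:
        # Collapse each run of disallowed characters (spaces included) into one underscore.
        new = re.sub(r"[^A-Za-z0-9_]+", "_", name.strip())
        # Assumed rule: a name must not be empty or start with a digit.
        if not new or new[0].isdigit():
            new = "_" + new
        # Add a numeric suffix if the rewrite collides with an earlier name.
        base, n = new, 1
        while new.lower() in seen:
            n += 1
            new = f"{base}_{n}"
        seen.add(new.lower())
        cleaned.append(new)
    return cleaned


print(sanitize_column_names(["Column 1", "Passenger Class", "age"]))
```

Whatever names end up in the CSV header also have to be used wherever the training code refers to columns, such as the target column.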

Ivan Cheung

Feb 14, 2022, 10:57:33 AM
to Pratap Ramamurthy, cloud-automl-tables-discuss
Thanks for bringing this to our attention.

Can you provide a truncated CSV (1 to 3 lines) and a code example for us to test?

Thanks!

Ivan Cheung

CloudAI Developer Programs Engineer

iva...@google.com



Pratap Ramamurthy

Feb 14, 2022, 2:24:48 PM
to Ivan Cheung, cloud-automl-tables-discuss

I am using this notebook (from official repository).


The easiest way to recreate this error is to take that dataset CSV file and change one of the column names so that it contains a space; you will then hit the error.
Make sure you also change it in this call: aiplatform.AutoMLTabularTrainingJob()

thanks
Pratap



--

Pratap Ramamurthy

AI/ML Specialist Customer Engineer

prat...@google.com | (647) 244-8430

Ivan Cheung

Feb 15, 2022, 3:34:30 PM
to Pratap Ramamurthy, cloud-automl-tables-discuss

Ivan Cheung

CloudAI Developer Programs Engineer

iva...@google.com


sokipriala jonah

May 15, 2022, 10:20:25 AM
to cloud-automl-tables-discuss
Thanks, guys. The suggestions worked for me; spaces in my table header caused the error.

Michel Siqueira Reis

Aug 26, 2022, 8:33:11 PM
to cloud-automl-tables-discuss
I solved this by preparing a DataFrame and writing it out with the pandas function:
df.to_csv("data.csv", index=False). This outputs a format that AutoML can read.
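
A possible reason this helps (an assumption, not something confirmed in this thread): with the default index=True, pandas writes the row index as an extra first column, and if the index has no name, its header cell is an empty string. That would be the same kind of unnamed column suspected earlier in the thread. A small sketch with made-up data that compares the two header lines:

```python
import pandas as pd

# Made-up data, for illustration only.
df = pd.DataFrame({"survived": [1, 0], "age": [29, 35]})

# With the default index=True, the header starts with an empty name for the index column.
with_index = df.to_csv().splitlines()[0]
# With index=False, only the real column names are written.
without_index = df.to_csv(index=False).splitlines()[0]

print(repr(with_index))
print(repr(without_index))
```

Calling to_csv without a path returns the CSV as a string, which makes the header easy to inspect before uploading anything.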

Manish Kumar

Jan 24, 2023, 3:27:26 AM
to cloud-automl-tables-discuss
I get this error message even when my column names contain "_".

Michael Hu

Jan 24, 2023, 12:51:40 PM
to cloud-automl-tables-discuss
Hi Manish,

Any column names that align with https://cloud.google.com/bigquery/docs/schemas#column_names should be allowed by AutoML. If you want, you can share your project number and training pipeline id with cloud-automl-t...@google.com and we can take a look.

Thanks,
Michael
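
A header can be checked locally before starting a training pipeline. The sketch below assumes the basic rule commonly described for BigQuery column names (letters, digits, and underscores only; first character a letter or underscore; at most 300 characters; no duplicates ignoring case). The linked page is the authoritative source and may allow more than this, so treat the check as conservative. `invalid_column_names` is a hypothetical helper.

```python
import re

# Assumed rule: letters, digits, underscores; starts with a letter or underscore; max 300 chars.
_NAME_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_]{0,299}")


def invalid_column_names(names):
    """Return (name, reason) pairs for names that fail the assumed rule or repeat an earlier name."""
    problems = []
    seen = set()
    for name in names:
        if not _NAME_RE.fullmatch(name):
            problems.append((name, "bad characters, bad first character, empty, or too long"))
        key = name.lower()
        if key in seen:
            problems.append((name, "duplicate (case-insensitive)"))
        seen.add(key)
    return problems


for name, reason in invalid_column_names(["passenger_id", "Column 1", "", "Age", "age"]):
    print(repr(name), "->", reason)
```

If every name passes a check like this and the error still appears, the cause is probably elsewhere, and sharing the pipeline details as suggested above is the better route.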
