Hi Google ML Team,
I have three models running on one website split between category (1) and brand (2) propensity scores.
However, I am seeing significant differences in the evaluation metrics between these models, with the category model (Row 3) scoring poorly compared to the brand models. Below are the evaluation metrics for each:
Row 1 (Brand Model 1):
Row 2 (Brand Model 2):
Row 3 (Category Model):
To expand the events considered for the model, I modified the SQL scripts for all models. I changed the event_cnts portion of the SQL script to include event_name = search, replacing scroll, and added event_name = view_item.
Despite these adjustments, the category model's performance remains poor-- and maybe Brand Model 1 too. I am looking for guidance or suggestions on how to improve this model’s performance. Could you provide recommendations on tuning, feature engineering, or model selection that may help?
Thank you!
Tom