Yesterday we pushed a number of batch jobs, 10% of which failed for no provided reason. Could someone please take a look and hopefully explain what happened to our batch jobs so we can perhaps prevent these issues in the future?
They are across a number of different accounts. We had 91 batch jobs CANCELED with no error - examples are
Account: 1095363880, Batch: 1555479026
Account: 3732192484, Batch: 1555405878
Account: 6011282466, Batch: 1555832781
These accounts all seemed to be processing, and suddenly were CANCELED. Definitely not by us.
Then we had 117 batch jobs failed due to "INTERNAL_ERROR".
Examples:
Account: 1289959727, Batch: 1555616228 - 77% complete, 32325 operations executed, 32325 succeeded, 32325 written.
Account: 1167746827, Batch: 1555537352 - 16% complete, 56691 operations executed, 549 succeeded, 56691 written.
Account:
4724505641, Batch: 1555633246 - 71% complete, 47421 operations executed, 47421 succeeded, 47426 written. <-- this one wrote more than operations provided.
Account: 5936322484, Batch: 1556502467 - 0% complete, 20 operations executed, 20 succeeded, 20 written.
Account:
8635488849, Batch: 1556365425 - 99% complete, 99075 operations executed, 94935 succeeded, 99075 written.
Thanks,
Richard