Hello All,
A couple of weeks back, our barman server had a hardware outage leading to server shutdown for two days. Post the hardware outage, We had dropped the barman slot from the Postgres Server to avoid accumulating the WAL files. Once the server was back up and running, we didn't have any issues reconfiguring the server with barman. The barman check commands work fine for the server abc01 and so does backup retention policy. However, we recently noticed that barman is constantly spamming the error log with the following error:
2020-10-27 21:27:21,141 [173538] barman.server ERROR: WAL file '00000001000009CD000000BE' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:21,979 [173582] barman.server ERROR: WAL file '00000001000009CD000000BF' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:22,715 [173622] barman.server ERROR: WAL file '00000002.history' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:26,258 [173695] barman.server ERROR: WAL file '00000001000009CD000000BE' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:27,095 [173739] barman.server ERROR: WAL file '00000001000009CD000000BF' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:27,882 [173771] barman.server ERROR: WAL file '00000002.history' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:31,248 [173918] barman.server ERROR: WAL file '00000001000009CD000000BE' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:32,038 [173951] barman.server ERROR: WAL file '00000001000009CD000000BF' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:32,778 [173995] barman.server ERROR: WAL file '00000002.history' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:36,232 [174079] barman.server ERROR: WAL file '00000001000009CD000000BE' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:36,981 [174111] barman.server ERROR: WAL file '00000001000009CD000000BF' not found in server 'abc01' (SSH host: 192.168.0.02)
2020-10-27 21:27:37,792 [174155] barman.server ERROR: WAL file '00000002.history' not found in server 'abc01' (SSH host: 192.168.0.02)
We tried doing a receive-wal --reset and switch-wal to let Barman get over the WAL file but that didn't help. Nor did the rebuild-xlog. We would like not to delete all the backup.
PS: We are using barman 2.10 and PostgreSQL 10.12. The above error has originated from the time we have reconfigured the server.
Any insight on this would be highly appreciated.
Thanks,
Viral Shah
Senior Data Analyst