System health and auto recovery by Pega team
I am trying to get as much information about my system health as possible. I have access to PDC and have been able to enable notifications for certain events. What I'm looking for is: -How to define actionable items in event triggered notifications. I know I can see the error message but is there a way to include a 'To resolve this issue, do the following..' message? -Auto recovery for when a system goes down. Can we establish an auto recover process for when one of our system nodes runs out of memory to clear it? Or if some other kind of system component fails an automatic process starts to fix the issue? -Is there active monitoring done by the Pega cloud team that would have earlier insight into system failures before we do? These are a few points that I would like to gain more information on for each of our development and production environments.