Good starting point: Interview experience: https://medium.com/%40ajingnv/system-design-a-server-health-monitoring-system-9bdd0066bb9c
Solution: https://systemdesignschool.io/problems/realtime-monitoring-system/solution
Leetcode Discussion: https://leetcode.com/discuss/interview-question/system-design/958919/System-Design-Interview-or-Service-Health-Monitoring-and-Alerting-Service