📕 Here are some good solutions we found for this question:
Good solution: https://systemdesignschool.io/problems/realtime-monitoring-system/solution
Nice article with overview: https://archive.is/Eezec
What kind of metrics will you pull from the servers?
How do you fetch metrics? Do you use push or pull?
How often do you collect metrics?
Where do you store metrics and query them?
How do you display the metrics?
How do you send alerts? When will you send them?
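To make these questions concrete, here is a minimal pull-based sketch in Python. It is not taken from the linked solutions. Every name in it (`Monitor`, `read_metrics`, `cpu_percent`), the 90% threshold, the three-sample rule, and the 10-second interval are assumptions chosen only for illustration, and the in-memory deque stands in for a real time-series store.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Tuple

# Hypothetical sample shape: (timestamp_seconds, {metric_name: value}).
Sample = Tuple[float, Dict[str, float]]


class Monitor:
    """Toy pull-based collector with one threshold alert rule (illustrative only)."""

    def __init__(
        self,
        read_metrics: Callable[[], Dict[str, float]],  # assumed callback that scrapes one server
        threshold: float = 90.0,   # assumed alert threshold for cpu_percent
        consecutive: int = 3,      # assumed: alert only after this many bad samples in a row
        retention: int = 1000,     # assumed: keep only the most recent samples
    ) -> None:
        self.read_metrics = read_metrics
        self.threshold = threshold
        self.consecutive = consecutive
        self.store: Deque[Sample] = deque(maxlen=retention)
        self.alerts: List[str] = []

    def collect(self, now: float) -> None:
        """Pull one sample, store it, then evaluate the alert rule."""
        self.store.append((now, self.read_metrics()))
        self._check_cpu(now)

    def _check_cpu(self, now: float) -> None:
        """Record an alert if the last `consecutive` samples all exceed the threshold."""
        recent = list(self.store)[-self.consecutive:]
        if len(recent) == self.consecutive and all(
            metrics["cpu_percent"] > self.threshold for _, metrics in recent
        ):
            self.alerts.append(
                f"{now}: cpu_percent above {self.threshold} "
                f"for {self.consecutive} consecutive samples"
            )


# Usage: simulate four polls, 10 seconds apart, with made-up CPU readings.
readings = iter([50.0, 95.0, 96.0, 97.0])
monitor = Monitor(lambda: {"cpu_percent": next(readings)})
for tick in range(4):
    monitor.collect(float(tick * 10))
print(monitor.alerts)
```

Requiring several consecutive bad samples is one way to answer "when will you send them": it avoids alerting on a single spike. This sketch would keep appending an alert on every further bad sample, so a real system would likely also deduplicate or rate-limit notifications.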