Design a Distributed Metrics Logging and Aggregation System

📕 Here are some good solutions we found for this question:

Detailed Solution from Google Engineer: https://www.youtube.com/watch?v=p_q-n09B8KA

Good overview from Google Engineer: https://www.youtube.com/watch?v=_KoiMoZZ3C8

Related - Ad Click Aggregation: https://www.hellointerview.com/learn/system-design/problem-breakdowns/ad-click-aggregator

Leetcode Discussion: https://leetcode.com/discuss/interview-question/system-design/622704/Design-a-system-to-store-and-retrieve-logs-for-all-of-eBay


🙋 Here are some details you should know about this question:

Let's assume the logs are generated by a distributed system, e.g., an EC2 compute cluster.

How do you collect logs efficiently from distributed systems?
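
One common starting point for the collection question is a per-host agent that buffers log lines and ships them in batches instead of making one network call per line. The sketch below is only an illustration under stated assumptions: `BatchingLogAgent`, `send`, `max_batch`, and `max_wait_s` are hypothetical names, and `send` stands in for whatever transport (a message queue producer, an HTTP client) a real design would use.

```python
import time
from typing import Callable, List, Optional


class BatchingLogAgent:
    """Hypothetical per-host agent: buffers log lines and ships them in batches."""

    def __init__(self, send: Callable[[List[str]], None],
                 max_batch: int = 100, max_wait_s: float = 1.0,
                 clock: Callable[[], float] = time.monotonic):
        self._send = send              # assumed transport callback
        self._max_batch = max_batch    # flush when this many lines are buffered
        self._max_wait_s = max_wait_s  # or when the oldest buffered line is this old
        self._clock = clock
        self._buffer: List[str] = []
        self._first_at: Optional[float] = None

    def log(self, line: str) -> None:
        if not self._buffer:
            self._first_at = self._clock()
        self._buffer.append(line)
        # The age check only runs when a new line arrives; a real agent
        # would also flush from a background timer.
        too_old = self._clock() - self._first_at >= self._max_wait_s
        if len(self._buffer) >= self._max_batch or too_old:
            self.flush()

    def flush(self) -> None:
        if self._buffer:
            batch, self._buffer = self._buffer, []
            self._send(batch)


# Usage: collect shipped batches in a list instead of sending over the network.
shipped: List[List[str]] = []
agent = BatchingLogAgent(send=shipped.append, max_batch=3, max_wait_s=60.0)
for i in range(7):
    agent.log(f"line {i}")
agent.flush()  # ship whatever is left, e.g. on shutdown
```

Batching trades a little latency for far fewer requests; in an interview it is worth discussing what happens to buffered lines if the host crashes before a flush.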

How do you store and index logs for fast retrieval?

When you search for logs, what happens?
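
To make the indexing and search questions concrete, here is a minimal in-memory inverted index, the data structure behind most full-text log search. It is a sketch, not a production design: `LogIndex` and its methods are hypothetical names, tokenization is a plain whitespace split, and a real system would shard the index (for example by time range) and persist it to disk.

```python
from collections import defaultdict
from typing import Dict, List, Set


class LogIndex:
    """Hypothetical in-memory inverted index: token -> ids of lines containing it."""

    def __init__(self) -> None:
        self._docs: List[str] = []
        self._postings: Dict[str, Set[int]] = defaultdict(set)

    def add(self, line: str) -> int:
        doc_id = len(self._docs)
        self._docs.append(line)
        for token in line.lower().split():
            self._postings[token].add(doc_id)
        return doc_id

    def search(self, query: str) -> List[str]:
        """Return lines that contain every token in the query (AND semantics)."""
        tokens = query.lower().split()
        if not tokens:
            return []
        # .get avoids creating empty postings for unknown tokens.
        ids = set(self._postings.get(tokens[0], set()))
        for token in tokens[1:]:
            ids &= self._postings.get(token, set())
        return [self._docs[i] for i in sorted(ids)]


index = LogIndex()
index.add("ERROR payment timeout host=a1")
index.add("INFO payment ok host=a1")
index.add("ERROR disk full host=b2")
hits = index.search("error payment")
```

A search intersects the posting sets for each query token and then fetches the matching lines, so its cost depends on the size of the posting sets rather than on the total log volume.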

How about live streaming logs? How would that work?
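
One way to think about live streaming is as a publish/subscribe fan-out: each viewer registers a filter, and every incoming line is pushed to the viewers whose filter matches. The sketch below assumes a single process and uses hypothetical names (`LiveTail`, `subscribe`, `publish`, `max_pending`); a real system would more likely sit behind a message broker and push over WebSockets or server-sent events.

```python
import queue
from typing import Callable, List, Tuple


class LiveTail:
    """Hypothetical in-process fan-out of new log lines to live subscribers."""

    def __init__(self, max_pending: int = 1000) -> None:
        self._max_pending = max_pending
        self._subs: List[Tuple[Callable[[str], bool], "queue.Queue[str]"]] = []

    def subscribe(self, predicate: Callable[[str], bool]) -> "queue.Queue[str]":
        # Each subscriber gets its own bounded queue so that one slow
        # viewer cannot hold up the others.
        q: "queue.Queue[str]" = queue.Queue(maxsize=self._max_pending)
        self._subs.append((predicate, q))
        return q

    def publish(self, line: str) -> None:
        for predicate, q in self._subs:
            if predicate(line):
                try:
                    q.put_nowait(line)
                except queue.Full:
                    pass  # assumed policy: drop lines for a slow viewer


tail = LiveTail(max_pending=2)
errors = tail.subscribe(lambda line: "ERROR" in line)
for line in ["INFO start", "ERROR a", "ERROR b", "ERROR c"]:
    tail.publish(line)

received = []
while not errors.empty():
    received.append(errors.get_nowait())
```

Dropping on a full queue is only one possible backpressure policy; the trade-off against blocking the publisher or disconnecting the viewer is a good point to raise.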

