Common system design interview questions for senior software roles
System design interview questions test your ability to architect large-scale distributed systems under constraints like scalability, fault tolerance, and latency.
For senior software engineers interviewing at FAANG companies and top startups, these questions are the single biggest factor determining your hiring level and compensation.
This guide covers the 15 most frequently asked system design interview questions, the framework to answer them, and the trade-offs interviewers expect senior candidates to discuss.
Key Takeaways
- Senior engineers face 1–2 system design rounds per interview loop, and performance here directly sets your offer level.
- The top 15 questions cover four categories: data-intensive systems, real-time systems, infrastructure components, and search/discovery.
- Interviewers evaluate your trade-off reasoning, not whether you pick the "right" architecture.
- A structured framework (clarify and estimate → high-level design → deep dive → scale and harden) beats memorized diagrams every time.
- You should spend the first 5 minutes asking clarifying questions, not drawing boxes.
What Senior Engineers Are Actually Evaluated On
System design interview questions for senior engineers differ from mid-level questions in scope, depth, and ownership.
At the junior level, interviewers check whether you know a load balancer exists. At the senior level, they expect you to explain why you'd pick an L7 over an L4, what the latency impact is, and how you'd handle failover.
| Evaluation Criteria | Junior Expectation | Senior Expectation |
|---|---|---|
| Requirements gathering | Accept given constraints | Drive the conversation, identify hidden requirements |
| High-level design | Correct component diagram | Justified architecture with explained trade-offs |
| Deep dives | Basic understanding | Production-grade depth in 2–3 areas |
| Scalability | Mention caching/sharding | Quantitative capacity planning with numbers |
| Trade-offs | Acknowledge they exist | Articulate specific pros/cons and pick a side |
| Operational concerns | Rarely discussed | Monitoring, alerting, deployment, failure recovery |
If you are building your foundational knowledge before tackling these questions, the Grokking System Design Fundamentals course covers every building block — from databases and caches to load balancers and message queues — with hands-on examples.
The 15 Most Common System Design Interview Questions
These questions appear repeatedly across Google, Meta, Amazon, Netflix, Microsoft, and high-growth startups. I've grouped them by category so you can study patterns, not just individual problems.
Category 1: Data-Intensive Systems
1. Design a URL Shortener (TinyURL)
Why it's asked: This is the most popular system design interview question across all levels. For senior engineers, interviewers use it as a warm-up and then push hard on collision handling, analytics at scale, and cache eviction.
Core components: API gateway, hashing service (Base62 encoding or MD5 truncation), key-value store (DynamoDB or Redis), 301/302 redirect logic.
Senior-level deep dives the interviewer will push:
- How do you handle hash collisions at 1 billion URLs? (Answer: check-and-retry with a counter suffix, or use a pre-generated key service that allocates unique IDs from a range.)
- Read-to-write ratio is roughly 100:1. How does that shape your caching strategy? (Answer: aggressive read-through cache with Redis; TTL based on access frequency.)
- Should you use 301 (permanent) or 302 (temporary) redirects? (Answer: 302 if you need analytics on every click; 301 if you want to reduce server load and let browsers cache.)
Scale reference: Bitly processes roughly 600 million link clicks per month. Your design should handle at least this order of magnitude.
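As a minimal sketch of the encoding step, here is Base62 from a numeric ID, assuming short codes come from unique IDs handed out by a key service (names are illustrative):

```python
ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ"

def encode_base62(n: int) -> str:
    """Encode a non-negative integer ID as a Base62 short code."""
    if n == 0:
        return ALPHABET[0]
    digits = []
    while n > 0:
        n, rem = divmod(n, 62)
        digits.append(ALPHABET[rem])
    return "".join(reversed(digits))

def decode_base62(s: str) -> int:
    """Inverse of encode_base62: short code back to numeric ID."""
    n = 0
    for ch in s:
        n = n * 62 + ALPHABET.index(ch)
    return n
```

Seven Base62 characters cover 62^7 (about 3.5 trillion) IDs, comfortably beyond a billion URLs, which is why the key-service approach sidesteps collisions entirely.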
2. Design a News Feed (Facebook/Twitter Timeline)
Why it's asked: It tests your understanding of fan-out strategies, ranking algorithms, and the tension between consistency and latency.
The critical trade-off — fan-out on write vs. fan-out on read:
| Approach | How It Works | Pros | Cons |
|---|---|---|---|
| Fan-out on write (push) | Pre-compute feed for each follower when a post is created | Fast reads, O(1) feed fetch | Expensive for users with millions of followers (celebrity problem) |
| Fan-out on read (pull) | Assemble feed at read time by querying followed users' posts | No wasted writes | Slow reads, high latency at scale |
| Hybrid (what Meta uses) | Push for normal users, pull for celebrities | Balances both | More complex to implement |
Senior-level follow-up: "A user follows 500 accounts and opens the app. Walk me through the exact data path from request to rendered feed, including cache layers." You should be able to trace the request through an API gateway, feed service, ranked feed cache (Redis/Memcached), and a fallback to the posts database with a merge-sort across followed users.
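The pull-side fallback in that data path is essentially a k-way merge. A sketch, assuming each followed user's posts arrive as a list of (timestamp, post_id) pairs already sorted newest-first:

```python
import heapq
from itertools import islice

def merge_feeds(per_user_posts, limit=20):
    """Merge each followed user's newest-first post list into one
    newest-first feed, keeping only the top `limit` entries.
    Each input list must be sorted by descending timestamp."""
    merged = heapq.merge(*per_user_posts, key=lambda p: p[0], reverse=True)
    return list(islice(merged, limit))
```

In production the inputs would come from per-user post caches, and a ranking model would rescore the merged candidates before rendering.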
3. Design a Key-Value Store (Distributed Cache)
Why it's asked: This is a pure distributed systems question. It reveals whether you understand consistent hashing, replication, conflict resolution, and the CAP theorem at a practical level.
What to cover: Consistent hashing with virtual nodes (the technique DynamoDB borrowed from Amazon's 2007 Dynamo paper), configurable quorum reads/writes (W + R > N for strong consistency), vector clocks or last-write-wins for conflict resolution, gossip protocol for failure detection.
Senior-level question: "You have a 5-node cluster and need to tolerate 2 node failures. What values of N, W, and R do you choose?" (Answer: N=5, W=3, R=3. This gives you strong consistency and survives 2 failures, but writes require 3 acknowledgments, increasing latency.)
Category 2: Real-Time Systems
4. Design a Chat Application (WhatsApp/Messenger)
Why it's asked: It tests real-time communication, message delivery guarantees, and encryption considerations.
Core architecture: WebSocket connections for real-time delivery, Kafka for durability, a chat service for routing, and Cassandra for message history (write-optimized, wide-column design).
The presence system: Tracking online/offline status for 2 billion users requires a heartbeat mechanism (30-second timeout) with Redis for state. The interviewer will ask how you handle the thundering herd when a server with 100K connections goes down.
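A toy in-memory version of the heartbeat logic, standing in for a Redis key with a TTL (in Redis each heartbeat would refresh the key with SET ... EX 30, and expiry means offline):

```python
import time

class PresenceTracker:
    """In-memory stand-in for a Redis-backed presence store: each
    heartbeat refreshes a per-user timestamp, and a user counts as
    online while the last heartbeat is within the TTL window."""
    def __init__(self, ttl_seconds=30):
        self.ttl = ttl_seconds
        self.last_beat = {}

    def heartbeat(self, user_id, now=None):
        self.last_beat[user_id] = now if now is not None else time.time()

    def is_online(self, user_id, now=None):
        now = now if now is not None else time.time()
        beat = self.last_beat.get(user_id)
        return beat is not None and now - beat < self.ttl
```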
5. Design a Notification System
Why it's asked: It covers multiple delivery channels (push, SMS, email), priority queues, rate limiting, and delivery semantics.
Key design decision: Use a priority queue with separate workers per channel. High-priority notifications (security alerts, 2FA codes) skip the rate limiter. User preferences determine active channels per notification type.
Senior-level question: "How do you guarantee no duplicate notifications?" (Answer: idempotency key in a dedup cache with TTL. Before dispatching, check the cache. Use at-least-once delivery from Kafka and dedup at the consumer.)
6. Design a Real-Time Collaborative Editor (Google Docs)
Why it's asked: A staff-level question testing conflict resolution — specifically Operational Transformation (OT) or CRDTs.
The core problem: Two users edit simultaneously. Without conflict resolution, edits overwrite each other.
| Algorithm | Central Server? | Complexity | Used By |
|---|---|---|---|
| Operational Transformation (OT) | Yes | High | Google Docs |
| CRDTs | No | Medium | Figma, Yjs |
| Last-write-wins (naive) | No | Low | Not viable for collaboration |
Category 3: Infrastructure Components
7. Design a Rate Limiter
Why it's asked: Every API needs one, and the question reveals whether you understand the algorithms, their trade-offs, and distributed coordination challenges.
Algorithm comparison:
| Algorithm | How It Works | Pros | Cons |
|---|---|---|---|
| Token bucket | Tokens added at fixed rate; each request consumes a token | Allows bursts, smooth rate limiting | Requires tuning bucket size and refill rate |
| Sliding window log | Stores timestamp of each request in a sorted set | Precise, no boundary issues | Memory-intensive at high QPS |
| Fixed window counter | Counts requests per time window | Simple, low memory | Boundary spike problem (2x burst at window edges) |
| Sliding window counter | Weighted combination of current and previous window | Good accuracy, low memory | Approximate |
Senior-level concern: In a distributed system with multiple API servers, where does the rate limit state live? (Answer: centralized Redis with Lua scripts for atomic check-and-increment. At 100K+ QPS, consider local counters with periodic sync to reduce Redis round trips.)
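A single-node token bucket sketch; the distributed version moves this same state and logic into a Redis Lua script so check-and-decrement stays atomic:

```python
class TokenBucket:
    """Token-bucket rate limiter: tokens accrue at `rate` per second
    up to `capacity`; each allowed request consumes one token."""
    def __init__(self, rate: float, capacity: float):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.last = 0.0

    def allow(self, now: float) -> bool:
        # Refill based on elapsed time, clamped to bucket capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```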
8. Design a Distributed Message Queue (Kafka)
Why it's asked: Message queues are foundational to event-driven architectures. This tests partitioning, consumer groups, ordering, and durability.
Key concepts: Topics and partitions, hash-based producer partitioning for per-key ordering, consumer groups for parallel processing, replication factor of 3, ISR (in-sync replicas) for durability, and offset management combined with idempotent producers and transactions for exactly-once semantics.
Scale reference: LinkedIn's Kafka clusters handle over 7 trillion messages per day.
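Hash-based partitioning is the mechanism behind per-key ordering: the same key always lands on the same partition, and a partition is consumed in order. A sketch (Kafka's default partitioner uses murmur2; md5 here is just a stable stand-in):

```python
import hashlib

def partition_for(key: str, num_partitions: int) -> int:
    """Map a message key to a partition deterministically, so all
    messages for one key preserve their relative order."""
    digest = hashlib.md5(key.encode()).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions
```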
9. Design a Content Delivery Network (CDN)
Why it's asked: CDNs reveal your understanding of caching hierarchies, DNS-based routing, and cache invalidation.
Architecture layers: Origin server → shield/mid-tier cache → edge PoPs. DNS or anycast routing directs users to the nearest edge. Cache invalidation uses TTLs combined with purge APIs.
Senior-level question: "A breaking news article goes viral and your origin is hammered despite the CDN. What's happening?" (Answer: cache miss stampede. Solution: request coalescing — the edge holds duplicate requests and serves them all from the single origin response.)
Category 4: Search, Discovery, and Data Processing
10. Design a Web Crawler
Why it's asked: It tests distributed task scheduling, URL deduplication at scale, and politeness policies.
Core architecture: URL frontier (priority queue with politeness constraints), DNS resolver with caching, HTML fetcher pool, content parser, URL extractor, and a deduplication store (Bloom filter — for 10 billion URLs, a Bloom filter with a 1% false-positive rate needs roughly 12 GB).
Key constraint: Politeness. Respect robots.txt and enforce per-domain crawl delays. A multi-queue frontier with one queue per domain solves this.
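The Bloom filter figure comes from the standard sizing formulas m = -n·ln(p)/(ln 2)² bits and k = (m/n)·ln 2 hash functions. A quick calculator:

```python
import math

def bloom_size(n_items: int, fp_rate: float):
    """Return (bits, hash_count) for an optimally sized Bloom filter
    holding n_items at the target false-positive rate."""
    m_bits = -n_items * math.log(fp_rate) / (math.log(2) ** 2)
    k_hashes = (m_bits / n_items) * math.log(2)
    return math.ceil(m_bits), round(k_hashes)
```

At 10 billion URLs and a 1% false-positive rate this works out to about 9.6 bits per URL, roughly 12 GB with 7 hash functions, which is why dedup fits in memory on a small cluster.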
11. Design a Search Autocomplete (Typeahead)
Why it's asked: It combines trie data structures, ranking signals, and latency requirements under 100ms.
Data structure: A trie (prefix tree) where each node stores the top-K completions by frequency. For 100 million unique queries, the trie fits in memory on a single machine (~10 GB). For freshness, a separate offline pipeline (MapReduce or Spark) rebuilds the trie periodically with updated frequency counts.
Senior-level question: "How do you personalize autocomplete?" (Answer: merge the global trie results with a per-user recent-queries cache stored in Redis, weighted by recency.)
12. Design a Video Streaming Platform (YouTube/Netflix)
Why it's asked: It spans upload pipelines, transcoding, adaptive bitrate streaming, and CDN delivery.
Core pipeline: Client uploads to blob storage (S3), triggering transcoding jobs via a message queue. The transcoder produces multiple resolutions and codecs (H.264, VP9, AV1). Playback uses adaptive bitrate streaming (HLS/DASH) where the player switches quality levels based on bandwidth.
13. Design a Ride-Sharing Service (Uber/Lyft)
Why it's asked: It covers geospatial indexing, real-time matching, and ETA estimation.
Geospatial indexing: Use a quadtree or geohash-based index to find nearby drivers. Uber built H3, a hexagonal hierarchical index, and Google's S2 library takes a similar approach; both partition the Earth's surface into cells at multiple resolutions.
Matching: Query the spatial index for available drivers within a radius, rank by ETA (via a routing engine like OSRM), and dispatch the closest one. At Uber's scale of 28 million rides per day, matching must complete within 2–3 seconds.
14. Design an E-Commerce Platform (Amazon)
Why it's asked: It lets interviewers probe any subsystem — catalog, cart, checkout, inventory, or payments.
The inventory problem is where seniors shine: When 1,000 users try to buy the last item simultaneously, how do you prevent overselling? (Answer: optimistic locking with a version counter on the inventory row. The first successful compare-and-swap wins. For flash sales, pre-allocate inventory tokens into a Redis queue — each dequeue is atomic.)
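The compare-and-swap can be sketched in a few lines; in SQL it is an UPDATE ... WHERE id = ? AND version = ? that succeeds only if exactly one row is affected:

```python
class InventoryRow:
    """Single inventory row guarded by a version counter. A purchase
    succeeds only if the caller's version still matches, i.e. no
    concurrent purchase committed since the caller read the row."""
    def __init__(self, stock: int):
        self.stock = stock
        self.version = 0

    def read(self):
        return self.stock, self.version

    def try_purchase(self, expected_version: int) -> bool:
        if self.version != expected_version or self.stock <= 0:
            return False  # stale read or sold out: caller must re-read and retry
        self.stock -= 1
        self.version += 1
        return True
```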
15. Design a Metrics and Logging System
Why it's asked: Observability separates senior candidates from mid-level ones. Mentioning monitoring unprompted signals production experience.
Architecture: Agents on each host ship logs/metrics to a collector (Fluentd/Vector), writing to a time-series DB (Prometheus) for metrics and Elasticsearch for logs. Grafana provides dashboards and alerting.
Scale reference: Uber ingests over 100 billion metrics data points per day.
The 4-Step Framework for Answering Any System Design Question
Whether you are designing a chat app or a CDN, this framework keeps your answer structured and your interviewer engaged. For a deeper walkthrough with 25+ practice problems, the Grokking the System Design Interview course walks you through each step with diagrams and real examples.
Step 1: Clarify Requirements (5 minutes)
Ask functional and non-functional requirements. Functional: "Should the chat support group messages or only 1:1?" Non-functional: "What's our latency target? What consistency model do we need?"
Calculate back-of-the-envelope estimates. If the system has 100 million DAU and each user sends 40 messages/day, that's 4 billion messages/day ≈ 46K writes per second.
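The arithmetic, spelled out:

```python
dau = 100_000_000           # daily active users
msgs_per_user = 40          # messages sent per user per day
seconds_per_day = 86_400

writes_per_day = dau * msgs_per_user             # 4 billion writes/day
writes_per_sec = writes_per_day / seconds_per_day  # ~46,296 writes/sec
```

Round to 46K writes/sec, then double or triple it for peak traffic when sizing the write path.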
Step 2: High-Level Design (10 minutes)
Draw the core components: clients, API gateway, application services, databases, caches, message queues. Define the API contracts (REST or gRPC endpoints). Identify the data model — which entities exist and how they relate.
Step 3: Deep Dive (20 minutes)
Pick 2–3 components and go deep. This is where you win or lose. If designing a chat system, deep-dive into message delivery and presence. If it's a news feed, focus on fan-out strategy and ranking.
Show trade-offs explicitly: "I'm choosing Cassandra here because our write volume is 10x reads, and its LSM-tree engine handles that well. The trade-off is eventual consistency, acceptable for feeds but not payments."
Step 4: Scale and Harden (10 minutes)
Address bottlenecks at 10x traffic. Add caching, database sharding, read replicas, and a CDN for static assets. Discuss failure modes: what if Redis goes down? What if a datacenter fails? Mention monitoring and alerting — this signals production readiness.
Sample Follow-Up Questions and Model Answers
Q: "Your database is the bottleneck. What do you do?" A: Profile queries to find hot keys. Add read replicas for read-heavy traffic. If writes are the bottleneck, shard on user_id with hash-based sharding. Migrate using dual-writes with a shadow table for validation.
Q: "How would you handle a region-level outage?" A: Active-active multi-region with async replication. DNS failover (Route 53 with 30-second TTL) redirects traffic. The trade-off is potential data loss equal to the replication lag window (typically under 1 second intra-continent).
Q: "Your cache hit ratio dropped from 95% to 60%. Diagnose this." A: Three likely causes: a new feature introduced access patterns the cache key schema doesn't cover, a deployment changed the key format (invalidating all entries), or the working set grew beyond cache capacity. Check eviction rate and memory usage to pinpoint.
Q: "A user posts a message and refreshes but doesn't see it. How do you fix this?" A: Read-after-write consistency. Route the author's own reads to the primary replica. Alternatively, set a "last write timestamp" cookie and only serve from replicas caught up past that timestamp.
Q: "How do you test this system before launch?" A: Load testing with realistic traffic (k6 or Locust). Chaos engineering (kill nodes, inject latency). Shadow traffic: replay production requests against the new system and diff outputs.
How to Prepare: A Study Plan for Senior Engineers
Weeks 1–2: Fundamentals. Study the building blocks — databases (SQL vs. NoSQL trade-offs), caching (write-through, write-behind, cache-aside), load balancing, message queues, and consistent hashing. The Ultimate System Design Interview Guide covers these fundamentals alongside real FAANG question breakdowns.
Weeks 3–4: Practice core questions. Work through the 15 questions in this guide. For each, spend 45 minutes designing on a whiteboard, then compare with reference solutions.
Weeks 5–6: Mock interviews. Practice with a partner or mock interview platform. Focus on follow-up questions — that's where senior candidates differentiate themselves.
Frequently Asked Questions
How many system design rounds are there for senior engineer interviews at FAANG?
Most FAANG companies include 1–2 system design rounds for senior (L5/E5) roles. Staff-level (L6/E6) candidates often face 2–3 rounds, sometimes with a dedicated deep dive on a single component for 60 minutes.
What is the difference between high-level design and low-level design in interviews?
High-level design covers the overall architecture: which services exist, how they communicate, and data flow. Low-level design zooms into a specific component: database schema, API contract, or internal algorithm. Senior candidates must demonstrate both.
How long should I spend on clarifying questions in a system design interview?
Aim for 3–5 minutes. Define scope, top 3 functional requirements, and scale (DAU, QPS, storage). Less than 2 minutes signals rushing. More than 7 signals stalling.
Which system design questions are asked most often at Google, Meta, and Amazon?
The most frequent: Design a URL shortener, news feed, chat/messaging system, notification system, and web crawler. Amazon also asks e-commerce problems (shopping cart, inventory, order processing).
Do I need to know specific technologies like Kafka or Redis for system design interviews?
You should know when to reach for them. "I'd use a message queue" is junior-level. "I'd use Kafka because we need ordered processing per partition key and durable replay" is senior-level.
How do I handle a system design question I've never seen before?
Apply your framework. Every system has users, an API layer, compute, storage, and communication between components. Clarify requirements, estimate scale, design the data model, build layer by layer. Novel questions test reasoning, not memorization.
Should I mention monitoring and observability in system design interviews?
Yes. Bringing up monitoring unprompted is one of the strongest signals of production experience. Mention latency p99, error rates, throughput metrics, and distributed tracing for debugging.
What's the biggest mistake senior engineers make in system design interviews?
Jumping into the solution without clarifying requirements. The second biggest: treating it as a monologue. Check in with your interviewer to show the collaboration expected at senior levels.
TL;DR
System design interview questions for senior engineers test your ability to architect scalable systems and articulate trade-offs. The 15 most common questions fall into four categories: data-intensive systems (URL shortener, news feed, key-value store), real-time systems (chat, notifications, collaborative editing), infrastructure components (rate limiter, message queue, CDN), and search/discovery systems (web crawler, autocomplete, video streaming). Use a 4-step framework: clarify requirements, sketch high-level design, deep-dive into 2–3 components, then address scaling and failure modes. What separates senior candidates is not knowing more components; it's knowing why you'd choose one over another and discussing production-level operational concerns.