Choosing a suitable database for high-volume transactional data

Choosing a database for high-volume transactional data is the most consequential decision in any system design because the database determines your latency floor, throughput ceiling, consistency guarantees, and scaling path—and changing it later requires a painful, risky migration. A transactional database (OLTP) handles thousands to millions of short, concurrent operations per second—inserts, updates, deletes—while maintaining ACID guarantees that ensure every transaction either fully succeeds or fully rolls back. In system design interviews, interviewers do not want you to name a database; they want you to derive the choice from requirements. "I would use PostgreSQL" is a statement. "I would use PostgreSQL because the workload is read-heavy with complex joins, requires ACID transactions for payment integrity, and the data model is relational with strong referential constraints" is engineering reasoning that earns points.

Key Takeaways

  • Start from workload shape, not database brand. Characterize the workload first: read/write ratio, query complexity, consistency requirements, data volume, and latency targets. The workload profile determines the database category; the specific product is secondary.
  • ACID transactions are mandatory for financial data, inventory, bookings, and any system where partial updates cause business harm. Choose PostgreSQL, MySQL, or Aurora for single-region ACID. Choose Cloud Spanner or CockroachDB for globally distributed ACID.
  • DynamoDB is the default for simple key-value and document workloads at massive scale—single-digit millisecond reads, automatic sharding, zero operational overhead. But it lacks joins, complex queries, and strong multi-item transactions.
  • The database is usually the first bottleneck. Application servers scale horizontally with ease. Databases scale through read replicas (reads), connection pooling (connections), caching (repeated queries), and sharding (writes and storage). Each technique adds complexity.
  • In interviews, justify every database choice with three properties: the access pattern it supports, the consistency guarantee it provides, and the scaling path it enables. This three-part justification is the framework interviewers expect.

The Database Selection Framework

Before naming a database, answer five questions about your workload.

1. What is the read/write ratio?

  • Read-heavy (10:1 or higher): B-tree databases (PostgreSQL, MySQL) excel. Add read replicas and caching for scale.
  • Write-heavy (write-dominant): LSM-tree databases (Cassandra, ScyllaDB) convert random writes to sequential I/O for higher throughput.
  • Balanced (around 1:1): PostgreSQL with proper indexing handles balanced workloads well up to moderate scale.

2. What query patterns do you need?

  • Point lookups by primary key: DynamoDB, Redis, Cassandra—optimized for key-value access.
  • Complex joins across multiple tables: PostgreSQL, MySQL—relational databases with query planners that optimize multi-table queries.
  • Full-text search: Elasticsearch—purpose-built for text search with relevance ranking.
  • Time-series queries: TimescaleDB, InfluxDB—optimized for time-ordered data with retention policies.

3. What consistency guarantees are required?

  • Strong consistency (linearizability): PostgreSQL, Spanner, CockroachDB—every read returns the latest write.
  • Eventual consistency acceptable: DynamoDB (default), Cassandra—reads may be stale temporarily.
  • Tunable consistency: DynamoDB (optional strong reads), Cassandra (per-query quorum)—choose consistency versus performance per operation.

4. What is the data volume and growth rate?

  • Under 1 TB: a single PostgreSQL instance handles this comfortably with proper indexing.
  • 1–10 TB: PostgreSQL with partitioning, or DynamoDB for automatic sharding.
  • 10+ TB: sharded PostgreSQL (Citus), DynamoDB, Cassandra, or Spanner for distributed storage.
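A quick back-of-envelope calculation makes the volume question concrete. The numbers below (row size, write rate) are illustrative assumptions, not figures from this article:

```python
# Illustrative capacity estimate — row size and write rate are assumed.
ROW_SIZE_BYTES = 1_000        # ~1 KB per order row, indexes included (assumption)
WRITES_PER_DAY = 500_000      # assumed sustained insert rate

bytes_per_year = ROW_SIZE_BYTES * WRITES_PER_DAY * 365
tb_per_year = bytes_per_year / 1e12

print(f"{tb_per_year:.2f} TB/year")   # ~0.18 TB/year
```

At this assumed rate it takes over five years to cross 1 TB, which keeps a single well-indexed PostgreSQL instance viable for a long time—exactly the kind of estimate interviewers expect before you reach for a distributed store.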

5. What are the latency targets?

  • Sub-millisecond: Redis—the data must live in memory or a cache.
  • Single-digit milliseconds: DynamoDB, Cassandra, Aurora—fast point reads on SSDs.
  • 10–50 ms: PostgreSQL with proper indexing—acceptable for most web applications.
  • Seconds: acceptable only for batch analytics (BigQuery, Redshift), not transactional workloads.

Database Options for High-Volume Transactions

PostgreSQL

Engine: Heap storage with B-tree indexes
Consistency: Strong (ACID; serializable isolation available)
Scaling: Vertical + read replicas; horizontal via the Citus extension
Max practical throughput: ~10,000–50,000 TPS on a single instance (hardware-dependent)

PostgreSQL is the default transactional database for system design interviews. It provides full ACID compliance, complex query support (joins, subqueries, CTEs, window functions), and the most mature extension ecosystem of any open-source database. Aurora PostgreSQL adds managed failover, up to 15 read replicas, and up to 3x the throughput of standard PostgreSQL.

Best for: E-commerce order management, payment systems, user account management, booking systems, CMS platforms—any workload requiring relational integrity with complex queries.

Interview application: "I would use Aurora PostgreSQL for the order service. The data model is relational—orders reference users, products, and addresses through foreign keys. The workload is 70% reads (order history, dashboards) and 30% writes (new orders, status updates). Aurora provides up to 15 read replicas for the read-heavy dashboard queries and automated failover for high availability."

MySQL (InnoDB)

Engine: B-tree (InnoDB storage engine)
Consistency: Strong (ACID)
Scaling: Vertical + read replicas; Vitess for horizontal sharding

MySQL is the second most common relational database in interviews. InnoDB provides ACID transactions, row-level locking, and foreign key constraints. MySQL scales through read replicas and, at massive scale, through Vitess—the sharding middleware that YouTube, Slack, and GitHub use.

Best for: Web applications with well-understood relational schemas. Applications migrating from legacy MySQL deployments.

DynamoDB

Engine: LSM-tree internally (proprietary)
Consistency: Eventually consistent (default); strongly consistent reads available at 2x cost
Scaling: Automatic and unlimited—no manual sharding or capacity planning
Throughput: Millions of requests per second

DynamoDB is the default for key-value and document workloads at scale. It provides single-digit millisecond latency for point reads, automatic sharding, and zero operational overhead. DynamoDB Global Tables replicate data across regions with replication lag typically under one second.

Limitations: No joins. No complex queries beyond primary key and sort key access patterns. Limited secondary index flexibility. Multi-item transactions supported but limited to 100 items per transaction.

Best for: Session stores, user profiles (simple lookups), URL shorteners, shopping carts, gaming leaderboards, IoT device state—any workload dominated by single-item reads and writes with simple access patterns.

Interview application: "I would use DynamoDB for the session store. Each session is a simple key-value record accessed by session_id. DynamoDB provides sub-5ms reads at any scale with no operational overhead. The trade-off versus PostgreSQL: we lose JOIN capability, but sessions do not need joins—they are independent documents."

Cloud Spanner

Engine: Proprietary (split-based with Paxos consensus)
Consistency: Strong (globally consistent using TrueTime synchronized clocks)
Scaling: Horizontal, globally distributed, automatic resharding

Cloud Spanner is one of the very few production databases that provide globally distributed SQL with strong consistency. TrueTime uses GPS receivers and atomic clocks in every Google data center to synchronize time across regions, enabling consistent reads after writes across continents.

Best for: Global financial systems, multi-region inventory management, any system where global SQL consistency is a hard requirement.

Interview application: "For the global payment ledger serving users in North America, Europe, and Asia, I would use Cloud Spanner. It provides strongly consistent reads globally—a transfer initiated in Tokyo is immediately visible in New York. Few other managed databases offer this guarantee. The trade-off is cost: Spanner starts at approximately $0.90 per node-hour versus roughly $0.10 per hour for Aurora."

Cassandra / ScyllaDB

Engine: LSM-tree
Consistency: Tunable (ONE, QUORUM, ALL per query)
Scaling: Horizontal with consistent hashing; no single-leader bottleneck

Cassandra handles extreme write throughput—hundreds of thousands of writes per second—through its LSM-tree engine and leaderless replication. Every node accepts writes, eliminating the single-leader bottleneck of PostgreSQL. ScyllaDB is a C++ reimplementation of Cassandra that delivers substantially higher throughput and lower tail latencies on the same hardware.

Best for: Time-series data (IoT sensor readings, application metrics), event logging, write-heavy workloads where eventual consistency is acceptable.

CockroachDB

Engine: LSM-tree (Pebble storage engine)
Consistency: Strong (serializable isolation)
Scaling: Horizontal with automatic range-based sharding

CockroachDB provides distributed SQL with serializable transactions—similar to Spanner but without requiring Google's TrueTime hardware. It runs on standard cloud VMs, making it more accessible than Spanner for teams not on GCP. CockroachDB uses a Raft-based consensus protocol for replication and automatic range splitting for horizontal scaling.

Best for: Applications requiring distributed SQL with strong consistency across multiple regions without GCP lock-in. Multi-region SaaS applications needing serializable transactions with horizontal scaling.

Database Comparison for High-Volume Transactions

| Database | Engine | Consistency | Joins | Max TPS (single node) | Scaling Model | Best For |
|---|---|---|---|---|---|---|
| PostgreSQL | B-tree | Strong (ACID) | Full SQL | 10K–50K | Vertical + replicas | Complex relational workloads |
| Aurora PostgreSQL | B-tree | Strong (ACID) | Full SQL | 50K–200K | Managed replicas | High-availability relational |
| MySQL (InnoDB) | B-tree | Strong (ACID) | Full SQL | 10K–50K | Vertical + Vitess | Web applications |
| DynamoDB | LSM-tree | Tunable | None | Unlimited (auto) | Auto-sharding | Key-value at massive scale |
| Cloud Spanner | Proprietary | Global strong | SQL | Scales horizontally | Auto-resharding | Global SQL consistency |
| Cassandra | LSM-tree | Tunable | Limited | 100K+ writes | Leaderless horizontal | Write-heavy, time-series |
| CockroachDB | LSM-tree (Pebble) | Strong (serializable) | Full SQL | 10K–50K | Horizontal | Distributed SQL |

Scaling Transactional Databases

Read Scaling: Replicas

Create 1–3 read replicas that receive changes replicated from the primary, and route read traffic to them. Aurora supports up to 15 replicas with sub-100ms replication lag. This multiplies read throughput roughly linearly with replica count.

Trade-off: Replicas may serve slightly stale data during replication lag. Route balance-critical reads (payment confirmation) to the primary. Route dashboard and history reads to replicas.
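The routing rule above can be sketched in a few lines. This is a hypothetical illustration—the endpoint names and query categories are invented, and real systems route at the driver or proxy layer:

```python
import random

# Stand-ins for real connection endpoints (assumed names).
PRIMARY = "primary"
REPLICAS = ["replica-1", "replica-2", "replica-3"]

# Reads that must observe the latest write go to the primary;
# everything else tolerates replication lag (assumed categorization).
READ_YOUR_WRITES = {"payment_confirmation", "account_balance"}

def pick_endpoint(query_kind: str) -> str:
    """Route lag-sensitive reads to the primary, the rest to a random replica."""
    if query_kind in READ_YOUR_WRITES:
        return PRIMARY
    return random.choice(REPLICAS)
```

The key design point is that the split is per-query, not per-service: the same order service sends payment confirmations to the primary and order-history pages to replicas.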

Connection Scaling: Pooling

PgBouncer (PostgreSQL) or ProxySQL (MySQL) maintains a pool of reusable database connections. At 10,000 concurrent API requests, connection pooling serves all requests through 50–100 database connections—preventing connection exhaustion that crashes the database.
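A toy version of the pooling idea shows why it prevents connection exhaustion: requests beyond the pool size queue up instead of opening new connections. This is a minimal sketch, not how PgBouncer is implemented:

```python
import queue

class ConnectionPool:
    """Toy fixed-size pool: all requests share max_size connections
    instead of opening one connection each."""

    def __init__(self, make_conn, max_size: int = 50):
        self._pool = queue.Queue(maxsize=max_size)
        for _ in range(max_size):
            self._pool.put(make_conn())  # open all connections up front

    def acquire(self, timeout: float = 1.0):
        # Blocks (up to timeout) when the pool is empty, so excess
        # concurrency waits in line rather than exhausting the database.
        return self._pool.get(timeout=timeout)

    def release(self, conn) -> None:
        self._pool.put(conn)  # return the connection for reuse
```

With a pool of 50–100, 10,000 concurrent API requests translate into at most 100 database connections—the rest wait a few milliseconds for a free slot.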

Write Scaling: Sharding

When a single primary cannot handle write throughput, shard data across multiple primaries. Hash-based sharding on a unique key (user_id, order_id) distributes writes evenly. Each shard is an independent database handling a fraction of total writes.

Trade-off: Cross-shard queries become expensive (scatter-gather). Cross-shard transactions require two-phase commit or saga pattern. Shard only when vertical scaling and caching are insufficient.
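Hash-based shard selection is a one-liner in spirit: hash the shard key, take it modulo the shard count. A minimal sketch (the shard count and key format are assumptions):

```python
import hashlib

NUM_SHARDS = 8  # assumed; changing it later requires rebalancing data

def shard_for(user_id: str) -> int:
    """Stable hash-based shard choice: the same user always lands on the
    same shard, and keys spread roughly evenly across shards."""
    digest = hashlib.md5(user_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % NUM_SHARDS
```

Note the use of a stable hash (MD5 here) rather than Python's built-in `hash()`, which is randomized per process—routing must agree across every application server.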

Query Scaling: Caching

Redis with a 90–95% hit ratio reduces database load by 10–20x. Cache frequently accessed query results with appropriate TTLs. Use cache-aside pattern: check cache first, query database on miss, populate cache on response.
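The cache-aside pattern described above can be sketched as follows. A plain dict stands in for Redis, and `query_database` is a hypothetical placeholder for the real query:

```python
import time

cache: dict = {}          # stand-in for Redis
CACHE_TTL_SECONDS = 60    # assumed TTL

def query_database(user_id: str) -> dict:
    # Placeholder for the expensive database query.
    return {"user_id": user_id, "name": "example"}

def get_user(user_id: str) -> dict:
    """Cache-aside: check the cache first, query the database on a miss,
    then populate the cache with a TTL."""
    entry = cache.get(user_id)
    if entry is not None and entry["expires_at"] > time.time():
        return entry["value"]                       # cache hit
    value = query_database(user_id)                 # cache miss
    cache[user_id] = {"value": value,
                      "expires_at": time.time() + CACHE_TTL_SECONDS}
    return value
```

The TTL is the knob that trades freshness for hit ratio: longer TTLs raise the hit ratio (and the load reduction) but serve staler data.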

For structured practice applying database selection decisions across complete system design problems, Grokking the System Design Interview covers transactional database trade-offs in every design solution. For advanced database patterns including distributed transactions, multi-region replication, and production-scale sharding strategies, Grokking the Advanced System Design Interview builds the depth required for L6+ interviews. The System Design Interview guide provides the end-to-end framework for integrating database selection into every interview phase.

Frequently Asked Questions

How do I choose a database in a system design interview?

Start from workload shape: read/write ratio, query complexity, consistency requirements, data volume, and latency targets. Derive the database category from these properties, then name a specific product. Justify with three properties: access pattern, consistency guarantee, and scaling path.

When should I choose PostgreSQL over DynamoDB?

Choose PostgreSQL when you need complex joins, relational integrity with foreign keys, ACID transactions across multiple tables, or complex analytical queries. Choose DynamoDB when you need simple key-value access at unlimited scale with zero operational overhead and eventual consistency is acceptable.

What is the difference between OLTP and OLAP databases?

OLTP (Online Transaction Processing) handles high volumes of short, concurrent transactions with strong consistency—PostgreSQL, MySQL, DynamoDB. OLAP (Online Analytical Processing) handles complex analytical queries over large datasets—BigQuery, ClickHouse, Redshift. Most systems need both: OLTP for operations, OLAP for analytics.

When should I consider Cloud Spanner?

When you need globally distributed SQL with strong consistency—reads in any region return the latest write from any other region. Typical use cases: global financial systems, multi-region inventory management. Spanner costs significantly more than Aurora (~9x per hour), so use it only when global consistency is a hard requirement.

How do I scale a transactional database?

In order of complexity: add a caching layer (Redis, 90%+ hit ratio), add read replicas (multiply read capacity), implement connection pooling (PgBouncer/ProxySQL), and finally shard (distribute writes). Each step adds complexity. Exhaust simpler options before sharding.

What is connection pooling and why is it critical?

Connection pooling maintains a fixed set of reusable database connections shared across requests. Without pooling, each concurrent request opens a new connection, and at 10,000 requests the database exhausts its connection limit and crashes. PgBouncer and ProxySQL solve this at the infrastructure level.

Should I use NoSQL for all high-throughput workloads?

No. NoSQL databases like DynamoDB excel at simple key-value access at massive scale. But if your workload requires joins, complex queries, or multi-table transactions, relational databases (PostgreSQL, Aurora) are more appropriate. High throughput with relational requirements → Aurora with read replicas and caching.

What is the difference between Aurora and standard PostgreSQL?

Aurora uses a distributed storage engine that provides up to 3x the throughput of standard PostgreSQL, up to 15 read replicas with sub-100ms lag, automated failover typically in under 30 seconds, and continuous backup to S3. Standard PostgreSQL requires manual replica management and has lower throughput limits. Aurora costs more but reduces operational overhead.

When should I shard my transactional database?

When a single primary cannot handle write throughput or storage capacity after exhausting vertical scaling, read replicas, caching, and connection pooling. Most systems do not need sharding until millions of daily transactions. Sharding adds cross-shard query complexity and distributed transaction overhead.

How do I discuss database selection in a system design interview?

During estimation, quantify QPS, storage, and read/write ratio. During database selection, name the database with a three-part justification: "I chose [database] because [access pattern fit], [consistency guarantee], and [scaling path]." During trade-offs, compare your choice against the alternative you rejected and explain why the trade-off favors your selection.

TL;DR

Database selection for high-volume transactions starts from workload shape, not brand preference. Characterize the workload across five dimensions: read/write ratio, query complexity, consistency requirements, data volume, and latency targets. PostgreSQL (Aurora) is the default for complex relational workloads with ACID requirements—handles 10K–50K TPS with read replicas and caching for scale. DynamoDB is the default for simple key-value access at unlimited scale with zero ops overhead—single-digit millisecond reads, automatic sharding. Cloud Spanner provides globally distributed SQL with strong consistency using TrueTime—significantly more expensive, justified only when global consistency is a hard requirement. Cassandra/ScyllaDB handles extreme write throughput (100K+ writes/second) through LSM-tree and leaderless replication—best for time-series and event logging. Scale through a predictable progression: caching (simplest, highest ROI) → read replicas → connection pooling → sharding (most complex, last resort). In interviews, justify every database choice with access pattern, consistency guarantee, and scaling path.

TAGS
System Design Interview
System Design Fundamentals
CONTRIBUTOR
Design Gurus Team