1. Bloom Filters

Grokking the Advanced System Design Interview

Ask Author

Back to course home

0% completed

Vote For New Content

1. Bloom Filters

Table of Contents

Contents are not accessible

Background

If we have a large set of structured data (identified by record IDs) stored in a set of data files, what is the most efficient way to know which file might contain our required data? We don't want to read each file, as that would be slow, and we have to read a lot of data from the disk. One solution can be to build an index on each data file and store it in a separate index file. This index can map each record ID to its offset in the data file. Each index file will be sorted on the record ID. Now, if we want to search an ID in this index, the best we can do is a Binary Search

.....

Like the course? Get enrolled and start learning!

Table of Contents

Contents are not accessible