Motivation
- A hash table is an implementation of Set/Map aiming for average (expected) constant-time operations.
Suppose we have an array:
data:image/s3,"s3://crabby-images/5ecd4/5ecd4ff9f9c7be89f2da7a796dfb6a7457fb21bd" alt=""
And we want to store a bunch of elements in it:
$$ \text{cat}, \space \text{bat}, \space \text{tap}, \space \text{mad}, \text{dam}, \space \text{nap}, \space \text{pat} $$
What will we do?
Well, we most likely store them sequentially, one after another.
data:image/s3,"s3://crabby-images/bf1c5/bf1c5952daa3a582b4d354bea2858ac357502326" alt=""
Here, insertion is fast (constant time), and searching is slow (linear time).
We can get a logarithmic-time search (via binary search) if we keep the data sorted, but that means extra work for insertion (linear time).
data:image/s3,"s3://crabby-images/e745f/e745f03253a7e3197caabc0a30d03360d1bdc32e" alt=""
We could organize the data in a (balanced) binary search tree to get logarithmic-time insert/remove and find.
data:image/s3,"s3://crabby-images/937d1/937d1a903ede2e74fa7a397fcfe1e3905316cc14" alt=""
But is there any way to get constant time for all the core operations?
A hash table is a data structure aiming for average (expected) constant-time operations.