jeremykun.com
Hashing to Estimate the Size of a Stream
Problem: Estimate the number of distinct items in a data stream that is too large to fit in memory. Solution: (in python) Discussion: The technique used here is to use random hash functions. The ce…