The KMV Sketch, Rejection Rules

Because the sketch never needs to store hash values greater than V(kth), the k-th smallest hash value it currently retains, we can automatically reject any incoming hash value greater than or equal to V(kth) up front.

Additionally, we can reject any incoming hash values that are duplicates of any of the hash values we already have in our KMV list.
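
The two rejection rules can be illustrated with a small sketch. This is a toy version for explanation only, not the library's implementation: the class name `KmvSketch`, the method `offer`, and the use of a sorted Python list are all assumptions made here, and hash values are taken to be floats in [0, 1) that have already been computed elsewhere.

```python
import bisect


class KmvSketch:
    """Toy KMV list (hypothetical): keeps the k smallest distinct hash values."""

    def __init__(self, k):
        self.k = k
        self.values = []  # sorted ascending, no duplicates

    def offer(self, h):
        """Return True if hash value h is accepted, False if it is rejected."""
        full = len(self.values) == self.k
        # Rule 1: once k values are held, reject anything >= V(kth).
        if full and h >= self.values[-1]:
            return False
        # Rule 2: reject duplicates of values already in the list.
        i = bisect.bisect_left(self.values, h)
        if i < len(self.values) and self.values[i] == h:
            return False
        self.values.insert(i, h)
        if len(self.values) > self.k:
            self.values.pop()  # the old V(kth) is no longer needed
        return True


sketch = KmvSketch(k=3)
results = [sketch.offer(h) for h in (0.42, 0.17, 0.80, 0.17, 0.95, 0.30, 0.42)]
print(results)        # [True, True, True, False, False, True, False]
print(sketch.values)  # [0.17, 0.3, 0.42]
```

In this run the second 0.17 is rejected as a duplicate, 0.95 is rejected because it exceeds V(kth) = 0.80, and the final 0.42 is rejected because by then it equals V(kth).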

