GitXplorerGitXplorer
M

Cuckoo_filter

public
2 stars
0 forks
1 issues

Commits

List of commits on branch master.
Verified
498cdf973e074b4c3ed9056517301f6768cc2d8a

Create LICENSE

MMikeHeaton committed 6 years ago
Unverified
a8da1da4871ecce99bee225f52763e7abf69baf6

does the switchhash function in a more elegant way

MMikeHeaton committed 8 years ago
Unverified
9f26c5a36e6bb64f2106b2b70ab2bb22ae3d0791

whitespace

MMikeHeaton committed 8 years ago
Unverified
d40347f71ef4c82b6a0dabf5e0b90c5009a47268

Update README.md

MMikeHeaton committed 8 years ago
Unverified
ccabeb5f72a1debbed34e34223c702cef9e74bee

Update README.md

MMikeHeaton committed 8 years ago
Unverified
22aff08be3a95a51680c47a838935ec1adc21e85

Update README.md

MMikeHeaton committed 8 years ago

README

The README file for this repository.

Cuckoo_filter

Python implementation of the cuckoo filter data structure. Provides efficient compression and membership-testing abilities with O(1) add, query and remove functions. Cool!

Tested on the complete works of Shakespeare: 67108 words added.
0/67108 failures witnessed. (0.0%)
67108/78000 registers filled. (86.04%)

###Description

Cuckoo filters are a space- and time- efficient structure for storing and querying very large sets of strings.
*They are great for testing whether an element is a member of a big set: query is O(1) time.
*They are great for adding elements to the big set: addition is O(1) time.
*They are better than, say, rainbow tables because we can remove elements as well... in O(1) time.
*They are great for space efficiency: really big elements are hashed down to a couple of bytes.

They are bad for printing out the big set once its been created. This isn't very possible unless we're using a reversible fingerprinting function (in which case we probably lose the space saving.)

More information here: https://www.cs.cmu.edu/~dga/papers/cuckoo-conext2014.pdf

Blog post upcoming on www.mike-heaton.com.