GitXplorerGitXplorer
n

pico-cuckoo-filter

public
14 stars
3 forks
0 issues

Commits

List of commits on branch master.
Unverified
6dbd19dc26c93fea93caa806f5254e4470ac15e1

Enable coverage

nnewhoggy committed 9 years ago
Unverified
af1e917a2cf02b8ac7ad9fc17094c317c6c831d9

Use Circle CI status badge.

nnewhoggy committed 9 years ago
Unverified
69d0278ddf72aeb1c08c78e731faf3508687577f

Use Circle CI

nnewhoggy committed 9 years ago
Unverified
b393409b56dc6d405073bfe427cf6917164207b6

Allow Fingerprint to be defined.

committed 9 years ago
Unverified
639a6a4066c6043974a138b866a1a625c5673e11

Add pico-cuckoo-filter-examples subproject and assembly sbt plugin.

committed 9 years ago
Unverified
118ab6a8d0e2b68cfb5c71949e1b0ad2e87c8980

Documentation.

committed 9 years ago

README

The README file for this repository.

Circle CI

Cuckoo filter

A densely-packed configurable cuckoo filter implemented in Scala.

SBT settings

resolvers ++= Seq(
  "dl-john-ky-releases"   at "http://dl.john-ky.io/maven/releases",
  "dl-john-ky-snapshots"  at "http://dl.john-ky.io/maven/snapshots")

libraryDependencies += "io.john-ky" %% "pico-cuckoo-filter" % "0.0.1-af1b672"

Usage

These are the imports required to use the cuckoo filter:

import org.pico.cuckoo.filter._
import org.pico.hash.syntax._
import org.pico.hash.{Hash64, Hashable}
import org.pico.twiddle.syntax.anyVal._

The cuckoo filter requires instances of the Hashable type-trait to defined for the filtered element type, for Long and for Fingerprint.

For example, the following is an implementation of these instances for the filtered element type String:

import scala.util.hashing.MurmurHash3

implicit val hashableString = new Hashable[String] {
  override def hash(a: String): Hash64 = Hash64(MurmurHash3.stringHash(a))
}

implicit val hashableLong = new Hashable[Long] {
  override def hash(a: Long): Hash64 = Hash64(MurmurHash3.arrayHash(Array(a)))
}

implicit val hashableFingerprint = new Hashable[Fingerprint] {
  override def hash(a: Fingerprint): Hash64 = a.value.hashed
}
val filter = new CuckooFilter(
    16, // The number fingerprints per bucket
    24.bits, // The number of bits in a finger print
    5, // The maximum number of kicks to attempt before failing an insert
    128) // The total number of buckets

The filter can then be used like this:

filter.insert("Element") // Returns true if the insertion was successful
filter.lookup("Element") // Returns true since the element was just inserted
filter.delete("Element") // Returns true since the element was still in the filter
filter.lookup("Element") // Returns false since the element has just been deleted
filter.delete("Element") // Returns false since the element has already been deleted

Complete sample

import org.pico.cuckoo.filter._
import org.pico.hash.syntax._
import org.pico.hash.{Hash64, Hashable, Hashable2}
import org.pico.twiddle.syntax.anyVal._
import scala.util.hashing.MurmurHash3

object Main {
  implicit val hashableString = new Hashable[String] {
    override def hash(a: String): Hash64 = Hash64(MurmurHash3.stringHash(a))
  }

  implicit val hashable2String = new Hashable2[String] {
    override def hash2(a: String): Hash64 = Hash64(JLong.reverse(MurmurHash3.stringHash(a)))
  }

  implicit val hashableLong = new Hashable[Long] {
    override def hash(a: Long): Hash64 = Hash64(MurmurHash3.arrayHash(Array(a)))
  }

  implicit val hashable2Long = new Hashable2[Long] {
    override def hash2(a: Long): Hash64 = Hash64(JLong.reverse(MurmurHash3.arrayHash(Array(a))))
  }

  implicit val hashableFingerprint = new Hashable[Fingerprint] {
    override def hash(a: Fingerprint): Hash64 = a.value.hashed
  }

  def main(args: Array[String]): Unit = {
    val filter = new CuckooFilter(
        16, // The number fingerprints per bucket
        24.bits, // The number of bits in a finger print
        5, // The maximum number of kicks to attempt before failing an insert
        128) // The total number of buckets

    val a = filter.insert("Element") // Returns true if the insertion was successful
    val b = filter.lookup("Element") // Returns true since the element was just inserted
    val c = filter.delete("Element") // Returns true since the element was still in the filter
    val d = filter.lookup("Element") // Returns false since the element has just been deleted
    val e = filter.delete("Element") // Returns false since the element has already been deleted

    println(s"$a $b $c $d $e")
  }
}

References