Implement collision_probability for LSHFunction subtypes #6

Closed
@kernelmethod

Description

SimHash, MinHash, and LpHash all currently implement single_hash_collision_probability (MIPSHash and SignALSH do not yet; see issue #4). There should also be a function collision_probability(::LSHFunction, sim) that computes the probability that the full hash (i.e. the vector of individual hashes) generated by an LSHFunction is identical for two inputs with similarity sim.

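Note that wherever single_hash_collision_probability is already implemented, this is relatively easy: assuming the individual hashes are generated independently, the overall probability is the single-hash probability raised to the number of hashes: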

collision_probability(hashfn::LSHFunction, sim::Real) =
    single_hash_collision_probability(hashfn, sim)^nhashes(hashfn)
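
To make the proposal concrete, here is a minimal self-contained sketch. It does not use the package's actual types: the abstract type is redeclared locally, and ToySimHash is a hypothetical stand-in that only stores the number of hashes. It assumes the standard SimHash result that a single hash collides with probability 1 - θ/π, where θ is the angle between the inputs and sim is their cosine similarity.

```julia
# Stand-in for the package's abstract type, redeclared here so that the
# sketch runs on its own (assumption: the real type lives in the package).
abstract type LSHFunction end

# Hypothetical toy hash function: it only records how many individual
# hashes make up the full hash.
struct ToySimHash <: LSHFunction
    n::Int
end

nhashes(hashfn::ToySimHash) = hashfn.n

# Single-hash collision probability for a SimHash-style function, with
# sim taken to be the cosine similarity of the two inputs.
single_hash_collision_probability(::ToySimHash, sim::Real) =
    1 - acos(sim) / π

# Generic fallback proposed in this issue: with independent individual
# hashes, the full hash collides only if every individual hash does.
collision_probability(hashfn::LSHFunction, sim::Real) =
    single_hash_collision_probability(hashfn, sim)^nhashes(hashfn)

p_orthogonal = collision_probability(ToySimHash(4), 0.0)
p_identical  = collision_probability(ToySimHash(4), 1.0)
```

For orthogonal inputs (sim = 0) each hash collides with probability 1/2, so four hashes give (1/2)^4. Because the fallback is defined on the abstract type, any subtype that implements single_hash_collision_probability and nhashes would pick it up without further work.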
