from Hacker News

Top-K: A Probabilistic Addition to RedisBloom

by itamarhaber on 7/2/19, 6:31 PM with 1 comments

  • by gkorland on 7/4/19, 1:53 PM

    "In this benchmark, we extracted a list of the most common words in the book War and Peace, which contains over 500,000 words... Its accuracy was 100% in most cases, except for very high Ks where it ‘only’ achieved 99.9% accuracy"

    Interesting how such probabilistic data structures can in most cases behave almost as good as deterministic.