1. 9
  1.  

  2. 6

    See also https://bravenewgeek.com/stream-processing-and-probabilistic-methods/ for a very concise explanations of those algorithms and data structures.

    1. 2

      The book is very good, with lots of great examples walking through how the algorithms work.

      1. 2

        Being the author of the book mentioned here, I personally like the explanation from @antirez (Redis creator) about HyperLogLog in Redis http://antirez.com/news/75

        1. 1

          Also PipelineDB (which is a postgres extention) has explicit aggregating functions, that expose probabilistic ds:

          http://docs.pipelinedb.com/probabilistic.html