This project isn’t brand new, but I recently discovered it and even deployed it in production for a difficult-to-satisfy use case.
Bloom filters are a handy data structure, and implementations tend to exist for every language. But many use cases require scalable (resizable) bloom filters, persistence, and an efficient server protocol – think clusters that do web crawling or analytics.
bloomd can make these applications very simple. The C code is also clearly-written. The Python driver also includes a naive round robin sharding scheme and multi-server support. Definitely worth looking at!