from Hacker News

Meilisearch expands search power with Arroy's filtered disk ANN

by Kerollmops on 12/24/23, 8:16 AM with 26 comments

  • by goestoo on 12/24/23, 4:51 PM

    I tested Meilisearch last year indexing 10mil documents it took 2 days! same dataset with the same server specs it took less than 2 hours to index in elasticsearch.
  • by ko_pivot on 12/24/23, 2:49 PM

    This is great. The only major thing Meilisearch is missing after hybrid search is introduced IMO is high availability. Without clustering, it’s hard to run any meaningful production workloads and there doesn’t seem to be an online upgrades story.
  • by marginalia_nu on 12/24/23, 11:24 PM

    > but we decided to go with RoaringBitmaps to reduce their size

    Interesting. Could you elaborate on the benefit of this?

    I've (possibly prematurely) discarded the notion of using RoaringBitmaps like this because while they use less memory, traversal is and mutation is so much slower using a fixed buffer out of a pool and considering an upper bounded slice of putative results at a time.

    Although this is for a search engine that typically deals with under-specified queries and is designed for best-effort retrieval given an upper computation time.

  • by wg0 on 12/24/23, 9:35 PM

    I'm thinking of putting this for software documentation. The idea is that content is all markdown in Astro or Seveltkit with SSG (static site generation) and then Index it all at build time and then search it all via Meliesearch.
  • by darkotic on 12/24/23, 11:04 PM

    I want to use meilisearch but it takes up more disk space than I'd like unless that has already improved.
  • by sgt101 on 12/24/23, 1:52 PM

    feels like product placement dressed as a blog