from Hacker News

Druid: fast column-oriented distributed data store

by samueladam on 3/31/16, 9:27 PM with 47 comments

  • by tschellenbach on 3/31/16, 10:04 PM

    Druid is quickly becoming the leading open source solution for building highly scalable analytics. We evaluated it for getstream.io. Unfortunately, setup and maintenance are still very labour-intensive. For startups that's a concern. Many larger companies we spoke to were extremely happy about running Druid in production, though.
  • by NamTaf on 4/1/16, 2:24 AM

    The realtime ingestion is interesting, especially if I can still batch import. When processing machine data, I've found that many sources come in chunks (logfiles written out every 24 hours, for example), but the eventual aim is to migrate to realtime streaming (i.e., a data point every n seconds/minutes/etc. that you consume instantly).

    If this transition is easy without reworking infrastructure, the solution is far more attractive.

  • by jnordwick on 3/31/16, 10:21 PM

    Every open source column database I've seen is very poor: text, no decent array-oriented ability (give me the previous row), slow, JSON output, etc. When will somebody get it right?
  • by techwizrd on 4/1/16, 3:08 AM

    A friend of mine who interned with me at eBay used Druid and Angular to great success to build a tool for analysts to look at trends in our data. Druid is some seriously cool stuff.
  • by Exuma on 4/1/16, 2:11 PM

    Our 2-man team set up Druid. It took 5+ months and was excruciating to configure and get running smoothly (things were slightly more complicated because we decided to use Docker). It also took ~30 servers to make a truly fault-tolerant setup.

    With that said, it works very well, but it definitely came at the cost of a good dose of sanity.

  • by whitegrape on 3/31/16, 11:48 PM

    Has anyone done a meaningful private benchmark comparison with http://www.scylladb.com/ ? I didn't find one online.
  • by mrweasel on 4/1/16, 10:48 AM

    What's the advantage of a "column-oriented" data store/database?
  • by whitenoice on 4/1/16, 5:02 AM

    How does it compare to Vertica?
  • by sspring1 on 3/31/16, 9:55 PM

    Great, just what we need, a Druish database.
  • by rosalinekarr on 3/31/16, 11:13 PM

    Yeah, more database solutions, that's what we need.

    https://xkcd.com/927/