bigdata: All content about bigdata in NoSQL databases and polyglot persistence

Stripe's Hadoop tools open sourced

Stripe has published on GitHub four Hadoop-related projects that it developed internally:

  1. a dashboard for Hadoop jobs
  2. a Scala framework for distributed learning
  3. a database for serving data in SequenceFile format
  4. a collection of command-line utilities.

As a side note, Stripe is using Cloudera Impala with Parquet.

Original title and link: Stripe’s Hadoop tools open sourced (NoSQL database©myNoSQL)

via: https://stripe.com/blog/four-new-hadoop-projects