A Little Ludwig Goes a Long Way

A smattering of opinions on technology, books, business, and culture. Now in its 4th technology iteration.

Confused about which Hadoop distribution to use

16 June 2013

I have been playing around with Hadoop a little, and I am completely confused about where to start:

  • Apache Hadoop.
  • Hortonworks.
  • Cloudera.
  • Pivotal. Not yet publicly available, but there is an early access program.
  • Intel. Probably comes with the standard licensing restrictions on Intel software, so probably not viable.
  • AWS. Except I want to host locally.
  • And yet more distributions.

More confusing than choosing a Linux distro.