Confused about which Hadoop distribution to use
Playing around with Hadoop a little and I am completely confused about where to start:
- Apache Hadoop.
- Hortonworks.
- Cloudera.
- Pivotal. Not yet available publicly but early access program available
- Intel. Probably the standard licensing restrictions on Intel software, probably not viable.
- AWS. Except I want to host locally.
- And yet more distributions.
More confusing than choosing a linux distro.