Skip to content

Latest commit

 

History

History
40 lines (32 loc) · 1.12 KB

TODO.md

File metadata and controls

40 lines (32 loc) · 1.12 KB

TODO

Services

  • Add plays to start/stop common services

Security

  • Use Hadoop Credential Provider for S3A credentials
  • Figure out SElinux policy for head node so we can leave it enabled
  • Create users instead of using root user
  • Verify checksums on tarballs

EMR Clone

  • Support configurable number of worker/core nodes
  • Support for scaling cluster after initial instantiation
  • Support for multiple ec2 clusters (filter ec2.py)
  • Install and configure Pig
  • Install and configure HUE

Hadoop

  • Update Hadoop to 2.8.2

Spark

  • Configure Spark History Server

Visibility

Performance

Support Kafka Streaming

  • Install and configure Zookeeper
  • Install and configure Kafka
  • Install and configure Secor