tree: a53c0436104589473892da8ed955e9b23a9a955a [path history] [tgz]
  1. aws_ec2_table.csv
  2. README.md
  3. run_systemds_script.sh
  4. spinup_systemds_cluster.sh
  5. systemds_cluster.config
  6. terminate_systemds_cluster.sh
scripts/aws/README.md

Instructions:

  1. Create aws account / use your existing aws account

  2. Install aws-cli on your system

(https://docs.aws.amazon.com/cli/latest/userguide/install-cliv2-linux.html)

  1. Create a user

    • Create a new user (https://console.aws.amazon.com/iam/home?#/users)

    • Create new group and add the following policies to it:

      • AmazonElasticMapReduceRole

      • AmazonElasticMapReduceforEC2Role

      • AdministratorAccess

      • AmazonElasticMapReduceFullAccess

      • AWSKeyManagementServicePowerUser

      • IAMUserSSHKeys

  2. Configure your aws-cli (https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html#cli-quick-configuration)

  3. Spin up an EMR cluster with SystemDS

    • Put your SystemDS artifacts (dml-scripts, jars, config-file) in the directory systemds

    • Edit configuration in: systemds_cluster.config

    • Run: ./spinup_systemds_cluster.sh

  4. Run a SystemDS script

    • Run: ./run_systemds_script.sh path/to/script.dml With args: ./run_systemds_script.sh path/to/script.dml “1.0, 2.6”
  5. Terminate the EMR cluster: ./terminate_systemds_cluster.sh

Further work