This code helps you jump-start PySpark with Anaconda on AWS using Terraform.
This project is no longer actively maintained and has been archived on Jan 7, 2025.
- Install Terraform on Mac: `brew install terraform`
- Adjust the scripts (`bootstrap_actions.sh` and `pyspark_quick_setup.sh`) in `scripts` if necessary
- Set parameters in `terraform.tfvars`
- Start cluster: `terraform init`, then `terraform apply`
- Destroy cluster: `terraform destroy`
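
The start and destroy steps can also be driven from a small script. The following is only a sketch, not part of this repository: the helper names `missing_prerequisites` and `run_workflow` are made up for illustration, and the script assumes it runs from the directory that contains `terraform.tfvars`. By default it only reports which commands it would run.

```python
import shutil
import subprocess
from pathlib import Path

# Command sequences taken from the steps listed above.
WORKFLOWS = {
    "start": [["terraform", "init"], ["terraform", "apply"]],
    "destroy": [["terraform", "destroy"]],
}


def missing_prerequisites(workdir="."):
    """Return a list of problems; an empty list means the basics are in place."""
    problems = []
    if shutil.which("terraform") is None:
        problems.append("terraform not found on PATH")
    if not (Path(workdir) / "terraform.tfvars").is_file():
        problems.append("terraform.tfvars not found in " + str(workdir))
    return problems


def run_workflow(name, workdir=".", dry_run=True):
    """Return the commands for a workflow; execute them unless dry_run is set."""
    commands = WORKFLOWS[name]
    if not dry_run:
        for cmd in commands:
            # check=True stops at the first command that fails.
            subprocess.run(cmd, cwd=workdir, check=True)
    return [" ".join(cmd) for cmd in commands]


if __name__ == "__main__":
    issues = missing_prerequisites()
    if issues:
        print("Not ready:", "; ".join(issues))
    else:
        print("Would run:", run_workflow("start"))
```

Calling `run_workflow("start", dry_run=False)` would actually execute the commands. Terraform still asks for interactive confirmation on `apply` and `destroy`, so the script does not bypass that prompt.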
- Configure AWS on your local machine: `aws configure`
- AWS instance cost for `eu-central-1`
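
To check whether `aws configure` has already been run, one option is to look for a profile in the shared credentials file. This is a minimal sketch under two assumptions: the credentials file is in its default location (`~/.aws/credentials`), and the function name `configured_profiles` is hypothetical. Credentials can also be supplied in other ways, for example through environment variables, so a missing file is not conclusive.

```python
import configparser
from pathlib import Path


def configured_profiles(credentials_path=None):
    """Return the profile names found in an AWS shared credentials file."""
    if credentials_path is None:
        # Assumed default location written by `aws configure`.
        credentials_path = Path.home() / ".aws" / "credentials"
    parser = configparser.ConfigParser()
    # read() silently skips files that do not exist, so a missing file yields [].
    parser.read(credentials_path)
    return parser.sections()


if __name__ == "__main__":
    if "default" in configured_profiles():
        print("Found a default AWS profile.")
    else:
        print("No default profile found; consider running `aws configure`.")
```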
- Dat Tran, github: datitran
See LICENSE for details.