Conclusions and cleanup

Congratulations! You have reached the end of the workshop. In this workshop, you learned why you need to be flexible with EC2 instance types when using Spot Instances, and how to size your Spark executors to allow for this flexibility. You ran a Spark application solely on Spot Instances using EMR Instance Fleets, verified the results of the application, and saw the cost savings that you achieved by running the application on Spot Instances.


Follow the cleanup steps that apply to where you are running the workshop. If you are running it in your own AWS account:

  1. In the EMR Management Console, check that the cluster is in the Terminated state. If it isn’t, then you can terminate it from the console.
  2. Go to the Cloud9 Dashboard and delete your environment.
  3. Delete the VPC you deployed via CloudFormation: go to the CloudFormation service in the AWS Management Console, select the VPC stack (the default name is Quick-Start-VPC), and click the Delete option. Make sure that the deletion has completed successfully (this should take around 1 minute): the status of the stack will be DELETE_COMPLETE, and the stack will move to the Deleted list of stacks.
  4. Delete your S3 bucket from the AWS Management Console: choose the bucket from the list of buckets and click the Delete button. This approach will also empty the bucket, deleting all existing objects in it.
  5. Delete the Athena table: go to the Athena service in the AWS Management Console, find the emrworkshopresults Athena table, click the three dots icon next to the table, and select Delete table.
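
If you prefer to script the cleanup instead of clicking through the console, the steps above could be sketched roughly as follows. This is a minimal sketch and not part of the workshop itself: it only builds and prints the equivalent AWS CLI commands (a dry run) unless you explicitly ask it to execute them. The cluster ID, Cloud9 environment ID, and bucket name are hypothetical placeholders that you would replace with your own values. It also assumes that the Athena table is stored in the AWS Glue Data Catalog in a database named default, so it removes the table with a Glue call rather than through the Athena console.

```python
import shlex
import subprocess


def build_cleanup_commands(cluster_id, cloud9_env_id, bucket,
                           stack_name="Quick-Start-VPC",
                           athena_table="emrworkshopresults",
                           glue_database="default"):
    """Return the cleanup steps as AWS CLI argument lists, in the order of the list above.

    glue_database="default" is an assumption; adjust it to the database
    that actually holds your table.
    """
    return [
        # 1. Terminate the EMR cluster (can be skipped if it is already Terminated).
        ["aws", "emr", "terminate-clusters", "--cluster-ids", cluster_id],
        # 2. Delete the Cloud9 environment.
        ["aws", "cloud9", "delete-environment", "--environment-id", cloud9_env_id],
        # 3. Delete the VPC stack, then wait until the deletion has completed.
        ["aws", "cloudformation", "delete-stack", "--stack-name", stack_name],
        ["aws", "cloudformation", "wait", "stack-delete-complete",
         "--stack-name", stack_name],
        # 4. Remove the bucket; --force deletes its objects first
        #    (versioned objects are not removed by this option).
        ["aws", "s3", "rb", f"s3://{bucket}", "--force"],
        # 5. Drop the table from the Glue Data Catalog (assumed location of the Athena table).
        ["aws", "glue", "delete-table", "--database-name", glue_database,
         "--name", athena_table],
    ]


def run_cleanup(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical placeholder values; replace them with your own resources.
    commands = build_cleanup_commands(
        cluster_id="j-XXXXXXXXXXXXX",
        cloud9_env_id="your-cloud9-environment-id",
        bucket="your-workshop-bucket",
    )
    run_cleanup(commands, dry_run=True)
```

Review the printed commands first, and only call run_cleanup with dry_run=False once you are sure they point at the resources you created for this workshop.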

If you are running the workshop in an AWS event, then you do not need to take any cleanup steps. The account will be deleted automatically.

Thank you

We hope you found this workshop educational, and that it will help you adopt Spot Instances into your Spark applications running on Amazon EMR, in order to optimize your costs.
If you have any feedback or questions, click the “Feedback / Questions?” link in the left pane to reach out to the authors of the workshop.

Other Resources:

Visit the Amazon EMR on EC2 Spot Instances page for more information, customer case studies and videos.
Read the blog post: Best practices for running Apache Spark applications using Amazon EC2 Spot Instances with Amazon EMR
Watch the AWS Online Tech-Talk: Best Practices for Running Spark Applications Using Spot Instances on EMR - AWS Online Tech Talks