Discover the optimal Amazon EMR cluster design combining On-Demand and Spot Instances with Compute Savings Plans to minimize costs for daily critical batch processing tasks.
Table of Contents
Question
A solutions architect needs to review the design of an Amazon EMR cluster that is using the EMR File System (EMRFS). The cluster performs tasks that are critical to business needs. The cluster is running Amazon EC2 On-Demand Instances at all times for all task, primary, and core nodes. The EMR tasks run each morning, starting at 1:00 AM. and take 6 hours to finish running. The amount of time to complete the processing is not a priority because the data is not referenced until late in the day.
The solutions architect must review the architecture and suggest a solution to minimize the compute costs.
Which solution should the solutions architect recommend to meet these requirements?
A. Launch all task, primary, and core nodes on Spot Instances in an instance fleet. Terminate the cluster, including all instances, when the processing is completed.
B. Launch the primary and core nodes on On-Demand Instances. Launch the task nodes on Spot Instances in an instance fleet. Terminate the cluster, including all instances, when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
C. Continue to launch all nodes on On-Demand Instances. Terminate the cluster, including all instances, when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
D. Launch the primary and core nodes on On-Demand Instances. Launch the task nodes on Spot Instances in an instance fleet. Terminate only the task node instances when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
Answer
B. Launch the primary and core nodes on On-Demand Instances. Launch the task nodes on Spot Instances in an instance fleet. Terminate the cluster, including all instances, when the processing is completed. Purchase Compute Savings Plans to cover the On-Demand Instance usage.
Explanation
This option suggests launching the primary and core nodes on On-Demand Instances, which are necessary for the EMR cluster to function properly at all times. Task nodes, which are not essential for the cluster’s operation, can be launched on Spot Instances in an instance fleet, which are significantly cheaper than On-Demand Instances. This approach allows the cluster to leverage the cost benefits of Spot Instances for task nodes, while ensuring the reliability of the cluster with On-Demand Instances for primary and core nodes.
After the processing is completed, the entire cluster, including all instances, is terminated, which prevents any unnecessary costs from instances running idle. Additionally, purchasing Compute Savings Plans to cover the On-Demand Instance usage can provide significant cost savings, as Savings Plans offer a lower price than On-Demand pricing in exchange for a commitment to use a specific amount of compute power over a one or three year period.
Amazon AWS Certified Solutions Architect – Professional SAP-C02 certification exam practice question and answer (Q&A) dump with detail explanation and reference available free, helpful to pass the Amazon AWS Certified Solutions Architect – Professional SAP-C02 exam and earn Amazon AWS Certified Solutions Architect – Professional SAP-C02 certification.