📖 What is Amazon Glacier?
Amazon Glacier (now branded Amazon S3 Glacier, with the original tier named S3 Glacier Flexible Retrieval) is a low-cost archive storage service designed for infrequently accessed data. It provides durable and secure storage with retrieval times ranging from minutes to hours. Glacier offers three retrieval options (Expedited, Standard, and Bulk), each trading access speed against cost.
"Glacier’s cost structure and retrieval times are frequently tested. Understand the trade-offs between retrieval speed and cost. Be aware of Glacier Deep Archive, which offers even lower storage costs but longer retrieval times. Distinguish Glacier from S3 Standard and S3 Glacier Instant Retrieval based on access patterns and cost."
📚 Certification: AWS Certified Solutions Architect - Associate (SAA-C03)
🔑 What are the Key Concepts of Amazon Glacier?
- ▸ Glacier is optimized for long-term data archiving, offering very low storage costs in exchange for significant retrieval latency.
- ▸ Retrieval costs can exceed storage costs, especially with Expedited retrieval; carefully consider access patterns before archiving.
- ▸ Glacier Deep Archive provides even lower storage costs than Glacier, but with retrieval times measured in hours (typically within 12 hours for Standard and within 48 hours for Bulk), not minutes.
- ▸ Lifecycle policies in S3 can automatically transition objects to Glacier or Deep Archive based on object age, with rules optionally scoped by prefix, tag, or object size.
- ▸ The original Glacier API organizes archives into vaults, with vault-level policies controlling access; objects moved to Glacier storage classes through S3 stay in their bucket and are governed by S3 permissions.
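To make the point about retrieval costs concrete, the sketch below compares a one-time retrieval charge with the monthly storage charge for the same data. The per-GB prices are illustrative placeholders chosen for the arithmetic, not current AWS pricing, and the function names are hypothetical.

```python
# Illustrative placeholder prices in USD. These are assumptions for the
# arithmetic only; check the AWS pricing page for real values.
ASSUMED_STORAGE_PRICE_PER_GB_MONTH = 0.004
ASSUMED_RETRIEVAL_PRICE_PER_GB = {
    "Expedited": 0.03,
    "Standard": 0.01,
    "Bulk": 0.0025,
}


def monthly_storage_cost(size_gb: float) -> float:
    """Cost of keeping size_gb archived for one month."""
    return size_gb * ASSUMED_STORAGE_PRICE_PER_GB_MONTH


def retrieval_cost(size_gb: float, tier: str) -> float:
    """One-time cost of retrieving size_gb with the given tier."""
    return size_gb * ASSUMED_RETRIEVAL_PRICE_PER_GB[tier]


def retrieval_in_storage_months(tier: str) -> float:
    """Months of storage that one full retrieval costs (data size cancels out)."""
    return ASSUMED_RETRIEVAL_PRICE_PER_GB[tier] / ASSUMED_STORAGE_PRICE_PER_GB_MONTH


if __name__ == "__main__":
    for tier in ASSUMED_RETRIEVAL_PRICE_PER_GB:
        months = retrieval_in_storage_months(tier)
        print(f"{tier}: one retrieval costs about {months:.1f} months of storage")
```

Under these assumed prices, a single Expedited retrieval of an archive costs as much as several months of storing it, which is why access patterns matter before archiving.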
🎯 How does Amazon Glacier appear on the SAA-C03 Exam?
You may be asked to identify the most cost-effective storage class for data that needs to be retained for seven years for compliance purposes and is rarely accessed.
A scenario might describe a company needing to archive old log files; determine the best storage option considering infrequent access and potential retrieval needs.
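Expect questions about configuring S3 lifecycle policies to automatically move data to Glacier a specified number of days after object creation to reduce storage costs. Note that lifecycle transitions are driven by object age, not by inactivity; access-based tiering is the job of S3 Intelligent-Tiering.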
❓ Frequently Asked Questions
When would I choose Glacier Deep Archive over standard Glacier?
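Choose Deep Archive when data is rarely, if ever, accessed and you can tolerate retrieval times of roughly 12 hours (Standard) to 48 hours (Bulk). It offers the lowest storage cost of any S3 storage class, at the price of the longest retrieval latency and a 180-day minimum storage duration.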
How do lifecycle policies interact with Glacier storage classes?
Lifecycle policies automate the transition of S3 objects to Glacier or Deep Archive based on defined rules. This helps optimize storage costs by moving infrequently accessed data to cheaper tiers.
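As a rough illustration of such a rule, the sketch below builds a lifecycle configuration as a plain Python dictionary in the shape that boto3's `put_bucket_lifecycle_configuration` accepts. The helper name `build_archive_rule`, the `logs/` prefix, the day thresholds, and the bucket name in the comment are all hypothetical choices for this example.

```python
import json


def build_archive_rule(prefix, glacier_after_days, deep_archive_after_days, expire_after_days):
    """Build one S3 lifecycle rule that tiers objects down by age, then expires them."""
    if not 0 < glacier_after_days < deep_archive_after_days < expire_after_days:
        raise ValueError("day thresholds must be positive and strictly increasing")
    return {
        "ID": f"archive-{prefix.strip('/') or 'all'}",
        "Filter": {"Prefix": prefix},
        "Status": "Enabled",
        "Transitions": [
            # Days are counted from object creation, not from last access.
            {"Days": glacier_after_days, "StorageClass": "GLACIER"},
            {"Days": deep_archive_after_days, "StorageClass": "DEEP_ARCHIVE"},
        ],
        "Expiration": {"Days": expire_after_days},
    }


# Hypothetical example: logs move to Glacier at 90 days, to Deep Archive at
# 365 days, and are deleted after about seven years (7 * 365 = 2555 days).
lifecycle_configuration = {"Rules": [build_archive_rule("logs/", 90, 365, 2555)]}

if __name__ == "__main__":
    print(json.dumps(lifecycle_configuration, indent=2))
    # Applying the configuration needs boto3 and AWS credentials (not run here):
    # import boto3
    # boto3.client("s3").put_bucket_lifecycle_configuration(
    #     Bucket="example-bucket",  # hypothetical bucket name
    #     LifecycleConfiguration=lifecycle_configuration,
    # )
```

Keeping the rule as data makes it easy to review the thresholds before anything is sent to AWS; the strictly increasing check simply guards against listing the tiers out of order.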
What impact do retrieval options have on overall cost?
Expedited retrieval is the fastest but most expensive. Standard retrieval offers a balance, while Bulk retrieval is the cheapest but slowest. Expedited retrieval is not available for Deep Archive. Choosing the right option depends on how long you can wait for the data and on your budget.
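One way to reason about this choice is to pick the cheapest tier that still meets your deadline. The sketch below does that and returns a dictionary in the shape of the `RestoreRequest` argument to boto3's `restore_object`. The upper-bound retrieval times are assumptions based on commonly cited AWS figures, and the function names are hypothetical.

```python
# Assumed upper-bound retrieval times in hours per storage class and tier.
# These are commonly cited figures, treated here as assumptions.
TYPICAL_MAX_HOURS = {
    "GLACIER": {"Bulk": 12, "Standard": 5, "Expedited": 5 / 60},
    "DEEP_ARCHIVE": {"Bulk": 48, "Standard": 12},  # no Expedited tier
}

# Tiers ordered from cheapest to most expensive.
CHEAPEST_FIRST = ["Bulk", "Standard", "Expedited"]


def choose_tier(storage_class: str, max_wait_hours: float) -> str:
    """Return the cheapest tier whose assumed worst case fits the deadline."""
    tiers = TYPICAL_MAX_HOURS[storage_class]
    for tier in CHEAPEST_FIRST:
        if tier in tiers and tiers[tier] <= max_wait_hours:
            return tier
    raise ValueError(
        f"no {storage_class} retrieval tier completes within {max_wait_hours} hours"
    )


def build_restore_request(storage_class: str, max_wait_hours: float, keep_days: int = 7) -> dict:
    """Build a RestoreRequest-shaped dict; keep_days is how long the restored copy stays available."""
    return {
        "Days": keep_days,
        "GlacierJobParameters": {"Tier": choose_tier(storage_class, max_wait_hours)},
    }


if __name__ == "__main__":
    print(build_restore_request("GLACIER", max_wait_hours=24))
    print(build_restore_request("DEEP_ARCHIVE", max_wait_hours=24))
```

With a 24-hour deadline this selects Bulk for Glacier but Standard for Deep Archive, since Deep Archive Bulk can take up to 48 hours under the assumed figures.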