Why this course is worthwhile
This course covers the design and implementation of batch data analytics solutions with Amazon EMR, from data warehouses and data lakes to storing and optimizing large volumes of data. You will learn modern techniques for data ingestion, transformation, storage, compression, performance optimization, and scaling, and apply them in hands-on exercises.
Course content
This course is part of the Building Modern Data Analytics Solutions on AWS collection of four one-day, intermediate-level classroom training courses.
Program
- Compare the features and benefits of data warehouses, data lakes, and modern data architectures
- Design and implement a batch data analytics solution
- Identify and apply appropriate techniques, including compression, to optimize data storage
- Select and deploy appropriate options to ingest, transform, and store data
- And much more
Target audience
- Data platform engineers
- Architects and operators who build and manage data analytics pipelines
Prerequisites
- Students with at least one year of experience managing open-source data frameworks such as Apache Spark or Apache Hadoop will benefit from this course
- We suggest the AWS Hadoop Fundamentals course for those who need a refresher on Apache Hadoop
- We recommend that attendees of this course have:
  - Completed either AWS Technical Essentials or Architecting on AWS
  - Completed either Building Data Lakes on AWS or Getting Started with AWS Glue
