When you enroll in this course, you'll also be enrolled in this Professional Certificate.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate from DeepLearning.AI
There are 4 modules in this course
In this course, you will explore various types of source systems, learn how they generate and update data, and troubleshoot common issues you might encounter when trying to connect to these systems in the real world. You’ll dive into the details of common ingestion patterns and implement batch and streaming pipelines. You’ll automate and orchestrate your data pipelines using infrastructure as code and pipelines as code tools. You’ll also explore AWS and open source tools for monitoring your data systems and data quality.
In lesson 1, you will explore source systems data engineers commonly interact with. Then in lesson 2, you will learn how to connect to various source systems and troubleshoot common connectivity issues.
Lab Walkthrough - Troubleshooting Database Connectivity on AWS•13 minutes
Week 1 Summary•1 minute
9 readings•Total 45 minutes
Program Syllabus•5 minutes
[IMPORTANT] Guidelines before you start the labs in this course•10 minutes
[Optional] FAQ VS Code Lab Environment•5 minutes
Join the DeepLearning.AI Forum to ask questions, get support, or share amazing ideas!•2 minutes
[Optional] Connecting to an Amazon RDS MySQL Database•10 minutes
Basics of AWS IAM•10 minutes
[Optional] AWS Networking Overview- VPC•1 minute
Week 1 Resources•1 minute
Lecture Notes W1•1 minute
1 assignment•Total 30 minutes
Week 1 Quiz•30 minutes
1 programming assignment•Total 120 minutes
Assignment 1: Troubleshooting Database Connectivity on AWS•120 minutes
3 ungraded labs•Total 360 minutes
Practice Lab 1: Interacting With a Relational Database Using SQL•120 minutes
Practice Lab 2: Interacting With Amazon DynamoDB NoSQL Database•120 minutes
Practice Lab 3: Interacting With Amazon S3 Object Storage•120 minutes
Data Ingestion
Week 2•6 hours to complete
Module details
This week you will dive deep into the batch and streaming ingestion patterns. You will identify use cases and considerations for each, and then build a batch and a streaming ingestion pipeline. When looking at batch ingestion, you will compare and contrast the ETL and ELT paradigms. You will also explore various AWS services for batch and streaming ingestion.
Lab Walkthrough - Batch Processing to Get Data From an API•10 minutes
Conversation with a Software Engineer•5 minutes
Streaming Ingestion Details•6 minutes
Kinesis Data Streams Details•6 minutes
Lab Walkthrough - Streaming Ingestion•6 minutes
Week 2 Summary•2 minutes
6 readings•Total 32 minutes
Batch and Streaming Tools•10 minutes
Summary of the Differences: ETL vs. ELT•5 minutes
What is Change Data Capture (CDC)?•10 minutes
Summary: General Considerations for Choosing Ingestion Tools•5 minutes
Week 2 Resources•1 minute
Lecture Notes W2•1 minute
1 assignment•Total 30 minutes
Week 2 Quiz•30 minutes
1 programming assignment•Total 120 minutes
Assignment 2: Batch Data Processing from an API•120 minutes
1 ungraded lab•Total 120 minutes
Practice Lab: Streaming Ingestion•120 minutes
DataOps
Week 3•9 hours to complete
Module details
In the first lesson, you will explore DataOps automation practices, including applying CI/CD to both data and code, and using infrastructure as code tools like Terraform to automate the provisioning and management of your resources. Then in lesson 2, you will explore DataOps observability and monitoring practices, including using tools like Great Expectation to monitor data quality, and using Amazon CloudWatch to monitor your infrastructure.
Assignment 3: Testing Data Quality with Great Expectations•120 minutes
2 ungraded labs•Total 240 minutes
Practice Lab 1: Implementing DataOps with Terraform•120 minutes
Practice Lab 2: Implementing Monitoring with Amazon CloudWatch•120 minutes
Orchestration, Monitoring, and Automating Your Data Pipelines
Week 4•8 hours to complete
Module details
This week, you will learn all about orchestrating your data pipeline tasks. You'll identify the various orchestration tools, but will focus on Airflow -- one of the most popular and widely used tools in the field today. You'll explore the core components of Airflow, the Airflow UI, and how to create and manage DAGs using various Airflow features.
DeepLearning.AI is an education technology company that develops a global community of AI talent.
DeepLearning.AI's expert-led educational experiences provide AI practitioners and non-technical professionals with the necessary tools to go all the way from foundational basics to advanced application, empowering them to build an AI-powered future.
Since 2006, Amazon Web Services has been the world’s most comprehensive and broadly adopted cloud platform. AWS offers over 90 fully featured services for compute, storage, networking, database, analytics, application services, deployment, management, developer, mobile, Internet of Things (IoT), Artificial Intelligence, security, hybrid and enterprise applications, from 44 Availability Zones across 16 geographic regions. AWS services are trusted by millions of active customers around the world — including the fastest-growing startups, largest enterprises, and leading government agencies — to power their infrastructure, make them more agile, and lower costs.
Coursera and AWS have been partners since 2017 providing learners and enterprises globally, the skills they need to succeed. Coursera builds on AWS servers to scale with student demand with confidence around capacity and elasticity and in partnership with AWS. In 2019, Coursera achieved Advanced Tier Partner status and further extended the partnership with AWS Educate, AWS EdStart and AWS Academy collaborations.
Coursera's been able to make cloud skills more accessible with 8 AWS courses on the Coursera platform featuring top subject matter experts and the portfolio continues to grow.
To learn more about AWS, visit https://aws.amazon.com.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.