Building Data Pipelines with Apache Airflow

with Shashank Kalanithi
4.7/5
(3)

Design, Build, and Optimize Scalable Cloud Data Pipelines with AWS, Azure, and GCP

2 hours of content 68 students

$99.00

Lifetime access

Buy now
14-Day Money-Back Guarantee

What you get:

  • 2 hours of content
  • 20 Interactive exercises
  • World-class instructor
  • Closed captions
  • Q&A support
  • Future course updates
  • Course exam
  • Certificate of achievement

Building Data Pipelines with Apache Airflow

$99.00

Lifetime access

Buy now
14-Day Money-Back Guarantee

What you get:

  • 2 hours of content
  • 20 Interactive exercises
  • World-class instructor
  • Closed captions
  • Q&A support
  • Future course updates
  • Course exam
  • Certificate of achievement

$99.00

Lifetime access

Buy now

$99.00

Lifetime access

Buy now
14-Day Money-Back Guarantee

What you get:

  • 2 hours of content
  • 20 Interactive exercises
  • World-class instructor
  • Closed captions
  • Q&A support
  • Future course updates
  • Course exam
  • Certificate of achievement

What You Learn

  • Design and implement scalable cloud-based data pipelines using tools like Apache Airflow, AWS Glue, Azure Data Factory, and Google Cloud Composer
  • Master ETL/ELT processes and integrate data orchestration seamlessly across AWS, Azure, and GCP
  • Optimize cloud resources and manage costs with advanced strategies for logging, auto-scaling, and reserved capacity
  • Ensure data pipeline security with best practices like secrets management, role-based access controls, and data encryption
  • Troubleshoot pipeline failures, manage outages, and build resilient systems with effective monitoring and logging tools
  • Develop hands-on expertise in data storage and warehousing solutions, including AWS S3, Azure Blob Storage, and Google BigQuery

Top Choice of Leading Companies Worldwide

Industry leaders and professionals globally rely on this top-rated course to enhance their skills.

Course Description

Are you ready to master the art of building robust and efficient data pipelines in the cloud?

Are you interested in advancing your career in data engineering or improving your cloud expertise?

If so, our Building and Managing Data Pipelines in the Cloud course is perfect for you.

Learn the art of cloud-based data engineering from course instructor Shashank Kalanithi, a seasoned professional with extensive experience in the data and tech industry. Shashank has held various roles including data analyst, data scientist, data engineer, and is currently a software engineer at Meta. His passion for teaching, coupled with his hands-on expertise, ensures that you’ll receive not just knowledge but actionable insights that you can apply directly to your work.

Why is this course perfect for aspiring data engineers?

  • Gain an understanding of cloud-based data pipelines and how they differ from traditional on-prem systems.
  • Explore the three major cloud providers—AWS, Azure, and GCP—and the tools they offer for data engineering.
  • Build a strong foundation in data orchestration, transformation, storage, and monitoring using cloud-native tools.
  • Learn how to ensure cost efficiency, optimize resources, and maintain security for your cloud pipelines.

Why is this the perfect course for current data professionals?

  • Enhance your expertise in managing complex cloud pipelines and reducing costs through best practices.
  • Deepen your understanding of tools like AWS Glue, Azure Data Factory, Google Cloud Dataflow, and more.
  • Learn how to handle challenges like pipeline failures, outages, and scaling resources effectively.
  • Gain insights from Shashank’s real-world experience to tackle common pitfalls in cloud-based data engineering

Building and Managing Data Pipelines in the Cloud starts with an introduction to cloud computing fundamentals and the benefits of cloud data engineering. You will learn about key orchestration tools like Apache Airflow and cloud-native solutions such as AWS MWAA, Azure Data Factory, and Google Cloud Composer. The course covers everything from designing ETL/ELT pipelines to leveraging data storage solutions like AWS S3, Azure Blob Storage, and Google BigQuery.

You’ll also dive into advanced topics like:

  • Pipeline reliability and failure management
  • Cloud security practices like secrets management and role-based access controls
  • Cost management strategies to ensure pipelines are efficient and scalable
  • Tools for monitoring and logging such as AWS CloudWatch, Azure Monitor, and Datadog

Finally, you’ll explore real-world scenarios and case studies to understand how to create scalable, secure, and cost-effective pipelines that meet business needs.

This course is designed to prepare you for the challenges of cloud data engineering and to give you a comprehensive toolkit for success.

Get ready to revolutionize the way you think about data pipelines. Start your journey today!

Learn for Free

Introduction to data pipelines

1.1 Introduction to data pipelines

5 min

Data pipeline architecture

1.2 Data pipeline architecture

7 min

ETL vs. ELT

1.4 ETL vs. ELT

3 min

Designing a data pipeline

1.5 Designing a data pipeline

3 min

Introduction to Apache Airflow

2.1 Introduction to Apache Airflow

4 min

Curriculum

  • 1. Understanding data pipelines
    4 Lessons 18 Min

    Introduction to data pipelines
    5 min
    Data pipeline architecture
    7 min
    ETL vs. ELT
    3 min
    Designing a data pipeline
    3 min
  • 2. Hands-on with Apache Airflow
    10 Lessons 44 Min

    Introduction to Apache Airflow
    4 min
    Installation of Apache Airflow
    4 min
    Airflow UI
    7 min
    DAGs and tasks
    5 min
    Airflow architecture
    5 min
    Airflow operators
    4 min
    Airflow hooks
    2 min
    Introduction to the BashOperator
    7 min
    Introduction to the PythonOperator
    3 min
    Building an end-to-end pipeline
    3 min
  • 3. Advanced data pipeline concepts
    5 Lessons 34 Min

    Advanced data pipeline concepts
    7 min
    Pipeline failure
    4 min
    Ensuring data pipeline reliability
    11 min
    Backfilling pipelines
    7 min
    Change data capture
    5 min
  • 4. Building pipelines in the cloud
    4 Lessons 26 Min

    Building pipelines in the cloud
    8 min
    Security in the cloud
    6 min
    Cost management in the cloud
    8 min
    Managing outages in the cloud
    4 min

Topics

Data Engineering

Tools & Technologies

apache airflow

Course Requirements

  • Intro to Data Engineering
  • Intro to Python

Who Should Take This Course?

Level of difficulty: Beginner

  • Aspiring Data Engineers
  • Current Data Engineers
  • Data Analysts and Scientists
  • Cloud Enthusiasts
  • Tech-Savvy Business Professionals
  • tudents and Graduates

Exams and Certification

A 365 Data Science Course Certificate is an excellent addition to your LinkedIn profile—demonstrating your expertise and willingness to go the extra mile to accomplish your goals.

Exams and certification

Meet Your Instructor

Shashank Kalanithi

Shashank Kalanithi

Data Engineer at

3 Courses

636 Reviews

7618 Students

Shashank Kalanithi is data engineer at Meta. His previous experience includes being a senior data analyst at the fashion retailer Nordstrom, where he worked on ML solutions to help augment the data team’s capabilities. He designed tools and dashboards that optimize the workflow and gather valuable data on the company’s numerous locations. Shashank also runs his own data analyst service where he helps companies organize, study, and extract insights to increase in-house efficiency and profitability. His YouTube channel, which he started in 2020, has accumulated over 149K subscribers.

What Our Learners Say

365 Data Science Is Featured at

Our top-rated courses are trusted by business worldwide.