Tags
Language
Tags
December 2024
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31 1 2 3 4

Build Data Engineering Pipelines With Azure Data Factory

Posted By: ELK1nG
Build Data Engineering Pipelines With Azure Data Factory

Build Data Engineering Pipelines With Azure Data Factory
Published 2/2023
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz
Language: English | Size: 3.31 GB | Duration: 7h 39m

Get hands on and build a modern data lake with Azure Data Factory, Data Lake Storage, Azure SQL and more!

What you'll learn

Overview and Background of Azure Big Data Analytics Solutions

Azure Data Factory

Azure Storage Accounts

Azure SQL Databases and Upsert Operations

Integrating Data From Various File Formats

Authentication and Access: Service Principals, Managed Identities and Azure Key Vault

Data Pipelines, Copy Activity, Data Flows, Control Flow and Transformation Activities

Implementing a Modern Data Lake

Control Flow, Parameters and Variables

Real World Application of Azure Data Factory

Requirements

Microsoft Azure Subscription

Basic SQL

Description

Hello students!I am pleased to present this course on one of the most in demand data engineering tools around… Azure Data Factory!As the demand for cloud-based data integration services continues to skyrocket, there is a huge demand for professionals with knowledge of services like Azure Data Factory. By learning Azure Data Factory, users can enhance their skills and increase their job prospects in the field of data engineering and analytics.In this course you will primarily be using Azure Data Factory on Microsoft Azure in addition to other services such as Azure Blob Storage, Azure Data Lake Storage Gen 2 and Azure SQL Database.The course is packed with lectures, code-along videos and a dedicated course project. As an added benefit you will also have lifetime access to all of the lectures…This course will cover the following topics:Azure Storage Solutions such as Azure Blob Storage and Azure Data Lake Gen2 StorageThe basics of Azure Data Factory including the core components such as Linked Services, Datasets, Activities, Data Flows, Pipelines and Integration RuntimesIntegrating data from various file formats such as CSV, JSON and ParquetThe Copy Activity in Azure Data FactoryData Flows and Control Flow and Transformation Activities in Azure Data FactoryOrchestrating Data Integration WorkflowsHow to create Schedules and Triggers to execute your pipelinesHow to use Parameters and Variables with your Linked Services, Datasets and PipelinesHow to use Azure Data Factory with SQL DatabasesAuthentication and Access including Managed Identities, Service Principals and Azure Key Vault

Overview

Section 1: Course Overview and Set Up

Lecture 1 Course Overview

Lecture 2 Azure Big Data Analytics

Lecture 3 Azure Account Set Up

Lecture 4 Azure UI Overview and Creating Your Subscription

Lecture 5 Services and Regions

Lecture 6 Naming Conventions and Creating the Course Resource Group

Lecture 7 Cost Management and Pricing

Lecture 8 Links and Resources

Section 2: Azure Storage

Lecture 9 Azure Storage Solutions Overview

Lecture 10 Classification of Data

Lecture 11 ADF Supported File Formats

Lecture 12 Customer Orders Dataset Download and Overview

Lecture 13 Blueprint for the Data Lake

Lecture 14 Storage Account Creation

Lecture 15 Container Creation and File Upload

Lecture 16 Links and Resources

Section 3: ADF - Overview and Set Up

Lecture 17 Azure Data Factory Overview

Lecture 18 ADF Resource Creation

Lecture 19 ADF User Interface and Components Overview

Lecture 20 Links and Resources

Section 4: Data Movement

Lecture 21 Copy Activity Overview

Lecture 22 Section Objective

Lecture 23 Copy Activity - Order Items Part 1 (Linked Services and Source/Sink Datasets)

Lecture 24 Copy Activity - Order Items Part 2 (Copy Activity and Pipeline)

Lecture 25 Copy Activity - Customers JSON

Lecture 26 Copy Activity - Orders Parquet

Lecture 27 Copy Activity - Stores and Products JSON

Lecture 28 Organising Data Factory Objects in Folders

Lecture 29 Chaining Activities

Lecture 30 Working with Multiple Files as a Source and Copy Behaviour

Lecture 31 Links and Resources

Section 5: Data Transformation

Lecture 32 Data Flow Overview

Lecture 33 Section Objective

Lecture 34 Data Flow Demo and User Interface

Lecture 35 Data Flow Debug (Pricing)

Lecture 36 Data Flow Debug (Row Limit)

Lecture 37 Source and Sink Partitioning

Lecture 38 Select Transformation

Lecture 39 Cast Transformation

Lecture 40 Import Projection

Lecture 41 Derived Column Transformation

Lecture 42 Raw to Cleansed Data Flow Requirement

Lecture 43 Raw to Cleansed Data Flow Walkthrough

Lecture 44 Partitioned Datasets

Lecture 45 Filter and Sort Transformations

Lecture 46 Aggregate Transformation

Lecture 47 Join Transformation

Lecture 48 Conditional Split and Union Transformations

Lecture 49 Cleansed to Structured Data Flow Requirement

Lecture 50 Cleansed to Structured Data Flow Walkthrough

Lecture 51 Structured to Analytics Data Flow Requirement

Lecture 52 Structured to Analytics Data Flow Walkthrough

Lecture 53 Links and Resources

Section 6: Scheduling and Chaining Pipelines for Execution

Lecture 54 Execute Pipeline Activity

Lecture 55 Schedule Triggers

Lecture 56 Storage Events Trigger

Lecture 57 Tumbling Window Triggers

Lecture 58 Links and Resources

Section 7: Control Flow, Parameters and Variables

Lecture 59 Data Lake Preparation

Lecture 60 Delete Activity

Lecture 61 Get Metadata Activity

Lecture 62 Lookup Activity

Lecture 63 If Condition and Switch Activities

Lecture 64 Introduction to Parameters in ADF

Lecture 65 Introduction to Variables in ADF

Lecture 66 ForEach Activity

Lecture 67 Metadata Based Approach (Example)

Lecture 68 Course Project Activity: Dynamic Datasets and Folder Structure

Lecture 69 Links and Resources

Section 8: Azure SQL Database for Structured Data

Lecture 70 Section Overview

Lecture 71 Creating our Azure SQL Resource

Lecture 72 Creating our Schema and Tables

Lecture 73 Adding a Unique Identifier Column to Structured Orders Data

Lecture 74 Creating Linked Service and Datasets for SQL

Lecture 75 Important Note Regarding the Dynamic Datasets

Lecture 76 Copy Activity with SQL Sink

Lecture 77 Data Flow with SQL Sink

Lecture 78 Course Project Activity: Storing our Structured Data in the SQL Database

Lecture 79 Links and Resources

Section 9: Tumbling Window Triggers

Lecture 80 Tumbling Window Triggers Overview

Lecture 81 Ingest Data for Specific Time Intervals

Lecture 82 Tumbling Window Trigger Dependencies

Lecture 83 Links and Resources

Section 10: Authentication and Access

Lecture 84 Overview

Lecture 85 Azure Key Vault

Lecture 86 System Assigned Managed Identities

Lecture 87 User Assigned Managed Identities

Lecture 88 Service Principals

Lecture 89 Fine Grained Access Control for Storage Accounts

Lecture 90 Links and Resources

Anyone interested in Azure or Cloud Data Engineering,Anyone interested in working with Big Data,Anyone working with cloud data