8781  Reviews star_rate star_rate star_rate star_rate star_half

Data Analytics using Azure Databricks

This Azure Databricks training teaches learners proven, real-world techniques to leverage the power of cloud data engineering and analytics on the Microsoft Azure platform. Learners explore...

Read More
$1,495 USD
Duration 2 days
Course Code WA3714
Available Formats Classroom

Overview

Course Description

This Azure Databricks training teaches learners proven, real-world techniques to leverage the power of cloud data engineering and analytics on the Microsoft Azure platform. Learners explore fundamental Big Data principles, the practical applications of Apache Spark, and hands-on utilization of Azure Databricks for scalable data engineering and analysis. This comprehensive, hands-on course focuses on practical skills, giving learners the knowledge and confidence to integrate data lake storage, master Delta Lake fundamentals, manage databases, and apply advanced techniques for data analysis, pipeline automation, and performance optimization.

Skills Gained

  • Master the foundational concepts of Big Data, Data Warehousing, and ETL/ELT processes.
  • Explain the role and architecture of Apache Spark and the overall structure and components of the Azure Databricks environment.
  • Utilize Azure Databricks Workspaces and Notebooks to manage compute clusters and execute queries using Databricks SQL and Magic Commands.
  • Implement the Data Lakehouse architecture and apply Unity Catalog for robust data governance, access, and discoverability.
  • Work with Delta Lake, demonstrating proficiency in creating and manipulating data objects (tables, views, UDFs), performing DML operations, and leveraging versioning and Time Travel.
  • Conduct Exploratory Data Analysis (EDA), create and share AI/BI Dashboards, and automate data workloads using Databricks Jobs and Pipelines.

Who Can Benefit

This course is designed for data engineers, analysts, and professionals seeking to enhance their skills in cloud data engineering with Azure Databricks, spanning from beginners to intermediate-level learners.

Prerequisites

A basic understanding of SQL and Python is helpful.

Software Requirements

  • A computer with an internet connection is required
  • A remote lab VM with an Azure account will be provided to the participants

Course Details

Course Details

Big Data and Data Warehousing

  • What is Big Data?
  • Traditional Data Warehouse vs Cloud Data Warehouse
  • Data Storage Types and Formats
  • Data Roles and Tools
  • ETL and ELT Processes

Apache Spark and Azure Databricks

  • What is Spark?
  • Spark Architecture and Workflow
  • Azure Databricks Overview
  • Azure Databricks Architecture and Data Sources

Azure Databricks Environment

  • The Azure Databricks Main Page
  • Compute Options and Clusters
  • Workspaces and Notebooks
  • Databricks SQL and Magic Commands

Data Lakehouses and Unity Catalog in Azure Databricks

  • Understanding Data Lakehouse
  • Unity Catalog and Data Governance
  • Data Access and Discoverability

Azure Databricks Databases and Tables

  • Data Object Hierarchy and Tables
  • Table Partitioning
  • Creating Tables with PySpark

Delta Lake in Azure Databricks

  • Delta Lake and Parquet
  • Delta Transaction Log and ACID
  • Versioning and Time Travel

Azure Databricks Objects

  • Views and Materialized Views
  • User Defined Functions (UDFs)
  • Built-in Functions and Security

DML Operations in Azure Databricks

  • Performing INSERT, UPDATE, DELETE
  • Joining Tables with SQL and PySpark
  • SQL Differences in T-SQL and Databricks

EDA and AI/BI Dashboards in Azure Databricks

  • Data Visualization Options
  • Exploratory Data Analysis (EDA)
  • Creating and Sharing Dashboards

Automation with Jobs & Pipelines in Azure Databricks

  • Pipeline and Job Automation
  • Creating and Scheduling Jobs
  • Sample Code and Business Cases

Azure Databricks Performance Monitoring and Optimization

  • Monitoring with Spark UI
  • Performance Optimization Techniques

Azure Databricks Assistant

  • Understanding the Assistant
  • Using the Assistant in Notebooks and Dashboards

Schedule

FAQ

How do I get a Microsoft exam voucher?

Pearson Vue Exam vouchers can be requested and ordered with your course purchase or can be ordered separately by clicking here.

  • Vouchers are non-refundable and non-returnable. Vouchers expire 12 months from the date they are issued unless otherwise specified in the terms and conditions.
  • Voucher expiration dates cannot be extended. The exam must be taken by the expiration date printed on the voucher.

Do Microsoft courses come with post lab access?

Most Microsoft official courses will include post-lab access ranging from 30 to 180 calendar days after instructor led course delivery. A lab training key in class will be provided that can be leveraged to continue connecting to a remote lab environment for the individual course attendee.

Does the course schedule include a Lunchbreak?

Lunch is normally an hour-long after 3-3.5 hours of the class day.

What languages are used to deliver training?

Microsoft courses are conducted in English unless otherwise specified.

Reviews

my experince was great from the day i regetered to the actuall day of the class.

Instructor was great, course was mostly very good except for too much focus on pricing

I found this course informative. It was easy to follow and provided some good information.

This was a good program to get prepared for the solutions architect associate exam.

Fantastic and great training. Tons of hands-on labs to really make you understand the material being thought.