8621 Reviews, rated 4.5 out of 5 stars

Data Analytics using Azure Databricks

Price: $1,495 USD
Duration: 2 days
Course Code: WA3714
Available Formats: Classroom

Overview

This Azure Databricks training teaches learners proven, real-world techniques to leverage the power of cloud data engineering and analytics on the Microsoft Azure platform. Learners explore fundamental Big Data principles, the practical applications of Apache Spark, and hands-on utilization of Azure Databricks for scalable data engineering and analysis. This comprehensive, hands-on course focuses on practical skills, giving learners the knowledge and confidence to integrate data lake storage, master Delta Lake fundamentals, manage databases, and apply advanced techniques for data analysis, pipeline automation, and performance optimization.

Skills Gained

  • Master the foundational concepts of Big Data, Data Warehousing, and ETL/ELT processes.
  • Explain the role and architecture of Apache Spark and the overall structure and components of the Azure Databricks environment.
  • Utilize Azure Databricks Workspaces and Notebooks to manage compute clusters and execute queries using Databricks SQL and Magic Commands.
  • Implement the Data Lakehouse architecture and apply Unity Catalog for robust data governance, access, and discoverability.
  • Work with Delta Lake, demonstrating proficiency in creating and manipulating data objects (tables, views, UDFs), performing DML operations, and leveraging versioning and Time Travel.
  • Conduct Exploratory Data Analysis (EDA), create and share AI/BI Dashboards, and automate data workloads using Databricks Jobs and Pipelines.

Who Can Benefit

This course is designed for data engineers, analysts, and other professionals, from beginner to intermediate level, who want to strengthen their skills in cloud data engineering with Azure Databricks.

Prerequisites

A basic understanding of SQL and Python is helpful.

Course Details

Software Requirements

  • A computer with an internet connection is required
  • A remote lab VM with an Azure account will be provided to the participants

Big Data and Data Warehousing

  • What is Big Data?
  • Traditional Data Warehouse vs Cloud Data Warehouse
  • Data Storage Types and Formats
  • Data Roles and Tools
  • ETL and ELT Processes

Apache Spark and Azure Databricks

  • What is Spark?
  • Spark Architecture and Workflow
  • Azure Databricks Overview
  • Azure Databricks Architecture and Data Sources

Azure Databricks Environment

  • The Azure Databricks Main Page
  • Compute Options and Clusters
  • Workspaces and Notebooks
  • Databricks SQL and Magic Commands

Data Lakehouses and Unity Catalog in Azure Databricks

  • Understanding Data Lakehouse
  • Unity Catalog and Data Governance
  • Data Access and Discoverability

Azure Databricks Databases and Tables

  • Data Object Hierarchy and Tables
  • Table Partitioning
  • Creating Tables with PySpark

Delta Lake in Azure Databricks

  • Delta Lake and Parquet
  • Delta Transaction Log and ACID
  • Versioning and Time Travel

Azure Databricks Objects

  • Views and Materialized Views
  • User Defined Functions (UDFs)
  • Built-in Functions and Security

DML Operations in Azure Databricks

  • Performing INSERT, UPDATE, DELETE
  • Joining Tables with SQL and PySpark
  • SQL Differences Between T-SQL and Databricks

EDA and AI/BI Dashboards in Azure Databricks

  • Data Visualization Options
  • Exploratory Data Analysis (EDA)
  • Creating and Sharing Dashboards

Automation with Jobs & Pipelines in Azure Databricks

  • Pipeline and Job Automation
  • Creating and Scheduling Jobs
  • Sample Code and Business Cases

Azure Databricks Performance Monitoring and Optimization

  • Monitoring with Spark UI
  • Performance Optimization Techniques

Azure Databricks Assistant

  • Understanding the Assistant
  • Using the Assistant in Notebooks and Dashboards

FAQ

How do I get a Microsoft exam voucher?

Pearson VUE exam vouchers can be requested and ordered with your course purchase, or ordered separately.

  • Vouchers are non-refundable and non-returnable. Vouchers expire 12 months from the date they are issued unless otherwise specified in the terms and conditions.
  • Voucher expiration dates cannot be extended. The exam must be taken by the expiration date printed on the voucher.

Do Microsoft courses come with post-lab access?

Most official Microsoft courses include post-lab access ranging from 30 to 180 calendar days after the instructor-led course delivery. Each attendee receives a lab training key in class that can be used to keep connecting to the remote lab environment.

Does the course schedule include a lunch break?

Lunch is normally an hour long and starts after 3 to 3.5 hours of the class day.

What languages are used to deliver training?

Microsoft courses are conducted in English unless otherwise specified.

Reviews

The training was good, but it required basic Maximo skills before getting deep into its configuration.

Overall it was a good bootcamp. A lot to cover so it is understandable that the pace had to be a little fast.

It was good and very informative. The instructor covered everything in detail.

I was very pleased with the course setup by ExitCertified and the instructor.

Sean is a very good instructor. I would like to take his class again in the future.