8411 Reviews (4.5 out of 5 stars)

Analyzing with Cloudera Data Warehouse

$3,520 USD
Duration 4 days
Course Code DANA-262
Available Formats Classroom

Overview

This four-day Analyzing with Cloudera Data Warehouse course teaches you to apply traditional data analytics and business intelligence skills to big data. It presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages.

Skills Gained

Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the ecosystem, learning how to:

  • Use Apache Hive and Apache Impala to access data through queries
  • Identify distinctions between Hive and Impala, such as differences in syntax, data formats, and supported features
  • Write and execute queries that use functions, aggregate functions, and subqueries
  • Use joins and unions to combine datasets
  • Create, modify, and delete tables, views, and databases
  • Load data into tables and store query results
  • Select file formats and develop partitioning schemes for better performance
  • Use analytic and windowing functions to gain insight into their data
  • Store and query complex or nested data structures
  • Process and analyze semi-structured and unstructured data
  • Optimize and extend the capabilities of Hive and Impala
  • Determine whether Hive, Impala, an RDBMS, or a mix of these is the best choice for a given task
  • Utilize the benefits of CDP Public Cloud Data Warehouse

Who Can Benefit

This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators.

Prerequisites

Some knowledge of SQL is assumed, as is basic Linux command-line familiarity.

Course Details

Foundations for Big Data Analytics

  • Big Data Analytics Overview
  • Data Storage: HDFS
  • Distributed Data Processing: YARN, MapReduce, and Spark
  • Data Processing and Analysis: Hive and Impala
  • Database Integration: Sqoop
  • Other Data Tools
  • Exercise Scenario Explanation

Introduction to Apache Hive and Impala

  • What Is Hive?
  • What Is Impala?
  • Why Use Hive and Impala?
  • Schema and Data Storage
  • Comparing Hive and Impala to Traditional Databases
  • Use Cases

Querying with Apache Hive and Impala

  • Databases and Tables
  • Basic Hive and Impala Query Language Syntax
  • Data Types
  • Using Hue to Execute Queries
  • Using Beeline (Hive's Shell)
  • Using the Impala Shell

Common Operators and Built-In Functions

  • Operators
  • Scalar Functions
  • Aggregate Functions

Analyzing Text

  • Using Regular Expressions with Hive and Impala
  • Processing Text Data with SerDes in Hive
  • Sentiment Analysis and n-grams in Hive

Apache Hive Optimization

  • Understanding Query Performance
  • Cost-Based Optimization and Statistics
  • Bucketing
  • ORC File Optimizations

Apache Impala Optimization

  • How Impala Executes Queries
  • Improving Impala Performance

Extending Hive and Impala

  • User-Defined Functions
  • Parameterized Queries

Data Management

  • Data Storage
  • Creating Databases and Tables
  • Loading Data
  • Altering Databases and Tables
  • Simplifying Queries with Views
  • Storing Query Results

Data Storage and Performance

  • Partitioning Tables
  • Loading Data into Partitioned Tables
  • When to Use Partitioning
  • Choosing a File Format
  • Using Avro and Parquet File Formats

Working with Multiple Datasets

  • UNION and Joins
  • Handling NULL Values in Joins
  • Advanced Joins

Analytic Functions and Windowing

  • Using Analytic Functions
  • Other Analytic Functions
  • Sliding Windows

Complex Data

  • Complex Data with Hive
  • Complex Data with Impala

Choosing the Best Tool for the Job

  • Comparing Hive, Impala, and Relational Databases
  • Which to Choose?

CDP Public Cloud Data Warehouse

  • Data Warehouse Overview
  • Auto-Scaling
  • Managing Virtual Warehouses
  • Querying Data Using CLI and Third-Party Integration

Appendix: Apache Kudu

  • What Is Kudu?
  • Kudu Tables
  • Using Impala with Kudu

FAQ

Does the course schedule include a lunch break?

Classes typically include a 1-hour lunch break around midday. However, the exact break times and duration can vary depending on the specific class. Your instructor will provide detailed information at the start of the course.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does Ascendient Learning deliver group training?

Yes, we provide training for groups and individuals, as well as private on-site training. View our group training page for more information.

What does vendor-authorized training mean?

As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.

Is the training too basic, or will you go deep into technology?

It depends on your requirements, your role in your company, and your depth of knowledge. The good news is that many of our learning paths let you start with the fundamentals and progress to highly specialized training.

How up-to-date are your courses and support materials?

We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Ascendient Learning instructors have an average of 27 years of practical IT experience and have also served as consultants for an average of 15 years. To stay current, instructors spend at least 25 percent of their time learning new, emerging technologies and courses.

Do you provide hands-on training and exercises in an actual lab environment?

Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.

Will you customize the training for our company’s specific needs and goals?

We will work with you to identify training needs and areas of growth. We offer a variety of delivery methods, such as private group training, on-site training at a location of your choice, and virtual training. We provide courses and certifications that are aligned with your business goals.

How do I get started with certification?

Getting started on a certification pathway depends on your goals and the vendor you choose to get certified with. Many vendors offer IT certifications ranging from entry-level to advanced that can boost your career. To get access to certification vouchers and discounts, please contact info@ascendientlearning.com.

Will I get access to content after I complete a course?

You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.

How do I request a W9 for Ascendient Learning?

View our filing status and how to request a W9.

Reviews

The instructor really took his time and made sure I was able to understand the concepts.

The training was good but needed the basic skills of Maximo before getting deep in the configuration of it.

Very interactive and in-depth course that really got me ready for the industry

Sean is a very good instructor. I would like to take his class again in the future.

I thought the course was informative and the tools to go over the material were very nice.