sas-training-courses
8420  Reviews star_rate star_rate star_rate star_rate star_half

Introduction to SAS(R) and Hadoop

This course teaches you how to use SAS programming methods to read, write, and manipulate Hadoop data. Base SAS methods that are covered include reading and writing raw data with the DATA step and...

Read More
$1,600 USD GSA  $1,446.35
Course Code DIAHSM
Available Formats Classroom

Overview

This course teaches you how to use SAS programming methods to read, write, and manipulate Hadoop data. Base SAS methods that are covered include reading and writing raw data with the DATA step and managing the Hadoop file system and executing Pig code from SAS via the HADOOP procedure. In addition, the SAS/ACCESS Interface to Hadoop methods that allow LIBNAME access and SQL pass-through techniques to read and write Hadoop Hive table structures are discussed. Although not covered in detail, a brief overview of additional SAS and Hadoop technologies, including executing DS2 in Hadoop with the SAS Code Accelerator for Hadoop and using Hadoop data sources for the distributed in-memory analytics platform of SAS Viya, is included. This course is included in the Expert Exchange on Hadoop: Using SAS/ACCESS service offering to configure SAS/ACCESS Interface to Hadoop or SAS/ACCESS Interface to Impala to work with your Hadoop environment.

Skills Gained

  • Read and write Hadoop files with the FILENAME statement.
  • Execute and use Hadoop commands with the HADOOP procedure.
  • Invoke the execution of Pig programs in Hadoop within a SAS program.
  • Access Hadoop distributions using the LIBNAME statement and the SQL pass-through facility.
  • Create and use SQL procedure pass-through queries.
  • Use options and efficiency techniques for optimizing data access performance.
  • Join data using the SQL procedure and the DATA step.
  • Use Base SAS procedures with Hadoop.
  • Modify DS2 programs to execute in-database in Hadoop.
  • Use data in Hadoop as disk storage for SAS Viya in-memory tables.

Who Can Benefit

  • SAS programmers that need to access data in Hadoop from within SAS

Prerequisites

  • Before attending this course, you should be comfortable programming in SAS and Structured Query Language (SQL). You can gain the required SAS programming knowledge from the SAS® Programming 1: Essentials course. You can gain the required knowledge of SQL from the SAS® SQL 1: Essentials course. A working knowledge of Hadoop is helpful.

Course Details

Introduction

  • What is Hadoop?
  • How SAS interfaces with Hadoop.

Accessing HDFS and Invoking Hadoop Applications from SAS

  • Overview of methods available in Base SAS for interacting with Hadoop.
  • Reading and writing Hadoop files using Base SAS.
  • Methods.
  • Executing MapReduce code.
  • Executing Pig code using PROC HADOOP.

Using the SQL Pass-Through Facility

  • Understand the SQL procedure pass-through facility.
  • Connecting to a Hadoop Hive database.
  • Learning methods to query Hive tables.
  • Investigating Hadoop Hive metadata.
  • Creating SQL procedure pass-through queries.
  • Creating and loading Hive tables with SQL pass-through EXECUTE statements.
  • Handling Hive STRING data types.

Using the SAS/ACCESS LIBNAME Engine

  • Using the LIBNAME statement for Hadoop.
  • Using data set options.
  • Creating views.
  • Combining tables.
  • Benefits of the LIBNAME method.
  • Using the HDMD procedure to access delimited data, XML data, and other non-Hive formats.
  • Performance considerations for the SAS/ACCESS LIBNAME statement.
  • Copying data from a SAS library to a Hive library.

Partitioning and Clustering Hive Tables

  • Identifying partitioning, clustering, and indexing methods in Hive.
  • Understanding how partitioning and clustering can increase query performance.
  • Creating and loading partitioned and clustered Hive tables.

SAS Viya and Hadoop and SAS Code Accelerator for Hadoop

  • Loading and saving CAS tables to HDFS.
  • Executing DS2 threads in Hadoop to summarize data.
  • Using the HDMD procedure to access HDFS files.

Overview of SAS In-Memory Analytics and the Code Accelerator for Hadoop (Self-Study)

  • Using high-performance procedures and the SASHDAT library engine.
  • Creating a SAS LASR Analytic Server session.
  • Using the SASIOLA engine.
  • Executing DS2 threads in the Hadoop cluster to summarize data.
  • Using PROC HDMD to access HDFS files.

Schedule

FAQ

Does the course schedule include a Lunchbreak?

Classes typically include a 1-hour lunch break around midday. However, the exact break times and duration can vary depending on the specific class. Your instructor will provide detailed information at the start of the course.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does Ascendient Learning deliver group training?

Yes, we provide training for groups, individuals and private on sites. View our group training page for more information.

What does vendor-authorized training mean?

As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.

Is the training too basic, or will you go deep into technology?

It depends on your requirements, your role in your company, and your depth of knowledge. The good news about many of our learning paths, you can start from the fundamentals to highly specialized training.

How up-to-date are your courses and support materials?

We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Ascendient Learning instructors have an average of 27 years of practical IT experience and have also served as consultants for an average of 15 years. To stay current, instructors spend at least 25 percent of their time learning new, emerging technologies and courses.

Do you provide hands-on training and exercises in an actual lab environment?

Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.

Will you customize the training for our company’s specific needs and goals?

We will work with you to identify training needs and areas of growth.  We offer a variety of training methods, such as private group training, on-site of your choice, and virtually. We provide courses and certifications that are aligned with your business goals.

How do I get started with certification?

Getting started on a certification pathway depends on your goals and the vendor you choose to get certified in. Many vendors offer entry-level IT certification to advanced IT certification that can boost your career. To get access to certification vouchers and discounts, please contact info@ascendientlearning.com.

Will I get access to content after I complete a course?

You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.

How do I request a W9 for Ascendient Learning?

View our filing status and how to request a W9.

Reviews

Sean is the very good instructor. I would like to take his class again in the future.

I was very satisfied about how the course was organized. Sean Did a very good work

Labs and the study materials provided for Architecting on AWS course are very easy to understand and explains all the topics required to pass the Associate certification.

ExitCertified provided us with a great opportunity to learn more about React and in easy to follow way.

Great company -- easy to sign up and very organized. Loved my teacher and class overall.