8293  Reviews star_rate star_rate star_rate star_rate star_half

Data Science Using Python Deep Dive

This Comprehensive Data Science with Python training course is ideal for engineers, data scientists, statisticians, and other quantitative professionals looking to hone their Python programming...

Read More
$2,995 USD
Duration 5 days
Course Code WA3364
Available Formats Classroom

Overview

This Comprehensive Data Science with Python training course is ideal for engineers, data scientists, statisticians, and other quantitative professionals looking to hone their Python programming skills. Our experienced instructors will guide you through all the basics, helping you to become a proficient Python programmer.

Skills Gained

  • Understand the basic Python types, collections, and control flow.
  • Learn how to use NumPy for matrix computing and data analysis.
  • Master the fundamentals of Pandas for data manipulation and exploration.
  • Apply exploratory data analysis techniques to visualize and understand data.
  • Implement inferential statistics in Python to test hypotheses and make predictions.

Prerequisites

All attendees should have prior programming experience and an understanding of basic statistics.

Course Details

Outline

An Accelerated Introduction and Overview to Python for Data Science Foundations

  • Introduction to course and computing environment
  • Up and running with Jupyter notebooks
  • Fundamental Python types: String literals, numeric, Boolean, and dates
  • Understanding Python ‘variables’ (reference assignment)
  • Slicing syntax
  • Fundamental collections: tuples, lists, dictionaries, and sets
  • Control flow iteration in Python (if/then, for, while, list comprehension)
  • Writing your own functions
  • Handling exceptions

Matrix Computing with NumPy

  • Introduction to the ndarray
  • Dtypes in NumPy
  • NumPy operations, uFuncs
  • Broadcasting
  • Missing data in NumPy (masked array)
  • Random number generation

Managing, Exploring, and Cleaning Data with Pandas

  • Fundamental Pandas: Series and DataFrames
  • Exploring objects with attributes/methods
  • Importing data from different structured sources
  • Basic DataFrame summaries
  • Creating new variables (columns)
  • Scaling and standardizing data elements
  • Discretizing continuous data
  • Mapping categorical data to new values
  • Establishing dummy codes (one hot encoding)
  • Filtering rows and selecting columns
  • Managing the indices
  • Identifying duplicate rows
  • Quantifying and managing missing data
  • Combining datasets
  • Merging datasets
  • Transposing datasets
  • Changing data from long to wide formats and back

Exploratory Data Analysis with Pandas (including visualization with Seaborn)

  • Univariate Statistical Summaries and Detecting Outliers, visually with graphical approaches and numerically.
  • Multivariate Statistical Summaries and Outlier Detection, visually with graphical approaches and numerically.
  • Groupwise calculations
  • Pivot Table type operations to aggregate by group
  • Pandas DataFrame plotting methods

Data Pseudo-Coding Process, Extension to Data-Centric Problems

  • Identifying data verbs
  • Answering a question using a well-formatted analytic dataframe
  • Understanding the unit of analysis
  • Identifying the unit of analysis for a given question – is my dataframe organized this way?
  • Leveraging normalized data to create the analytic dataframe through combinations of data verbs
  • Identify the question and unit of analysis
  • Define the desired analytic dataframe
  • Examine the normalized source data
  • Create data pseudo-code to map source data to the final analytic dataframe
  • Implement with Python

Focus on Graphics with Python: Seaborn, Matplotlib, and Plotly

  • Using seaborn for 1 and 2 variable summaries
  • Advanced statistical plots with Seaborn
  • Controlling plot details through Seaborn
  • Making graphs interactive with Plotly
  • Introduction to Matplotlib for full control of parameters

Overview of Descriptive versus Inferential Analytics

  • Identifying the null hypothesis
  • P-value interpretation
  • The idea of statistical power and type 1/2 errors

Implementing Inferential Statistics in Python

  • Analyzing an A/B randomized test:
  • T-tests/ANOVA
  • Chi-square tests
  • Correlation methods

Multivariate Models: Linear Regression

  • Estimating the mean
  • Identifying p-values of interest
  • Adding a categorical predictor and the link to t-tests
  • Nonlinear trends: Polynomial regression and spline modeling
  • Interaction terms
  • Confounding
  • Model building approaches (choosing the best model)
  • Scoring new data from the model (making predictions)

Multivariate Models: Logistic Regression

  • GLMs and the link function
  • Understanding the logit function
  • The binomial distribution and
  • Recovering the average event probability from the model
  • Interpreting the coefficient – the odds ratio
  • Categorical predictors and the connection to the chi-square test
  • Expansion to more complex models (non-linear trends, multiple predictors)
  • Confounding
  • Interaction terms
  • Making predictions
  • Comparing models and picking the ‘best’ model

Conclusion

Optional modules depending on student interest and timing

Analyzing unstructured data with Python

  • Overview of structure versus unstructured data
  • Implementing regular expressions in Python
  • Converting unstructured data to structured data for analysis

Missing Data

  • Exploring and understanding patterns in missing data
  • Missing at Random
  • Missing Not at Random
  • Missing Completely at Random
  • Data imputation methods

Schedule

FAQ

Does the course schedule include a Lunchbreak?

Classes typically include a 1-hour lunch break around midday. However, the exact break times and duration can vary depending on the specific class. Your instructor will provide detailed information at the start of the course.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does Ascendient Learning deliver group training?

Yes, we provide training for groups, individuals and private on sites. View our group training page for more information.

What does vendor-authorized training mean?

As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.

Is the training too basic, or will you go deep into technology?

It depends on your requirements, your role in your company, and your depth of knowledge. The good news about many of our learning paths, you can start from the fundamentals to highly specialized training.

How up-to-date are your courses and support materials?

We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Ascendient Learning instructors have an average of 27 years of practical IT experience and have also served as consultants for an average of 15 years. To stay current, instructors spend at least 25 percent of their time learning new, emerging technologies and courses.

Do you provide hands-on training and exercises in an actual lab environment?

Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.

Will you customize the training for our company’s specific needs and goals?

We will work with you to identify training needs and areas of growth.  We offer a variety of training methods, such as private group training, on-site of your choice, and virtually. We provide courses and certifications that are aligned with your business goals.

How do I get started with certification?

Getting started on a certification pathway depends on your goals and the vendor you choose to get certified in. Many vendors offer entry-level IT certification to advanced IT certification that can boost your career. To get access to certification vouchers and discounts, please contact info@ascendientlearning.com.

Will I get access to content after I complete a course?

You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.

How do I request a W9 for Ascendient Learning?

View our filing status and how to request a W9.

Reviews

Course was great and informative. The instructor had a good flow and was very personable.

The class covered the concepts needed for the AWS Cloud Practitioner Certification.

I registered a day before class and am happy that I received all the materials and links in time for the class. Thanks.

Concise and good to follow along. Although it is a lot to take in under a short period of time.

Sean is the very good instructor. I would like to take his class again in the future.