NVLogo wht bg v2
8621  Reviews star_rate star_rate star_rate star_rate star_half

Adding New Knowledge to LLMs

Large Language Models (LLMs) are powerful, but their knowledge is often general-purpose and may lack the specific, up-to-date, or specialized information required for enterprise applications. The...

Read More
$500 USD
Duration 1 day
Course Code NV-ANK-LLM
Available Formats Classroom

Overview

Large Language Models (LLMs) are powerful, but their knowledge is often general-purpose and may lack the specific, up-to-date, or specialized information required for enterprise applications. The "Adding Knowledge to LLMs" workshop provides a comprehensive, hands-on guide to the essential techniques for augmenting and customizing LLMs.
This workshop takes you on a complete journey from raw data to a fine-tuned, optimized model. You will begin by learning how to curate high-quality datasets and generate synthetic data with NVIDIA NeMo Curator. Next, you will dive deep into the crucial process of model evaluation, using benchmarks, LLM-as-a-judge, and the NeMo Evaluator to rigorously assess model performance. With a solid foundation in evaluation, you will then explore a suite of powerful customization techniques, including Continued Pretraining to inject new knowledge, Supervised Fine-Tuning to teach new skills, and Direct Preference Optimization (DPO) to align model behavior with human preferences.
Finally, you will learn to make your customized models efficient for real-world deployment by exploring essential optimization techniques like quantization, pruning, and knowledge distillation using TensorRT-LLM and the NeMo framework. The workshop culminates in a hands-on assessment where you will apply your new skills to align an LLM to a specific conversational style, solidifying your ability to tailor models for any application.

Skills Gained

By participating in this workshop, participants will be equipped to:

  • Curate high-quality datasets and generate synthetic data using NVIDIA NeMo Curator.
  • Rigorously evaluate LLM performance with benchmarks (MMLU), LLM-as-a-judge, and the NeMo Evaluator.
  • Inject new domain-specific knowledge into LLMs using Continued Pretraining (CPT).
  • Teach LLMs new skills and align them to specific tasks with Supervised Fine-Tuning (SFT).
  • Align model behavior to human preferences for style, tone, and safety using Direct Preference Optimization (DPO).
  • Compress and optimize LLMs for efficient deployment using Quantization, Pruning, and Knowledge Distillation with TensorRT-LLM and NeMo.
  • Apply end-to-end model customization workflows to solve real-world problems.

Prerequisites

  • Familiarity with Python programming and Jupyter notebooks.
  • Basic understanding of Large Language Models and their applications.
  • Conceptual knowledge of deep learning and neural networks.

Course Details

Topics Covered

In service of teaching and demonstrating how to add knowledge to and customize LLMs for enterprise use, this workshop will cover the following topics and technologies:

  • Data Curation and Synthetic Data Generation
  • Advanced LLM Evaluation Techniques (including LLM-as-a-Judge and ELO)
  • Continued Pretraining (CPT) for Knowledge Injection
  • Supervised Fine-Tuning (SFT) for Skill Acquisition
  • Direct Preference Optimization (DPO) for Behavioral Alignment
  • Model Optimization: Quantization, Pruning, and Knowledge Distillation
  • NVIDIA NeMo Framework, NeMo Curator, NeMo Evaluator, and NeMo-RL
  • TensorRT-LLM for High-Performance Inference

Data Curation and Synthetic Data Generation

  • Learn to prepare large-scale, high-quality datasets using NVIDIA NeMo Curator.
  • Perform essential data curation tasks: text cleaning, filtering, and PII removal.
  • Generate high-quality synthetic Question-Answer pairs to create robust datasets for Supervised Fine-Tuning (SFT).
  • Understand the importance of data quality in the LLM development lifecycle.

Evaluating Large Language Models

  • Explore multiple LLM evaluation techniques, from simple "eyeballing" to systematic, quantitative methods.
  • Evaluate models against industry-standard benchmarks like MMLU.
  • Implement LLM-as-a-judge for nuanced, automated evaluation.
  • Use the NeMo Evaluator microservice to compare zero-shot vs. few-shot (in-context learning) performance.
  • Track and visualize evaluation experiments using MLflow.

Customizing LLMs

  • Dive into three key customization techniques: CPT, SFT, and DPO.
  • Use Continued Pretraining (CPT) to teach a model new knowledge about a specific domain.
  • Apply Supervised Fine-Tuning (SFT) to teach a model new skills, such as solving math problems in a different language.
  • Utilize Direct Preference Optimization (DPO) to align a model's conversational style to human preferences (e.g., formal vs. informal, specific dialects).
  • Gain hands-on experience with the NeMo framework for all customization tasks.

Optimizing LLMs for Deployment

  • Learn to compress and accelerate LLMs for efficient inference.
  • Apply Post-Training Quantization (PTQ) to reduce model size and memory usage using TensorRT-LLM, focusing on the FP8 format.
  • Use Depth Pruning to reduce model size by removing entire layers.
  • Employ Knowledge Distillation to recover performance lost during pruning by training a smaller "student" model to mimic a larger "teacher" model.
  • Evaluate the performance vs. accuracy trade-offs of each optimization technique.

Interactive Assessment

  • Apply your knowledge in a hands-on coding assessment.
  • Use Direct Preference Optimization (DPO) to align a Llama 3.1 8B model to a unique conversational style (Shakespearean English).
  • Demonstrate your ability to prepare a preference dataset, run an alignment job with NeMo-RL, and evaluate the final model.
  • Earn a certificate of competency by successfully completing the assessment.

Schedule

FAQ

Does the course schedule include a Lunchbreak?

Classes typically include a 1-hour lunch break around midday. However, the exact break times and duration can vary depending on the specific class. Your instructor will provide detailed information at the start of the course.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does Ascendient Learning deliver group training?

Yes, we provide training for groups, individuals and private on sites. View our group training page for more information.

What does vendor-authorized training mean?

As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.

Is the training too basic, or will you go deep into technology?

It depends on your requirements, your role in your company, and your depth of knowledge. The good news about many of our learning paths, you can start from the fundamentals to highly specialized training.

How up-to-date are your courses and support materials?

We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Ascendient Learning instructors have an average of 27 years of practical IT experience and have also served as consultants for an average of 15 years. To stay current, instructors spend at least 25 percent of their time learning new, emerging technologies and courses.

Do you provide hands-on training and exercises in an actual lab environment?

Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.

Will you customize the training for our company’s specific needs and goals?

We will work with you to identify training needs and areas of growth.  We offer a variety of training methods, such as private group training, on-site of your choice, and virtually. We provide courses and certifications that are aligned with your business goals.

How do I get started with certification?

Getting started on a certification pathway depends on your goals and the vendor you choose to get certified in. Many vendors offer entry-level IT certification to advanced IT certification that can boost your career. To get access to certification vouchers and discounts, please contact info@ascendientlearning.com.

Will I get access to content after I complete a course?

You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.

How do I request a W9 for Ascendient Learning?

View our filing status and how to request a W9.

Reviews

Both course material and instructor demonstrated a sound foundation on Maximo material

Instructor, Training material & span of the training is neatly planned.

I think the platform is very good and look forward to taking my next course in early October.

The instructor really took his time and made sure I was able to understand the concepts.

Very interactive and in-depth course that really got me ready for the industry