8515 Reviews (rated 4.5 out of 5 stars)

Small Language Models for Production Applications

Duration: 2 days
Course Code: GAI-2105
Available Formats: Classroom

Overview

Learn to deploy and optimize Small Language Models (SLMs) for real-world applications. This hands-on course guides participants through selecting, deploying, and fine-tuning compact AI models that deliver efficient performance at lower cost. Participants learn practical techniques for prompt engineering, quantization, fine-tuning with LoRA, and building RAG systems optimized for SLMs, while understanding the critical trade-offs between model size and capability.

Skills Gained

By the end of this course, participants will be able to:

  • Understand SLM capabilities and optimal use cases vs large models
  • Deploy quantized models locally with optimized memory footprint
  • Engineer effective prompts tailored to SLM constraints
  • Evaluate when to fine-tune vs prompt engineer for specific tasks
  • Build lightweight RAG systems with appropriate retrieval strategies
  • Benchmark and evaluate SLM performance for production deployment

Who Can Benefit

  • ML Engineers
  • Data Scientists
  • Software Developers
  • Technical Architects

Prerequisites

  • Ascendient Learning's Customizing Generative AI Models course or equivalent knowledge of LLMs
  • Basic prompt engineering skills
  • Some exposure to developing simple applications with LLMs

Course Details

Software

All attendees must have a modern web browser and an Internet connection.

What Are SLMs & Why They Matter

  • Size Taxonomy and Model Categories
  • Comparing Small and Large Models
  • Parameters, Context Length, Cost, and Latency
  • Use Cases for Edge Deployment, Privacy, and Cost Optimization
  • Trade-offs in Capability and Performance

Deployment Modalities & Quantization

  • Deployment Options for SLMs
  • Quantization Techniques for Inference
  • Memory Requirements by Model Size and Quantization
  • Hardware Requirements and Considerations
  • Tools for SLM Deployment

Task Design & Prompt Engineering for SLMs

  • Task Suitability Framework for SLMs
  • Context Length Constraints and Management
  • Chunking, Summarization, and Prompt Compression
  • Prompt Optimization Techniques
  • Evaluating Prompt Effectiveness

Fine-Tuning with LoRA/QLoRA

  • When to Fine-Tune vs Prompt Engineer
  • LoRA Mechanics and Benefits
  • Training Efficiency with Parameter-Efficient Methods
  • QLoRA for Quantized Training
  • Dataset Requirements and Overfitting Prevention

Retrieval Augmentation & Tool Use

  • Lightweight RAG Architecture for SLMs
  • Retrieval Quality vs Context Budget Trade-offs
  • Optimal Chunking Strategies
  • Retrieval Parameters and Reranking
  • Tool Augmentation for Enhanced Capabilities
  • Hybrid Architectures with Model Routing

Evaluation, Benchmarking & Production Considerations

  • Evaluation Metrics for SLMs
  • SLM-Specific Failure Modes
  • A/B Testing Strategy and Gradual Rollout
  • Emerging Trends in Small Language Models

FAQ

Does the course schedule include a lunch break?

Classes typically include a 1-hour lunch break around midday. However, the exact break times and duration can vary depending on the specific class. Your instructor will provide detailed information at the start of the course.

What languages are used to deliver training?

Most courses are conducted in English unless otherwise specified. Some courses have the word "FRENCH" marked in red beside the scheduled date(s), indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does Ascendient Learning deliver group training?

Yes, we provide training for groups and individuals, as well as private on-site training. View our group training page for more information.

What does vendor-authorized training mean?

As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.

Is the training too basic, or will you go deep into technology?

It depends on your requirements, your role in your company, and your depth of knowledge. The good news is that many of our learning paths let you progress from the fundamentals to highly specialized training.

How up-to-date are your courses and support materials?

We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Ascendient Learning instructors have an average of 27 years of practical IT experience and have also served as consultants for an average of 15 years. To stay current, instructors spend at least 25 percent of their time learning new, emerging technologies and courses.

Do you provide hands-on training and exercises in an actual lab environment?

Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.

Will you customize the training for our company’s specific needs and goals?

We will work with you to identify training needs and areas of growth. We offer a variety of training formats, such as private group training, on-site training at a location of your choice, and virtual delivery. We provide courses and certifications that are aligned with your business goals.

How do I get started with certification?

Getting started on a certification pathway depends on your goals and the vendor you choose to get certified with. Many vendors offer certifications ranging from entry-level to advanced that can boost your career. To get access to certification vouchers and discounts, please contact info@ascendientlearning.com.

Will I get access to content after I complete a course?

You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.

How do I request a W9 for Ascendient Learning?

View our filing status and how to request a W9.

Reviews

The platform is very intuitive and easy to navigate. Great tool for learning.

Very good company. I've done technical trainings at their facility in downtown Montreal in the past and I've always appreciated them.

I was very pleased with the course setup by ExitCertified and the instructor.

Overall ExitCertified is a great training provider and the remote learning is as effective as in person.

Simply great training provider that I can go for updating/acquiring my skill sets.